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Preface 


Goal 


Quaternion algebras sit prominently at the intersection of many mathematical 
subjects. They capture essential features of noncommutative ring theory, number 
theory, K-theory, group theory, geometric topology, Lie theory, functions of a 
complex variable, spectral theory of Riemannian manifolds, arithmetic geometry, 
representation theory, the Langlands program—and the list goes on. Quaternion 
algebras are especially fruitful to study because they often reflect some of the 
general aspects of these subjects, while at the same time they remain amenable to 
concrete argumentation. Moreover, quaternions often encapsulate unique features 
that are absent from the general theory (even as they provide motivation for it). 

With this in mind, the main goal in writing this text is to introduce a large subset 
of the above topics to graduate students interested in algebra, geometry, and number 
theory. To get the most out of reading this text, readers will likely want to have 
been exposed to some algebraic number theory, commutative algebra (e.g., module 
theory, localization, and tensor products), as well as the fundamentals of linear 
algebra, topology, and complex analysis. For certain sections, further experience 
with objects in differential geometry or arithmetic geometry (e.g., Riemannian 
manifolds and elliptic curves), may be useful. With these prerequisites in mind, I 
have endeavored to present the material in the simplest, motivated version—full of 
rich interconnections and illustrative examples—so even if the reader is missing a 
piece of background, it can be quickly filled in. 

Unfortunately, this text only scratches the surface of most of the topics covered 
in the book! In particular, some appearances of quaternion algebras in arithmetic 
geometry that are dear to me are absent, as they would substantially extend the 
length and scope of this already long book. I hope that the presentation herein will 
serve as a foundation upon which a detailed and more specialized treatment of these 
topics will be possible. 

I have tried to maximize exposition of ideas and minimize technicality: 
sometimes I allow a quick and dirty proof, but sometimes the “right level of 
generality” (where things can be seen most clearly) is pretty abstract. So my efforts 
have resulted in a level of exposition that is occasionally uneven jumping between 
sections. I consider this a feature of the book, and I hope that the reader will agree 
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and feel free to skip around (see How to use this book below). I tried to “reboot” at 
the beginning of each part and again at the beginning of each chapter, to refresh our 
motivation. For researchers working with quaternion algebras, I have tried to collect 
results otherwise scattered in the literature and to provide some clarifications, 
corrections, and complete proofs in the hope that this text will provide a convenient 
reference. In order to provide these features, to the extent possible I have opted for 
an organizational pattern that is “horizontal” rather than “vertical”: the text has 
many chapters, each representing a different slice of the theory. 

I tried to compactify the text as much as possible, without sacrificing com- 
pleteness. There were a few occasions when I thought a topic could use further 
elaboration or has evolved from the existing literature, but did not want to over- 
burden the text; I collected these in a supplementary text Quaternion algebras 
companion, available at the website for the text at http://quatalg.org. 

As usual, each chapter also contains a number of exercises at the end, ranging 
from checking basic facts used in a proof to more difficult problems that stretch the 
reader. Exercises that are used in the text are marked by >. For a subset of exercises 
(including many of those marked with >), there are hints, comments, or a complete 
solution available online. 


How to Use This Book 


With apologies to Whitman, this book is large, it contains multitudes—and hope- 
fully, it does not contradict itself! 

There is no obligation to read the book linearly cover to cover, and the reader is 
encouraged to find their own path, such as the following. 


1. For an introductory survey course on quaternion algebras, read just the 
introductory sections in each chapter, those labeled with >, and supplement 
with sections from the text when interested. These introductions usually 
contain motivation and a summary of the results in the rest of the chapter, 
and I often restrict the level of generality or make simplifying hypotheses so 
that the main ideas are made plain. The reader who wants to quickly and 
gently grab hold of the basic concepts may digest the book in this way. The 
instructor may desire to fill in some further statements or proofs to make for a 
one semester course: Chapters 1, 2, 11, 25, and 35 could be fruitfully read in 
their entirety. 

2. For a minicourse in noncommutative algebra with emphasis on quaternion 
algebras, read just Part I. Such an early graduate course would have minimal 
prerequisites and in a semester could be executed at a considered pace; it 
would provide the foundation for further study in many possible directions. 

3. For quaternion algebras and algebraic number theory, read Part I and II. This 
course would be a nice second-semester addition following a standard 
first-semester course in algebraic number theory, suitable for graduate stu- 
dents in algebra and number theory who are motivated to study quaternion 


Preface Vii 


algebras as “noncommutative quadratic fields”. For a lighter course, 
Chapters 6, 20, and 21 could be skipped, and the instructor may opt to cover 
only the introductory section of a chapter for reasons of time and interest. To 
reinforce concepts from algebraic number theory, special emphasis could be 
placed on Chapter 13 (where local division algebras are treated like local 
fields) and 18 (where maximal orders are treated like noncommutative 
Dedekind domains). 


There are also more specialized options, beginning with the introductory sec- 
tions in Part I and continuing as follows: 


4. For quaternion algebras and analytic number theory, continue with the 
introductory sections in Part II Gust Chapters 9-17), and then cover Part III (at 
least through Chapter 29). This course could follow a first-semester course in 
analytic number theory, enriching students' understanding of zeta functions 
and L-functions (roughly speaking, beginning the move from GL; to GL). 
The additional prerequisite of real analysis (measure theory) is recommended. 
Optionally, this course could break after Chapter 26 to avoid adeles, and 
perhaps resume in an advanced topics course with the remaining chapters. 

5. For quaternionic applications to geometry (specifically, hyperbolic geometry 
and low-dimensional topology), continue with the introductory sections in 
Part II (through Chapter 14), and then cover Part IV (optionally skipping 
Chapter 32). 

6. For an advanced course on quaternion algebras and arithmetic geometry, 
continue with Part II, the introductory sections in Parts IV and V. Chapter 41 
could be read immediately after Part II. This path is probably most appro- 
priate for an advanced course for students with some familiarity with mod- 
ular forms and some hyperbolic geometry, and Chapter 42 is probably only 
meaningful for students with a background in elliptic curves (though the 
relevant concepts are reviewed at the start). 

7. Finally, for the reader who is studying quaternion algebras with an eye to 
applications with supersingular elliptic curves, the reader may follow Chapters 
2—4, 9-10, 13-14, 16-17, 23, then the main event in Chapter 42. For further 
reading on quaternion orders and ternary quadratic forms, I suggest Chapters 5, 
22, and 24. 


Sections of the text that are more advanced (requiring more background) or 
those may be omitted are labeled with x. The final chapter (Chapter 43) is neces- 
sarily more advanced, and additional prerequisites in algebraic and arithmetic 
geometry are indicated. 

It is a unique feature of quaternion algebras that topics overlap and fold together 
like this, and so I hope the reader will forgive the length of the book. The reader 
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may find the symbol definition list at the end to help in identifying unfamiliar 
notation. Finally, to ease in location, I have chosen to number all objects 
(theorem-like environments, equations, and figures) consecutively. 


Companion Reading 


Several general texts can serve as companion reading for this monograph: 


e The lecture notes of Vignéras [Vig80a] have been an essential reference for the 
arithmetic of quaternion algebras since their publication. 

e The seminal text by Reiner [Rei2003] on maximal orders treats many intro- 
ductory topics that overlap this text. 

e The book of Maclachlan—Reid [MR2003] gives an introduction to quaternion 
algebras with application to the geometry of 3-manifolds. 

e The book by Deuring [Deu68] (in German) develops the theory of algebras over 
fields, culminating in the treatment of zeta functions of division algebras over 
the rationals, and may be of historical interest as well. 

e Finally, Pizer [Piz76a] and Alsina—Bayer [AB2004] present arithmetic and 
algorithmic aspects of quaternion algebras over Q. 
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Chapter 1 


Introduction 


We begin following the historical arc of quaternion algebras and tracing their impact 
on the development of mathematics. Our account is selective: for further overview, see 
Lam [Lam2003] and Lewis [Lew2006a]. 


1.1 Hamilton’s quaternions 


In perhaps the “most famous act of mathematical vandalism”, on October 16, 1843, Sir 
William Rowan Hamilton (1805-1865, Figure 1.1.2) carved the following equations 
into the Brougham Bridge (now Broom Bridge) in Dublin: 


pep ok =ijk=-l1. (1.1.1) 


His discovery was a defining moment in the history of algebra (Figure 1.1.3). 


Figure 1.1.2: William Rowan Hamilton 
(public domain; scan by Wellesley College Library) 
© The Author(s) 2021 1 


J. Voight, Quaternion Algebras, Graduate Texts in Mathematics 288, 
https://doi.org/10.1007/978-3-030-56694-4_1 
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For at least ten years (on and off), Hamilton had been attempting to model (real) 
three-dimensional space with a structure like the complex numbers, whose addition 
and multiplication occur in two-dimensional space. Just like the complex numbers had 
a “real” and “imaginary” part, so too did Hamilton hope to find an algebraic system 
whose elements had a “real” and two-dimensional “imaginary” part. In the early part 
of the month of October 1843, his sons Archibald Henry and William Edwin Hamilton, 
while still very young, would ask their father at breakfast [Ham67, p. xv]: “Well, papa, 
can you multiply triplets?” To which Hamilton would reply, “with a sad shake of the 
head, ‘No, I can only add and subtract them’” [Ham67, p. xv]. For a history of the 
“multiplying triplets” problem—the nonexistence of division algebra over the reals of 
dimension 3—see May [May66, p. 290]. 


part of the 2012 Dublin castle exhibition, Irish Science 
(© Daniel Doyle; reproduced with permission) 


Then, on the dramatic day in 1843, Hamilton’s had a flash of insight [Ham67, 
p. xv—xvi], which he described in a letter to Archibald (written in 1865): 


On the 16th day of [October]—which happened to be a Monday, and a 
Council day of the Royal Irish Academy—I was walking in to attend and 
preside, and your mother was walking with me, along the Royal Canal, to 
which she had perhaps driven; and although she talked with me now and 
then, yet an under-current of thought was going on in my mind, which 
gave at last a result, whereof it is not too much to say that I felt at once 
the importance. An electric circuit seemed to close; and a spark flashed 
forth, the herald (as I foresaw, immediately) of many long years to come 
of definitely directed thought and work, by myself if spared, and at all 
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events on the part of others, if I should even be allowed to live long 
enough distinctly to communicate the discovery. Nor could I resist the 
impulse—unphilosophical as it may have been—to cut with a knife on a 
stone of Brougham Bridge, as we passed it, the fundamental formula with 
the symbols, i, j, k; namely, 


i? = j? =k? =ijk =-1 


which contains the Solution of the Problem, but of course, as an inscrip- 
tion, has long since mouldered away. 


In this moment, Hamilton realized that he needed a fourth dimension; he later 
coined the term quaternions for the real space spanned by the elements 1, i, j, k, subject 
to his multiplication laws. He presented his theory of quaternions to the Royal Irish 
Academy in a paper entitled “On a new Species of Imaginary Quantities connected 
with a theory of Quaternions” [Ham1843]. Today, we denote this algebra H := R + 
Ri + Rj + Rk and call H the ring of Hamilton quaternions in his honor. 

This charming story of quaternionic discovery remains in the popular conscious- 
ness, and to commemorate Hamilton’s discovery of the quaternions, there is an annual 
“Hamilton walk” in Dublin [0Ca2010]. Although his carvings have long since worn 
away, a plaque on the bridge now commemorates this significant event in mathematical 
history (Figure 1.1.4). 


Here as he NATTY by ‘ 
on the toth of October 1843 
a iiaa Rowan Hèrttren 

flash of genius discovered 


Figure 1.1.4: The Broom Bridge plaque (author’s photo) 


For more on the history of Hamilton’s discovery, see the extensive and detailed 
accounts of Dickson [Dic19] and Van der Waerden [vdW76]. There are also three 
main biographies written about the life of William Rowan Hamilton, a man sometimes 
referred to as “Ireland’s greatest mathematician”: by Graves [Grav1882, Grav1885, 
Grav 1889] in three volumes, Hankins [Hankin80], and O’ Donnell [O’ D083]. Numer- 
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ous other shorter biographies have been written [DM89, Lanc67, OCa2000]. (Certain 
aspects of Hamilton’s private life deserve a more positive portrayal, however: see Van 
Weerden—Wepster [WW2018].) 


160 ELEMENTS OF QUATERNIONS, [Book 11. 


the laws of i, j, k agree with usual and algebraic laws: namely, 
in the Associative Property of Multiplication; or in the pro- 
perty that the new symbols always obey the associative for- 
mula (comp. 9), 
t-KA =k. À, 
whichever of them may be substituted for :, for x, and for A; 
in virtue of which equality of values we may omit the point, in 
any such symbol of a ternary product (whether of equal or of 
unequal factors), and write it simply as «A. In particular 
we have thus, 
ijkeiie®e-1; ijkekkek=-1l; 


or briefly, 
ijk=-1. 
We may, therefore, by 182, establish the following important 
Formula : 
Pape d= jhe -1; (A) 


to which we shall occasionally refer, as to ‘* Formula A,” and 
which we shall find to contain (virtually) all the laws of the 
symbols ijk, and therefore to be a sufficient symbolical basis 
for the whole Calculus of Quaternions :* because it will be 
shown that every quaternion can be reduced to the Quadrino- 
mial Form, . 
q=w+iz + jy+ kz, 

where w, 2, y, z compose a system of four scalars, while i, j, k 
are the same three right versors as above. 


(1.) A direct proof of the equation, ijk =— 1, may be derived from the definitions 
of the symbols in Art. 181. In fact, we have only to remember that those defini- 
tions were seen to give, 


© This formula (A) was accordingly made the basis of that Calculus in the first 
communication on the subject, by the present writer, to the Royal Irish Academy in 
1848 ; and the letters, i, j, 4, continued to be, for some time, the only peculiar sym- 
bols of the calculus in question. But it was gradually found to be useful to incor- 
porate with these a few other notations (such as K and U, &c.), for representing 
Operations on Quaternions. It was also thought to be instructive to establish the 
principles of that Calculus, on a more geometrical (or less exclusively symbolical) 
foundation than at first; which was accordingly afterwards done, in the volume en- 
titled: Lectures on Quaternions (Dublin, 1858); and is again attempted in the pre~ 
sent work, although with many differences in the adopted plan of exposition, and in 
the applications brought forward, or suppressed. 


Figure 1.1.5: A page from Hamilton’s Elements of quaternions [Ham1866] 
(public domain) 


There are several precursors to Hamilton’s discovery that bear mentioning. First, 
the quaternion multiplication laws are already implicit in the four-square identity of 
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Leonhard Euler (1707-1783): 


(a? +a? +a? +a? + bZ + b2 +b?) = e? tetee 
(a,b, — ab — a3b3 — a4b4) + (a1b2 + azb; + azb4 — a4b3) (1.1.6) 
+ (a,b3 — azb4 + a3bı + a4b2} + (ayb4 + azb; — a3bz + a4b,)°. 
Indeed, the full multiplication law for quaternions reads precisely 
(ay + ai + a3j + a4k)(bı + boi + b3j + bak) = c1 + c2i + c3j + c4k 


with c1, C2, C3, c4 as defined in (1.1.6); the four-square identity corresponds to taking a 
norm on both sides. 

It was perhaps Carl Friedrich Gauss (1777—1855) who first observed this connec- 
tion. In a note dated around 1819 [Gau00], he interpreted the formula (1.1.6) as a 
way of composing real quadruples: to the quadruples (a1, a2, a3, a4) and (b1, b2, b3, b4) 
in R4, he defined the composite tuple (c1, c2, C3, C4) and noted the noncommutativity 
of this operation. Gauss elected not to publish these findings (as he chose not to do 
with many of his discoveries). In letters to De Morgan [Grav1885, Grav1889, p. 330, 
p. 490], Hamilton attacks the allegation that Gauss had discovered quaternions first. 

Finally, Olinde Rodrigues (1795-1851) (of the Rodrigues formula for Legendre 
polynomials) gave a formula for the angle and axis of a rotation in R3 obtained 
from two successive rotations—essentially giving a different parametrization of the 
quaternions—but had left mathematics for banking long before the publication of his 
paper [Rod1840]. The story of Rodrigues and the quaternions is given by Altmann 
[Alt89] and Pujol [Puj2012], and the fuller story of his life is recounted by Altmann- 
Ortiz [AO2005]. See also the description by Pujol [Puj2014] of Hamilton’s derivation 
of the relation between rotations and quaternions from 1847, set in historical context. 

In any case, the quaternions consumed the rest of Hamilton’s academic life and 
resulted in the publication of two bulky treatises [Ham1853, Ham1866] (see also the 
review [Ham1899]). Hamilton’s mathematical writing over these years, an example of 
which can be found in Figure 1.1.5, was at times opaque; nevertheless, many physicists 
used quaternions extensively and for a long time in the mid-19th century, quaternions 
were an essential notion in physics. 

Other figures contemporaneous with Hamilton were also developing vectorial sys- 
tems, most notably Hermann Grassmann (1809-1877) [Gras1862]. The modern notion 
of vectors was developed by Willard Gibbs (1839-1903) and Oliver Heaviside (1850- 
1925), independently. In 1881 and 1884 (in two halves), Gibbs introduced in a pamphlet 
Elements of Vector Analysis the now standard vector notation of the cross product and 
dot product, with the splendid equality 


vw=-v:-wt+vxXw (1.1.7) 


for v,w € Ri+Rj+Rk c H relating quaternionic multiplication on the left to dot and 
cross products on the right. (The equality (1.1.7) also appears in Hamilton’s work, but in 
different notation.) Gibbs did not consider the quaternion product to be a “fundamental 
notion in vector analysis” [Gib1891, p. 512], and argued for a vector analysis that 
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would apply in arbitrary dimension; on the relationship between these works, Gibbs 
wrote after learning of the work of Grassmann: “I saw that the methods whfich] I 
was using, while nearly those of Hamilton, were almost exactly those of Grassmann” 
[Whe62, p. 108]. For more on the history of quaternionic and vector calculus, see 
Crowe [Cro64] and Simons [Sim2010]. 

The rivalry between physical notations flared into a war in the latter part of the 19th 
century between the “quaternionists’ and the “vectorists’, and for some the preference 
of one system versus the other became an almost partisan split. On the side of quater- 
nions, James Clerk Maxwell (1831-1879), who derived the equations which describe 
electromagnetic fields, wrote [Max 1869, p. 226]: 


The invention of the calculus of quaternions is a step towards the knowl- 
edge of quantities related to space which can only be compared, for its 
importance, with the invention of triple coordinates by Descartes. The 
ideas of this calculus, as distinguished from its operations and symbols, 
are fitted to be of the greatest use in all parts of science. 


And Peter Tait (1831-1901), Hamilton’s “chief disciple” [Hankin80, p. 316], wrote in 
1890 [Tail890] decrying notation and attacking Willard Gibbs (1839-1903): 


It is disappointing to find how little progress has recently been made 
with the development of Quaternions. One cause, which has been spe- 
cially active in France, is that workers at the subject have been more 
intent on modifying the notation, or the mode of presentation of the 
fundamental principles, than on extending the applications of the Calcu- 
lus. ... Even Prof. Willard Gibbs must be ranked as one the retarders of 
quaternions progress, in virtue of his pamphlet on Vector Analysis, a sort 
of hermaphrodite monster, compounded of the notation of Hamilton and 
Grassman. 


Game on! On the vectorist side, Lord Kelvin (a.k.a. William Thomson, who formulated 
the laws of thermodynamics), said in an 1892 letter to R. B. Hayward about his textbook 
in algebra (quoted in Thompson [Tho10, p. 1070)]): 


Quaternions came from Hamilton after his really good work had been 
done; and, though beautifully ingenious, have been an unmixed evil to 
those who have touched them in any way, including Clerk Maxwell. 


(There is also a rompous fictionalized account by Pynchon in his tome Against the Day 
[Pyn2006].) Ultimately, the superiority and generality of vector notation carried the 
day, and only certain useful fragments of Hamilton’s quaternionic notation—e.g., the 
“right-hand rule” i x j = k in multivariable calculus—remain in modern usage. 


1.2 Algebra after the quaternions 


The debut of Hamilton’s quaternions was met with some resistance in the mathematical 
world: it proposed a system of “numbers” that did not satisfy the usual commutative 
rule of multiplication. Quaternions predated even the notion of matrices, introduced in 
1855 by Arthur Cayley (1821-1895). Hamilton’s bold proposal of a noncommutative 
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multiplication law was the harbinger of a burgeoning array of algebraic structures. In 
the words of J.J. Sylvester [Syl1883, pp. 271-272]: 


In Quaternions (which, as will presently be seen, are but the simplest 
order of matrices viewed under a particular aspect) the example had been 
given of Algebra released from the yoke of the commutative principle 
of multiplication—an emancipation somewhat akin to Lobachevsky’s of 
Geometry from Euclid’s noted empirical axiom; and later on, the Peirces, 
father and son (but subsequently to 1858) had prefigured the universal- 
ization of Hamilton’s theory, and had emitted an opinion to the effect that 
probably all systems of algebraical symbols subject to the associative law 
of multiplication would be eventually found to be identical with linear 
transformations of schemata susceptible of matriculate representation. 


So with the introduction of the quaternions, the floodgates of algebraic possibility had 
been opened. See Happel [Hap80] for an overview of the early development of algebra 
following Hamilton’s quaternions, as well as the more general history given by Van 
der Waerden [vdW85, Chapters 10-11]. 

The day after his discovery, Hamilton sent a letter [Ham1844] describing the 
quaternions to his friend John T. Graves (1806-1870). Graves replied on October 26, 
1843, with his compliments, but added: 


There is still something in the system which gravels me. I have not yet any 
clear views as to the extent to which we are at liberty arbitrarily to create 


imaginaries, and to endow them with supernatural properties. ... If with 
your alchemy you can make three pounds of gold, why should you stop 
there? 


Following through on this invitation, on December 26, 1843, Graves wrote to Hamilton 
that he had successfully generalized the quaternions to the “octaves”, now called 
octonions ©, an algebra in eight dimensions, with which he was able to prove that the 
product of two sums of eight perfect squares is another sum of eight perfect squares, 
a formula generalizing (1.1.6). In fact, Hamilton first invented the term associative in 
1844, around the time of his correspondence with Graves. Unfortunately for Graves, the 
octonions were discovered independently and published in 1845 by Cayley [Cay 1845b], 
who often is credited for their discovery. (Even worse, the eight squares identity was 
also previously discovered by C. F. Degen.) For a more complete account of this 
story and the relationships between quaternions and octonions, see the survey article 
by Baez [Bae2002], the article by Van der Blij [vdB60], and the delightful book by 
Conway-Smith [CSm2003]. 

Cayley also studied quaternions themselves [Cay 1845a] and was able to reinterpret 
them as arising from a doubling process, also called the Cayley—Dickson construction, 
which starting from R produces C then H then O, taking the ordered, commuta- 
tive, associative algebra R and progressively deleting one adjective at a time. So 
algebras were first studied over the real and complex numbers and were accordingly 
called hypercomplex numbers in the late 19th and early 20th century. And this theory 
flourished. Hamilton himself considered the algebra over C defined by his famous 
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equations (1.1.1), calling them biquaternions. In 1878, Ferdinand Frobenius (1849- 
1917) proved that the only finite-dimensional associative real division algebras are R, 
C, and H [Fro1878]. This result was also proven independently by C.S. Peirce, the 
son of Benjamin Peirce, below. Adolf Hurwitz (1859-1919) later showed that the only 
normed finite-dimensional not-necessarily-associative real division algebras are R, C, 
H, and O. (The same statement is true without the condition that the algebra be normed, 
but currently the proofs use topology, not algebra! Bott—Milnor [BM58] and Kervaire 
[Ker58] proved that the (n — 1)-dimensional sphere {x € R” : ix? = 1} has trivial 
tangent bundle if and only if there is an n-dimensional not-necessarily-associative 
real division algebra if and only if n = 1,2,4,8. The solution to the Hopf invariant 
one problem by Adams also implies this result; an elegant and concise proof using K- 
theory, Adams operations, and elementary number theory was given by Adams—Atiyah 
[AA66]. See Hirzebruch [Hir91] or Ranicki [Ran2011] for a more complete account.) 


In another attempt to seek a generalization of the quaternions to higher dimension, 
William Clifford (1845-1879) developed a way to build algebras from quadratic forms 
in 1876 [Clil878]. Clifford constructed what we now call a Clifford algebra C(V) 
associated to V = R” (with the standard Euclidean norm); it is an algebra of dimension 
2” containing V with multiplication induced from the relation x? = —||x||* for all 
x € V. We have C(R!) = C and C(R?) = H, so the Hamilton quaternions arise as a 
Clifford algebra—but C(R*) is not the octonions. The theory of Clifford algebras tightly 
connects the theory of quadratic forms and the theory of normed division algebras and 
its impact extends in many mathematical directions. For more on the history of Clifford 
algebras, see Diek—Kantowski [DK95]. 


A further physically motivated generalization was pursued by Alexander Macfar- 
lane (1851-1913): he developed a theory of what he called hyperbolic quaternions 
[Macf00] (a revised version of an earlier, nonassociative attempt [Macf1891]), with 
the multiplication laws 


ref sk el 


J = 
ij =V-1k =-ji, jk =V-li=-kj, ki=V-1j =-ik. 


Thought of as an algebra over C = R(V-—1), Macfarlane’s hyperbolic quaternions 
are isomorphic to Hamilton’s biquaternions (and therefore isomorphic to M2(C)). 
Moreover, the restriction of the norm to the real span of the basis 1, i, j, k in Macfarlane’s 
algebra is a quadratic form of signature (1, 3): this gives a quaternionic version of space- 
time, something also known as Minkowski space (but with Macfarlane’s construction 
predating that of Minkowski). For more on the history and further connections, see 
Crowe [Cro64]. 


Around this time, other types of algebras over the real numbers were also being 
investigated, the most significant of which were Lie algebras. In the seminal work 
of Sophus Lie (1842-1899), group actions on manifolds were understood by looking 
at this action infinitesimally; one thereby obtains a Lie algebra of vector fields that 
determines the local group action. The simplest nontrivial example of a Lie algebra 
is the cross product of two vectors, related to quaternion multiplication in (1.1.7): it 
defines, a linear, alternating, but nonassociative binary operation on R? that satisfies 


(1.2.1) 
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the Jacobi identity emblematized by 
ixGxk)+kx@xjp+jx(k xi) =0. (1.2.2) 


The Lie algebra “linearizes” the group action and is therefore more accessible. Wilhelm 
Killing (1847-1923) initiated the study of the classification of Lie algebras in a series 
of papers [Kil1888], and this work was completed by Elie Cartan (1869-1951). We 
refer to Hawkins [Haw2000] for a description of this rich series of developments. 

In this way, the study of division algebras gradually evolved, independent of phys- 
ical interpretations. Benjamin Peirce (1809-1880) in 1870 developed what he called 
linear associative algebras [Pei1882]; he provided a decomposition of an algebra rel- 
ative to an idempotent (his terminology). The first definition of an algebra over an 
arbitrary field seems to have been given by Leonard E. Dickson (1874-1954) [Dic03]: 
at first he still called the resulting object a system of complex numbers and only later 
adopted the name linear algebra. 

The notion of a simple algebra had been discovered by Cartan, and Theodor Molien 
(1861-1941) had earlier shown in his terminology that every simple algebra over the 
complex numbers is a matrix algebra [Mol1893]. But it was Joseph Henry Maclagan 
Wedderburn (1882—1948) who was the first to find meaning in the structure of simple 
algebras over an arbitrary field, in many ways leading the way forward. The jewel 
of his 1908 paper [Wed08] is still foundational in the structure theory of algebras: a 
simple algebra (finite-dimensional over a field) is isomorphic to a matrix ring over a 
division ring. Wedderburn also proved that a finite division ring is a field, a result that 
like his structure theorem has inspired much mathematics. For more on the legacy of 
Wedderburn, see Artin [Art50]. 

In the early 1900s, Dickson was the first to consider quaternion algebras over a 
general field [Dic 12, (8), p. 65]. He began by considering more generally those algebras 
in which every element satisfies a quadratic equation [Dic12], exhibited a diagonalized 
basis for such an algebra, and considered when such an algebra can be a division 
algebra. This led him to multiplication laws for what he later called a generalized 
quaternion algebra [Dic 14, Dic23], with multiplication laws 


P =a, f=b, k? =-ab, 

af roa ; ee a (1.2.3) 

ij=k=-ji, ik=aj=-ki, kj =bi=-jk 
with a,b nonzero elements in the base field. (To keep track of these, it is helpful 
to write i, j,k around a circle clockwise.) Today, we no longer employ the adjective 
“generalized”— over fields other than R, there is no reason to privilege the Hamiltonian 
quaternions—and we can reinterpret this vein of Dickson’s work as showing that every 
4-dimensional central simple algebra is a quaternion algebra (a statement that holds 
even over a field F with char F = 2). See Fenster [Fen98] for a summary of Dickson’s 
work in algebra, and Lewis [Lew2006b] for a broad survey of the role of involutions 
and anti-automorphisms in the classification of algebras. 
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1.3 Quadratic forms and arithmetic 


Hamilton’s quaternions also fused a link between quadratic forms and arithmetic, 
phrased in the language of noncommutative algebra. Indeed, part of Dickson’s interest 
in quaternion algebras stemmed from earlier work of Hurwitz [Hur1898], alluded to 
above. Hurwitz had asked for generalizations of the composition laws arising from sum 
of squares laws like that of Euler (1.1.6) for four squares and Cayley for eight squares: 
for which n does there exist an identity 


(ait + abit t+ Pyacit te 


with each c; bilinear in the variables a and b? He then proved [Hur1898] that over a 
field where 2 is invertible, these identities exist only for n = 1,2,4,8 variables (so in 
particular, there is no formula expressing the product of two sums of 16 squares as 
the sum of 16 squares). As Dickson [Dic19] further explained, this result of Hurwitz 
is intimately tied to the theory of algebras. For more on compositions of quadratic 
forms and their history, including theorems of Hurwitz—Radon and Pfister, see Shapiro 
[Sha90]. 

Thinking along similar lines, Hurwitz gave a new proof of the four-square theorem 
of Lagrange, that every positive integer is the sum of four integer squares: he first wrote 
about this in 1896 on quaternionic number theory (“Uber die Zahlentheorie der Quater- 
nionen’”) [Hur1896], then published a short book on the subject in 1919 [Hur19]. To 
this end, Hurwitz considered Hamilton’s equations over the rational numbers and said 
that a quaternion t+ xi+ yj +zk with t, x, y, z € Q was an integer if t, x, y, z all belonged 
to Z or all to 5 + Z, conditions for the quaternion to satisfy a quadratic polynomial with 
integer coefficients. Hurwitz showed that his ring of integer quaternions, today called 
the Hurwitz order, admits a generalization of the Euclidean algorithm and thereby a 
factorization theory. He then applied this to count the number of ways of representing 
an integer as the sum of four squares, a result due to Jacobi. The notion of integral 
quaternions was also explored in the 1920s by Venkov [Ven22, Ven29] and the 1930s by 
Albert [Alb34]. Dickson considered further questions of representing positive integers 
by integral quaternary quadratic forms [Dic19, Dic23, Dic24] in the same vein. 

So by the end of the 1920s, quaternion algebras were used to study quadratic 
forms in a kind of noncommutative algebraic number theory [Lat26, Gri28]. It was 
known that a (generalized) quaternion algebra (1.2.3) was semisimple in the sense of 
Wedderburn, and thus it was either a division algebra or a full matrix algebra over the 
ground field. Indeed, a quaternion algebra is a matrix algebra if and only if a certain 
ternary quadratic form has a nontrivial zero, and over the rational numbers this problem 
was already studied by Legendre. Helmut Hasse (1898-1979) reformulated Legendre’s 
conditions: a quadratic form has a nontrivial zero over the rationals if and only if it 
has a nontrivial zero over the real numbers and Hensel’s field of p-adic numbers for 
all odd primes p. This result paved the way for many further advances, and it is now 
known as the Hasse principle or the local-global principle for quadratic forms. For an 
overview of this history, see Scharlau [Scha2009, $1]. 

Further deep results in number theory were soon to follow. Dickson [Dic14] had 
defined cyclic algebras, reflecting many properties of quaternion algebras, and in 
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1929 lectures Emmy Noether (1882-1935) considered the even more general crossed 
product algebras. Not very long after, in a volume dedicated to Hensel’s seventieth 
birthday, Richard Brauer (1901-1977), Hasse, and Noether proved a fundamental 
theorem for the structure theory of algebras over number fields [BHN31]: every central 
division algebra over a number field is a cyclic algebra. This crucial statement had 
profound implications for class field theory, the classification of abelian extensions of 
a number field, with a central role played by the Brauer group of a number field, a 
group encoding its division algebras. For a detailed history and discussion of these 
lines, see Fenster-Schwermer [FS2007], Roquette [Roq2006], and the history of class 
field theory summarized by Hasse himself [Hass67]. 


At the same time, Abraham Adrian Albert (1905—1972), a doctoral student of Dick- 
son, was working on the structure of division algebras and algebras with involution, 
and he had written a full book on the subject [Alb39] collecting his work in the area, 
published in 1939. Albert had examined the tensor product of two quaternion algebras, 
called a biquaternion algebra (not to be confused with Hamilton’s biquaternions), and 
he characterized when such an algebra was a division algebra in terms of a senary (six 
variable) quadratic form. Albert’s classification of algebras with involution was moti- 
vated by understanding possible endomorphism algebras of abelian varieties, viewed 
as multiplier rings of Riemann matrices and equipped with the Rosati involution: a 
consequence of this classification is that quaternion algebras are the only noncommu- 
tative endomorphism algebras of simple abelian varieties. He also proved that a central 
simple algebra admits an involution if and only if the algebra is isomorphic to its oppo- 
site algebra (equivalently, it has order at most 2 in the Brauer group). For a biography of 
Albert and a survey of his work, see Jacobson [Jacn74]. Roquette argues convincingly 
[Roq2006, §8] that because of Albert’s contributions to its proof (for example, his work 
with Hasse [AH32]), we should refer to the Albert-Brauer—Hasse—Noether theorem in 
the previous paragraph. 


1.4 Modular forms and geometry 


Quaternion algebras also played a formative role in what began as a subfield of complex 
analysis and ordinary differential equations and then branched into the theory of 
modular forms—and ultimately became a central area of modern number theory. 
Returning to a thread from the previous section, the subject of representing numbers 
as the sum of four squares saw considerable interest in the 17th and 18th centuries 
[Dic71, Chapter VII]. Carl Jacobi (1804-1851) approached the subject from the 
analytic point of view of theta functions, the basic building blocks for elliptic functions; 
these were first studied in connection with the problem of the arc length of an ellipse, 
going back to Abel. Jacobi studied the series 
A(T) := 2. exp(2rin?t) = 1 +2q +2qf +20? +... (1.4.1) 


n=—oco 


where T is a complex number with positive imaginary part and q = exp(2ziT). Jacobi 
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proved the remarkable identity 


@(r)* = py gt htc? g A o*(n)q", (1.4.2) 
a,b,c,dEZ n=l 


where o*(n) := istajn d is the sum of divisors of n not divisible by 4. In this way, 
Jacobi gave an explicit formula for the number of ways of expressing a number as the 
sum of four squares. For a bit of history and an elementary derivation in the style of 
Gauss and Jacobi, see Ewell [Ewe82]. 

As a Fourier series, the Jacobi theta function @ (1.4.1) visibly satisfies @(7 + 1) = 
O(t). Moreover, owing to its symmetric description, Jacobi showed using Poisson 
summation that 6 also satisfies the transformation formula 


6(-1/t) = Vr/i (2). (1.4.3) 


Felix Klein (1849-1925) saw geometry in formulas like (1.4.3). In his Erlangen Pro- 
gram (1872), he recast 19th century geometry in terms of the underlying group of 
symmetries, unifying Euclidean and non-Euclidean formulations. Turning then to 
hyperbolic geometry, he studied the modular group SL2(Z) acting by linear fractional 
transformations on the upper half-plane, and interpreted transformation formulas for 
elliptic functions: in particular, Klein defined his absolute invariant J(t) [Kle1878], a 
function invariant under the modular group. Together with his student Robert Fricke 
(1861-1930), this led to four volumes [FK 1890-2, FK1897, FK12] on elliptic modular 
functions and automorphic functions, combining brilliant advances in group theory, 
number theory, geometry, and invariant theory (Figure 1.4.4). 


Figure 1.4.4: The (2, 3, 7)-tiling by Fricke and Klein [FK1890-2] 
(public domain) 
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At the same time, Henri Poincaré (1854-1912) brought in the theory of linear 
differential equations—and a different, group-theoretic approach. In correspondence 
with Fuchs in 1880 on hypergeometric differential equations, he writes about the 
beginnings of his discovery of a new class of analytic functions [Gray2000, p.177]: 


They present the greatest analogy with elliptic functions, and can be 
represented as the quotient of two infinite series in infinitely many ways. 
Amongst those series are those which are entire series playing the role of 
Theta functions. These converge in a certain circle and do not exist outside 
it, as thus does the Fuchsian function itself. Besides these functions there 
are others which play the same role as the zeta functions in the theory 
of elliptic functions, and by means of which I solve linear differential 
equations of arbitrary orders with rational coefficients whenever there are 
only two finite singular points and the roots of the three determinantal 
equations are commensurable. 


As he reminisced later in his Science et Méthode [Poil908, p. 53]: 


I then undertook to study some arithmetical questions without any great 
result appearing and without expecting that this could have the least con- 
nection with my previous researches. Disgusted with my lack of success, 
I went to spend some days at the sea-side and thought of quite different 
things. One day, walking along the cliff, the idea came to me, always with 
the same characteristics of brevity, suddenness, and immediate certainty, 
that the arithmetical transformations of ternary indefinite quadratic forms 
were identical with those of non-Euclidean geometry. 


In other words, like Klein, Poincaré launched a program to study complex analytic 
functions defined on the unit disc that are invariant with respect to a discrete group 
of matrix transformations that preserve a rational indefinite ternary quadratic form. 
Today, such groups are called arithmetic Fuchsian groups, and we study them as unit 
groups of quaternion algebras. To read more on the history of differential equations 
in the time of Riemann and Poincaré, see the history by Gray [Gray2000], as well as 
Gray’s scientific biography of Poincaré [Gray2013]. 

In the context of these profound analytic discoveries, Erich Hecke (1887-1947) 
began his study of modular forms. He studied the Dedekind zeta function, a gener- 
alization of Riemann’s zeta function to number fields, and established its functional 
equation using theta functions. In the study of similarly defined analytic functions 
arising from modular forms, he was led to define the “averaging” operators acting on 
spaces of modular forms that now bear his name. In this way, he could interpret the 
Fourier coefficients a(n) of a Hecke eigenform (normalized, weight 2) as eigenvalues 
of his operators: he proved that they satisfy a relation of the form 


a(m)a(n) = > a(mn/d?)d (1.4.5) 
d|gcd(m,n) 
and consequently a two-term recursion relation. He thereby showed that the Dirichlet 
L-series of an eigenform, defined via Mellin transform, has an Euler product, analytic 
continuation, and functional equation. 
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Hecke went further, and connected the analytic theory of modular forms and his 
operators to the arithmetic theory of quadratic forms. In 1935-1936, he found that for 
certain systems of quaternary quadratic forms, the number of representations of integers 
by the system satisfied the recursion (1.4.5), in analogy with binary quadratic forms. 
He published a conjecture on this subject in 1940 [Hec40, Satz 53, p. 100]: that the 
weighted representation numbers satisfy the Hecke recursion, connecting coefficients 
to operators on theta series, and further that the columns in a composition table always 
result in linearly independent theta series. He verified the conjecture up to prime level 
q < 37, but was not able to prove this recursion using his methods of complex analysis 
(see his letter [Bra41, Footnote 1]). 


The arithmetic part of these conjectures was investigated by Heinrich Brandt (1886— 
1954) in the quaternionic context—and so the weave of our narrative is further tightly 
sewn. Preceding Hecke’s work, and inspired by Gauss composition of binary quadratic 
forms as the product of classes of ideals in a quadratic field, Brandt had earlier 
considered a generalization to quaternary quadratic forms and the product of classes 
of ideals in a quaternion algebra [Bra28]: he was only able to define a partially defined 
product, and so he coined the term groupoid for such a structure [Bra40]. He then 
considered the combinatorial problem of counting the ways of factoring an ideal into 
prime ideals, according to their classes. In this way, he recorded these counts in a 
matrix T(n) for each positive integer n, and he proved strikingly (sketched in 1941 
[Bra41], dated 1939, and proved completely in 1943 [Bra43]) that the matrices T(n) 
satisfy Hecke’s recursion (1.4.5). To read more on the life and work of Brandt, see 
Hoehnke—Knus [HK2004]. Today we call the matrices T(n) Brandt matrices, and for 
certain purposes, they are still the most convenient way to get ahold of spaces of 
modular forms. 


Martin Eichler (1912-1992) wrote his thesis [Eic36] under the supervision of 
Brandt on quaternion orders over the integers, in particular studying the orders that 
now bear his name. Later he continued the grand synthesis of modular forms, quadratic 
forms, and quaternion algebras, viewing in generality the orthogonal group of a 
quadratic form as acting via automorphic transformations [Eic53]. In this vein, he 
formulated his basis problem (arising from the conjecture of Hecke) which sought 
to understand explicitly the span of quaternionic theta series among classical modu- 
lar forms, giving a correspondence between systems of Hecke eigenvalues appearing 
in the quaternionic and classical context. He answered the basis problem in affir- 
mative for the case of prime level in 1955 [Eic56a] and then for squarefree level 
[Eic56b, Eic58, Eic73]. For more on Eichler’s basis problem and its history, see 
Hijikata—Pizer-Shemanske [HPS89a]. 


Having come to recent history, our account now becomes much more abbreviated: 
we provide further commentary in situ in remarks in the rest of this text, and we 
conclude with just a few highlights. In the 1950s and 1960s, there was subtantial 
work done in understanding zeta functions of certain varieties arising from quaternion 
algebras over totally real number fields. For example, Eichler’s correspondence was 
generalized to totally real fields by Shimizu [Shz65]. Shimura embarked on a deep and 
systematic study of arithmetic groups obtained from indefinite quaternion algebras 
over totally real fields, including both the arithmetic Fuchsian groups of Poincaré, 
Fricke, and Klein, and the generalization of the modular group to totally real fields 
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studied by Hilbert. In addition to understanding their zeta functions, he also formulated 
a general theory of complex multiplication in terms of automorphic functions; as a 
consequence, he found the corresponding arithmetic quotients can be defined as an 
algebraic variety with equations defined over a number field—and so today we refer to 
quaternionic Shimura varieties. For an overview of Shimura’s work, see his lectures 
at the International Congress of Mathematicians in 1978 [Shi80]. As it turns out, 
quaternion algebras over number fields also give rise to arithmetic manifolds that are 
not algebraic varieties, and they are quite important in the areas of spectral theory, 
low-dimensional geometry, and topology—in particular, in Thurston’s geometrization 
program for hyperbolic 3-manifolds and in classifying knots and links. 

Just as the Hecke operators determine the coefficients of classical modular forms 
and Dirichlet L-series, they may be vastly generalized, replacing modular groups 
by other algebraic groups, such as the group of units in a central simple algebra or 
the orthogonal group of a quadratic form. Understanding the theory of automorphic 
forms in this context is a program that continues today: formulated in the language 
of automorphic representations, and seen as a nonabelian generalization of class field 
theory, Langlands initiated this program in a letter to Weil in January 1967. It is 
indeed fitting that an early success of the Langlands program [Gel84, B+2003] would 
be on the subject of quaternion algebras: a generalization of the Eichler-Shimizu 
correspondence to encompass arbitrary quaternion algebras over number fields was 
achieved in foundational work by Jacquet—Langlands [JL70] in 1970. For more on the 
modern arithmetic history of modular forms, see Edixhoven—Van der Geer-Moonen 
[EGM2008]; Alsina—Bayer [AB2004, Appendices B—C] also give references for further 
applications of quaternion algebras in arithmetic geometry (in particular, of Shimura 
curves). 


1.5 Conclusion 


We have seen how quaternion algebras have threaded mathematical history through to 
the present day, weaving together advances in algebra, quadratic forms, number theory, 
geometry, and modular forms. And although our history ends here, the story does not! 

Quaternion algebras continue to arise in unexpected ways. In the arithmetic setting, 
quaternion orders arise as endomorphism rings of supersingular elliptic curves and 
have been used in proposed post-quantum cryptosystems and digital signature schemes 
(see for example the overview by Galbraith—Vercauteren [GV2018]). In the field of 
quantum computation, Parzanchevski—Sarnak [PS2018] have proposed Super-Golden- 
Gates built from certain special quaternion algebras and their arithmetic groups that 
would give efficient 1-qubit quantum gates. In coding theory, lattices in quaternion 
algebras (and more generally central simple algebras over number fields) yield space- 
time codes that achieve high spectral efficiency on wireless channels with two transmit 
antennas, currently part of certain IEEE standards [BO2013]. 

Quaternions have also seen a revival in computer graphics, modeling, and animation 
[HFK94, Sho85]. Indeed, a rotation in R? about an axis through the origin can be 
represented by a 3 x 3 orthogonal matrix with determinant 1, conveniently encoded in 
Euler angles. However, the matrix representation is redundant, as there are only three 
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degrees of freedom in such a rotation. Moreover, to compose two rotations requires the 
product of the two corresponding matrices, which requires 27 multiplications and 18 
additions in R. Quaternions, on the other hand, represent this rotation with a 4-tuple, 
and multiplication of two quaternions takes only 16 multiplications and 12 additions 
in R (if done naively). In computer games, quaternion interpolation provides a way 
to smoothly interpolate between orientations in space—something crucial for fighting 
Nazi zombies. Quaternions are also vital for attitude control of aircraft and spacecraft 
[Hans2006]: they avoid the ambiguity that can arise when two rotation axes align, 
leading to a potentially disastrous loss of control called gimbal lock. 

In quantum physics, quaternions yield elegant expressions for Lorentz transforma- 
tions, the basis of the modern theory of relativity [Gir83]. Some physicists are now 
hoping to find deeper understanding of these principles of quantum physics in terms 
of quaternions. And so, although much of Hamilton’s quaternionic physics fell out of 
favor long ago, we have come full circle in our elongated historical arc. The enduring 
role of quaternion algebras as a catalyst for a vast range of mathematical research 
promises rewards for many years to come. 


Exercises 


1. Hamilton sought a multiplication * : R? x R? — R? that preserves length: 
2 2 2 
Ivić hol” = Iv * wll 


for v,w € R?. Expanding out in terms of coordinates, such a multiplication 
would imply that the product of the sum of three squares over R is again the sum 
of three squares in R. (Such a law holds for the sum of four squares (1.1.6).) 
Show that such a formula for three squares is impossible as an identity in the 
polynomial ring in 6 variables over Z. [Hint: Find a natural number that is the 
product of two sums of three squares which is not itself the sum of three squares. ] 


2. Hamilton originally sought an associative multiplication law on 
D:=R+Ri+Rj = R? 


where i? = —1 and every nonzero element of D has a (two-sided) inverse. Show 
this cannot happen in two (not really different) ways. 


(a) Ifij = a + bi + cj with a,b,c € R, multiply on the left by i and derive a 
contradiction. 


(b) Show that D is a (left) C-vector space, so D has even dimension as an 
R-vector space, a contradiction. 


3. Show that there is no way to give RÌ the structure of a ring (with 1) in which 
multiplication respects scalar multiplication by R, i.e., 


x+(cy) =c(x- y) =(cx)-y for all c € Rand x, y € R? 


and every nonzero element has a (two-sided) inverse, as follows. 
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(a) Suppose B := R? is equipped with a multiplication law that respects scalar 
multiplication. Show that left multiplication by œ € B is R-linear and a 
satisfies the characteristic polynomial of this linear map, a polynomial of 
degree 3. 


(b) Now suppose that every nonzero «œ € B has an inverse. By consideration 
of eigenvalues or the minimal polynomial, derive a contradiction. [Hint: 
show that the characteristic polynomial has a real eigenvalue, or that 
every a € B satisfies a (minimal) polynomial of degree 1, and derive a 
contradiction from either statement. ] 


This chapter is licensed under the terms of the Creative Commons Attribution- 
NonCommercial 4.0 International License (http://creativecommons.org/licenses/by- 
nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and 
reproduction in any medium or format, as long as you give appropriate credit to the 
original author(s) and the source, provide a link to the Creative Commons license and 
indicate if changes were made. 

The images or other third party material in this chapter are included in the chapter’s 
Creative Commons license, unless indicated otherwise in a credit line to the material. If 
material is not included in the chapter’s Creative Commons license and your intended 
use is not permitted by statutory regulation or exceeds the permitted use, you will need 
to obtain permission directly from the copyright holder. 
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Chapter 2 


Beginnings 


In this chapter, we define quaternion algebras over fields by giving a multiplication 
table, following Hamilton; we then consider the classical application of understanding 
rotations in R3. 


2.1 > Conventions 


Throughout this text (unless otherwise stated), we let F be a (commutative) field with 
algebraic closure F”. 

When G is a group, and H C G is a subset, we write H < G when H is a subgroup 
and H < G when H is a normal subgroup; if G is abelian (written multiplicatively), we 
write G? := {g" : g € G} < G for the subgroup of squares. 

We suppose throughout that all rings are associative, not necessarily commutative, 
with multiplicative identity 1, and that ring homomorphisms preserve 1. In particular, a 
subring of a ring has the same 1. For a ring A, we write A* for the multiplicative group 
of units of A. An algebra over the field F is a ring Bequipped with a homomorphism 
F — B such that the image of F lies in the center Z(B) of B, defined by 


Z(B) := {a € B : af = Ba for all 8 € B}; (2.1.1) 


if Z(B) = F, we say B is central (as an F-algebra). We write M,,(F) for the F-algebra 
of n X n-matrices with entries in F. 

One may profitably think of an F-algebra as being an F-vector space that is also 
compatibly a ring. If the F-algebra B is not the zero ring, then its structure map F —> B 
is necessarily injective (since 1 maps to 1) and we identify F with its image; keeping 
track of the structure map just litters notation. The dimension dimp B of an F-algebra 
B is its dimension as an F-vector space. 

A homomorphism of F-algebras is a ring homomorphism which restricts to the 
identity on F. An F-algebra homomorphism is necessarily F-linear. An F-algebra 
homomorphism B —> B is called an endomorphism. By convention (and as usual for 
functions), endomorphisms act on the left. An invertible F-algebra homomorphism 
B æ B’ is called an isomorphism, and an invertible endomorphism is an automor- 
phism. 
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The set of automorphisms of B forms a group, which we write as Aut(B)—these 
maps are necessarily F-linear, but we do not include this in the notation. reserve the 
notation Endz (V) for the ring of F-linear endomorphisms of the F-vector space V, 
and Autz (V) for the group of F-linear automorphisms of V; in particular, Endz (B) ~ 
M,(F) if n = dimp B. 


Remark 2.1.2. Throughout, whenever we define a homomorphism of objects, we adopt 
the (categorical) convention extending this to the terms endomorphism (homomor- 
phism with equal domain and codomain), isomorphism (invertible homomorphism), 
and automorphism (invertible endomorphism). 


A division ring (also called a skew field) is a ring D in which every nonzero 
element has a (two-sided) inverse, i.e., D\{0} is a group under multiplication. A 
division algebra is an algebra that is a division ring. 


2.2 > Quaternion algebras 


In this section, we define quaternion algebras in a direct way, via generators and 
relations. Throughout the rest of this chapter, suppose that char F + 2; the case 
char F = 2 is treated in Chapter 6. 


Definition 2.2.1. An algebra B over F is a quaternion algebra if there exist i, j € B 
such that 1, i, j,ij is an F-basis for B and 

i =a, j° =b, and ji=-ij (2.2.2) 
for some a,b € F*. 


The entire multiplication table for a quaternion algebra is determined by the mul- 

tiplication rules (2.2.2), linearity, and associativity: for example, 
GD? = GG) = iGDj = (-i)j = -@)G?) = -ab 
and j(ij) = (-ij)j = —bi. Conversely, given a,b € F*, one can write down the unique 
possible associative multiplication table on the basis 1, i, j, k compatible with (2.2.2), 
and then verify independently that it is associative (Exercise 2.1). Accordingly, for 
b 

a,b € F*, we define “| to be the quaternion algebra over F with F-basis 1, i, j, ij 
subject to the multiplication (2.2.2); we will also write (a, b | F) when convenient for 
formatting. By definition, we have dimp (a, b | F) = 4. 


F F 
so Definition 2.2.1 is symmetric in a,b. The elements a,b are far from unique in 
determining the isomorphism class of a quaternion algebra: see Exercise 2.4. 

If K 2 F isa field extension of F, then there is a canonical isomorphism 


a,b sk a,b 
ple VE 
extending scalars (same basis, but now spanning a K-vector space), so Definition 2.2.1 
behaves well with respect to inclusion of fields. 


„b b, 
The map which interchanges i and j gives an isomorphism (£ ) = | s), 
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R 
numbers, discovered by Hamilton; we call H the ring of (real) Hamiltonians (also 
known as Hamilton’s quaternions). 


1 
Example 2.2.3. The R-algebra H := is the ring of quaternions over the real 


Example 2.2.4. The ring M2(F) of 2x 2-matrices with coefficients in F is a quaternion 


algebra over F: there is an isomorphism 7 æ M>(F) of F-algebras induced by 


l 0). 9. $0. 4 
12 Ne a Nae ol 


If F = F* is algebraically closed and B is a quaternion algebra over F, then 
necessarily B ~ Mo(F) (Exercise 2.4). Consequently, every quaternion algebra B over 
F has B 8r F” = Mo(F2). 


A quaternion algebra B is generated by the elements i, j by definition (2.2.2). How- 
ever, exhibiting an algebra by generators and relations (instead of by a multiplication 
table) can be a bit subtle, as the dimension of such an algebra is not a priori clear. But 
working with presentations is quite useful; and at least for quaternion algebras, we can 
think in these terms as follows. 


Lemma 2.2.5. An F-algebra B is a quaternion algebra if and only if there exist nonzero 
i, j € B that generate B as an F-algebra and satisfy 


i =a, j’ =b, and ij = -ji (2.2.6) 


with a,b € F”. 


In other words, once the relations (2.2.6) are satisfied for generators i,j, then 
automatically B has dimension 4 as an F-vector space, with F-basis 1, i, j, ij. 


Proof. It is necessary and sufficient to prove that the elements 1,i, j,ij are linearly 
independent. Suppose that a = t + xi + yj + zij = 0 with t, x,y,z € F. Using the 
relations given, we compute that 
0 = i(ai + ia) = 2a(t + xi). 
Since char F + 2 and a # 0, we conclude that t + xi = 0. Repeating with j and ij, we 
similarly find that t+ yj = t + zij = 0. Thus 
æ — (t + xi)— (t + yj)— (t + zij) = -2t =0. 


Since i, j are nonzero, B is not the zero ring, so 1 + 0; thus t = 0 and so xi = yj = 
zij = 0. Finally, if x + 0, then i = 0 so i? =0=a, impossible; hence x = 0. Similarly, 
y=z= 0. Oo 


Accordingly, we will call elements i, j € B satisfying (2.2.6) standard generators 
for a quaternion algebra B. 
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Remark 2.2.7. Invertibility of both a and b in F is needed for Lemma 2.2.5: the 
commutative algebra B = F[i, j]/(i, j)? is generated by the elements i, j satisfying 
i? = j? =ij = —ji = 0 but B is not a quaternion algebra. 


Remark 2.2.8. In light of Lemma 2.2.5, we will often drop the symbol k = ij and 
reserve it for other use. (In particular, in later sections we will want k to represent other 
quaternion elements.) If we wish to use this abbreviation, we will assign k := ij. 


2.3 > Matrix representations 


Every quaternion algebra can be viewed as a subalgebra of 2 x 2-matrices over an at 
most quadratic extension; this is sometimes taken to be the definition! 


,b 
Proposition 2.3.1. Let B := (=) be a quaternion algebra over F and let F(a) be 
a splitting field over F for the polynomial x? — a, with root ya € F(Va). Then the map 
A: B > M(F(Va)) 
.. [va 0 0 b 
hd aa | 0 -ya \ı 0 


PE aes t+xVa b(y+zvya) 
taxis yj tain (ENE t—xVa 


is an injective F-algebra homomorphism and an isomorphism onto its image. 


(2.3.2) 


Proof. Injectivity follows by checking ker A = {0} on matrix entries, and the homo- 
morphism property can be verified directly, checking the multiplication table (Exercise 
2.10). o 


Remark 2.3.3. Proposition 2.3.1 can be turned around to assert the existence of quater- 
nion algebras: one can check that the set 


eE POD) nance r} s macrevay 


is an F-vector subspace of dimension 4, closed under multiplication, with the matrices 
A(i), A(j) satisfying the defining relations (2.2.2). 


2.3.4. Ifa ¢ F**, then K = F(a) 2 F is a quadratic extension of F. Let Gal(K | F) = 
Autr (K) ~ Z/2Z be the Galois group of K over F and let o € Gal(K |F) be the 
nontrivial element. Then we can rewrite the image A(B) in (2.3.2) as 


u bv 
A(B) = EA a [UV € x} c M2(K). (2.3.5) 
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Corollary 2.3.6. We have an isomorphism 


1 
(=$) => mcr) 


co 1 0\/0 b 

BIT lo -1P 0 
Proof. Specializing Proposition 2.3.1, we see the map is an injective F-algebra homo- 
morphism, so since dimp B = dimp M2(F) = 4, the map is also surjective. o 


(2.3.7) 


The provenance of the map (2.3.2) is itself important, so we now pursue another 
(more natural) proof of Proposition 2.3.1. 


2.3.8. Let 
K := F[i] = F @ Fi = F[x]/(x’ - a) 


be the (commutative) F-algebra generated by i. Suppose first that K is a field (so 
a ¢ F*?): then K ~ F(ya) is a quadratic field extension of F. The algebra B has the 
structure of a right K-vector space of dimension 2, with basis 1, 7: explicitly, 

œ =t+xi+yj+zij = (t+ xi)+j(y-zi E K®jK 


for all a € B, so B = K @ jK. We then define the left regular representation of B 
over K by 
A: B > Endg(B) 


ah (Aa: BR ap). 


Each map Ag is indeed a K-linear endomorphism in B (considered as a right K-vector 
space) by associativity in B: for all œ, 8 € B and w € K, 


Ad(Bw) = a(Bw) = (a@B)w = Aa(p)w. 
Similarly, 4 is an F-algebra homomorphism: for all a, 8, v € B 
Aap’) = (@B)v = a(B(v)) = Aadgy(y) 


reading functions from right to left as usual. The map 4 is injective (A is a faithful 
representation) since 2g = 0 implies Ag(1) = a = 0. 
In the basis 1, j we have Endx(B) ~ M2(K), and J is given by 


(2.3.9) 


fi 0) 2. A, 


these matrices act on column vectors on the left. We then recognize the map 4 given 
in (2.3.2). 

If K is not a field, then K ~ F x F, and we repeat the above argument but with 
B a free module of rank 2 over K; then projecting onto one of the factors (choosing 
a € F) gives the map A, which is still injective and therefore induces an F-algebra 
isomorphism B =~ Mo(F). 
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Remark 2.3.11. In Proposition 2.3.1, B acts on columns on the left; if instead, one 
wishes to have B act on the right on rows, give B the structure of a left K-vector space 
and define accordingly the right regular representation instead (taking care about the 
order of multiplication). 


2.3.12. In some circumstances, it can be notationally convenient to consider variants 
of the injection (2.3.2): for example 


B > M(F(va)) 


t+xya y+zVa (2.3.13) 
by- zvya) t-—xvVa 


t+xi+yj+zij => 


is obtained by taking the basis 1, b`! j, equivalently postcomposing by ( s): See 


also Exercise 2.12. 


Remark 2.3.14. The left regular representation 2.3.8 is not the only way to embed B 
as a subalgebra of 2 x 2-matrices. Indeed, the “splitting” of quaternion algebras in 
this way, in particular the question of whether or not B ~ M2(F), is a theme that will 
reappear throughout this text. For a preview, see Main Theorem 5.4.4. 


2.3.15. Thinking of a quaternion algebra as in 2.3.8 as a right K-vector space suggests 
notation for quaternion algebras that is also useful: for a peek, see 6.1.5. 


2.4 > Rotations 


To conclude this chapter, we return to Hamilton’s original design: quaternions model 
rotations in 3-dimensional space. This development is not only historically important 
but it also previews many aspects of the general theory of quaternion algebras over 
fields. In this section, we follow Hamilton and take k := ij. 


Proposition 2.3.1 provides an R-algebra embedding 
A: H © Endc(H) ~ M2(C) 
t+xi -y- a = Q a) (2.4.1) 


t+xi+yj+zk=u+jv=> : ; se 
IIS J as t— xl v u 


where u := t + xi and v := y + zi and `~ denotes complex conjugation. (The abuse of 
notation, taking i € H as well asi € C is harmless: we may think of C c H.) We have 


u —v 
ae" | =u +P eP +x? ty? $2, 


thus H* = HI\ {0}. If preferred, see (2.3.13) to obtain matrices of the form e i 


instead. 
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2.4.2. We define the subgroup of unit Hamiltonians as 
H! :={t+xit+yjt+zkeH:P +x? +y +2 = 1}. 


(In some contexts, one also writes GL; (H) = H* and SL; (H) = H!.) 


As a set, the unit Hamiltonians are naturally identified with the 3-sphere in R4. As 
groups, we have an isomorphism H! ~ SU(2) with the special unitary group of rank 
2, defined by 


SU(n) := {A € SL,(C) : A* = A7!} = {A € SL, (©) : JA = AJ} (2.4.3) 


where A* = A’ is the (complex) conjugate transpose of A and J := i a is the 


image of j € H!. 


Definition 2.4.4. Let a € H. We say a is real if a € R, and we say a is pure (or 
imaginary) if a € Ri + Rj + Rk. 


2.4.5. Just as for the complex numbers, every element of H is the sum of its real 
part and its pure (imaginary) part. And just like complex conjugation, we define a 
(quaternion) conjugation map 


~ H>H 
PANEN = eee (2.4.6) 
œ =t+(xi+yj+zk)= @=t-(xi+ yj + zk) 
by negating the imaginary part. We compute that 
at+a@=tr(A(a)) = 2t 
(2.4.7) 


lal? := det(A(a)) = aT = Ta = t? + x? + y? +2, 


The notation || ||? is used to indicate that it agrees the usual square norm on H = R4. 


The conjugate transpose map on M2(C) restricts to quaternion conjugation on the 
image of H in (2.4.1), also known as adjugation 


a= p z] +> A@) = p a: 


Thus the elements a € H such that A(@) = A(a) (i.e., A* = A, and we say A is 
Hermitian), are exactly the scalar (real) matrices; and those that are skew-Hermitian, 
i.e., A* = —A, are exactly the pure quaternions. The conjugation map plays a crucial 
role for quaternion algebras and is the subject of the next chapter (Chapter 3), where 
to avoid confusion with other notions of conjugation we refer to it as the standard 
involution. 


2.4.8. Let 
H? := {v = xi + yj + zk € H : x,y,z € R} = R? 
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be the set of pure Hamiltonians, the three-dimensional real space on which we will soon 
see that the (unit) Hamiltonians act by rotations. (The reader should not confuse v € H? 
with v the entry of a 2 x 2-matrix in a local instantiation above.) For v € H? = R, 


Ivl? = x? + y? +z? = detA), (2.4.9) 


and from (2.4.1), 
H? = {v € H : tr(40)) = v +7 = 0}. 


We again see that Y = —v for v € HP. 


The set H? is not closed under multiplication: if v, w € H?, then 
vw =-v:w+vVvXw (2.4.10) 


where v - w is the dot product on R? and v x w € HP is the cross product, defined as 
the determinant 


i j k 
vxw =det| vy v v3 (2.4.11) 
Wi w2 W3 
where v = vi + voj + v3k and w = wii + w2j + w3k, so 
V: W = Viw + Vo2W2 + V3W3 


and 
VXWe= (v2W3 = v3w2)i F (v3w1 = vıw3)j + (viw = vzw1)k. 


The formula (2.4.10) is striking: it contains three different kinds of ‘multiplications’ ! 


Lemma 2.4.12. For all v,w € HP, the following statements hold. 
(a) vw € H? if and only if v, w are orthogonal. 
(b) v? = Iv? € R. 


(c) wv = —vw if and only if v, w are orthogonal. 


Proof. Apply (2.4.10). o 


2.4.13. The group H! acts on our three-dimensional space H? (on the left) by conju- 
gation: 

H' OH? > H° 
(2.4.14) 


ve ava}; 


indeed, tr(A(ava™!)) = tr(A(v)) = 0 by properties of the trace, so ava! e H®. Or 
H? = {v € H : v? € Reo} 


and this latter set is visibly stable under conjugation. The representation (2.4.14) is 
called the adjoint representation. 
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2.4.15. Let a € H'\{+1}. Then there exists a unique 6 € (0, 7) such that 


œ =t+xi+ yj +zk =cosé + (sindé)I(a) (2.4.16) 
where (œ) is pure and ||Z(œ)|| = 1: to be precise, we take 8 such that cos 6 = t and 
i+yjt+zk 
Ko = Z T 
|sin 8| 


We call Z(œ) the axis of a, and observe that [(a)* = —1. 


Remark 2.4.17. In analogy with Euler’s formula, we can write (2.4.16) as 
«æ = exp(/(a)@). 


We are now prepared to identify this action by quaternions with rotations. As usual, 
let 
O(n) := {A € M,,(R): At = Av} 


be the orthogonal group of R” (preserving the standard inner product), and let 
SO(n) := {A € O(n) : det(A) = 1} < O(n) 


to be the special orthogonal group of rotations of R”, a normal subgroup of index 2 
fitting into the exact sequence 


1 > SO) > O(n) $ {+1} > 1. 


Proposition 2.4.18. H! acts by rotation on H? ~ R? via conjugation (2.4.14): specifi- 
cally, a acts by rotation through the angle 20 about the axis I(œ). 


Proof. Let œ € H'\{+1}. Then for all v € H®, 
leva" |? = Ivl? 


by (2.4.9), so œ acts by a matrix belonging to O(3). 

But we can be more precise. Let j’ € H? be a unit vector orthogonal to i’ = I(a). 
Then (i'^)? = (j’)* = —1 by Lemma 2.4.12(b) and j’i’ = —i’j’ by Lemma 2.4.12(c), so 
without loss of generality we may suppose that /(@) = i and j’ = j. Thus œ = t + xi = 
cos 6 + (sin 6)i with £? + x? = cos? 0 + sin? 0 = 1, and a7! = t — xi. 

We have aia! = i, and 


aja! = (t + xi)j(t— xi) = (t + xi)(t + xi)j 


— (2.4.19) 
= (f° — x^) + 2txi)j = (cos 20)j + (sin 20)k 
by the double angle formula. Consequently, 
aka"! = i(aja') = (— sin 26)j + (cos 20)k 
so the matrix of «œ in the basis i, j, k is 
1 0 0 
A=|0 cos20 —sin2é@], (2.4.20) 


0 sin26 cos2é@ 


a (counterclockwise) rotation (determinant 1) through the angle 20 about i. o 
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Corollary 2.4.21. The action (2.4.13) defines a group homomorphism H! — SO(3), 
fitting into an exact sequence 


1 > {41} > H! > SOG) > 1. 


Proof. The map H! — SO(3) is surjective, since every element of SO(3) is rotation 
about some axis (Exercise 2.15). If œ belongs to the kernel, then œ = cos 8 + (sin 0) (œ) 
must have sin 0 = 0 so œ = 1. o 


2.4.22. The matrix representation of H in section 2.4 extends to a matrix representation 
of H @p C, and this representation and its connection to unitary matrices is still used 
widely in quantum mechanics. In the embedding with 


ele O\ a TW a ae 2 
12 | \ ap 7P Os oP i 0 


whose images are unitary matrices, we multiply by —i to obtain Hermitian matrices 


_ [1 0 _(0 -i _ (0 1 
SEONG: ap ae op ~u o 


where ox, Cy, 0z are the famous Pauli spin matrices. Because of this application to 
the spin (a kind of angular momentum) of an electron in particle physics, the group 
H! also goes by the name H! = Spin(3). 

The extra bit of information conveyed by spin can also be seen by the “belt trick” 
[Hans2006, Chapter 2]. 


2.4.23. We conclude with one final observation, returning to the formula (2.4.10). 
There is another way to mix the dot product and cross product (2.4.11) in H: we define 
the scalar triple product 


HxHxH—-R 


(2.4.24) 
(u,v,W) = u: (v X w). 


Amusingly, this gives a way to “multiply” triples of triples! The map (2.4.24) defines 
an alternating, trilinear form (Exercise 2.19). If u, v,w € H°, then the scalar triple 
product is a determinant 


u-(vXw)=det]v) v2 v3 
W1 W2 W3 


and |u: (v x w)| is the volume of a parallelepiped in R? whose sides are given by u, v, w. 


Exercises 


Let F be a field with char F # 2. 
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> l. 


2. 


3. 


> 4, 


ay 


> 7. 


. Use the quaternion algebra B = 


Show that a (not necessarily associative) F-algebra is associative if and only if 
the associative law holds on a basis, and then check that the multiplication table 
implied by (2.2.2) is associative. 


Show that if B is an F-algebra generated by i,j € B and 1,i,j are linearly 
dependent, then B is commutative. 


L1i\ Tv 
Verify directly that the map (3) — M)(F) in Example 2.2.4 is an isomor- 
phism of F-algebras. 


Let a,b € F*. 


(a) Show that (=?) z (=) R (* th) 
F F 


a, b a ac’, bd? 
F 

is finite, then there are only finitely many isomorphism classes of quater- 

nion algebras over F, and in particular that if F”? = F* then there is only 


(b) Show that if c,d € F* then | Conclude that if F* / F”? 


1,1 
one isomorphism class (3) =x Mo)(F). [The converse is not true, see 


Exercise 3.16.] 


(c 


wm 


Dd 
Show that if B = (3) is a quaternion algebra over R, then B ~ M2(R) 


or B = H, the latter occurring if and only if a < 0 and b < 0. Conclude 
that if B is a division quaternion algebra over R, then B = H. 

(d) Let B be a quaternion algebra over F. Show that B &p F*! ~ M(F™®, 
where F”! is an algebraic closure of F. 

(e) Refine part (d) as follows. A field K 2 F is a splitting field for B if 
B r K ~ M2(K). Show that B has a splitting field K with [K : F] < 2. 


a,b 


LetB =| 
F 


be a quaternion algebra over F. Let i’ € BNF satisfy (i’)* = a’ € 


, b’ 
F*. Show that there exists b’ € F* and an isomorphism B = (=) (under 


which i’ maps to the first standard generator). 


a1= 


1 
} multiplicativity of the determinant, 


and the left regular representation (2.3.2) to show that if two elements of F can 
be written as the sum of four squares, then so too can their product (a discovery 
of Euler in 1748). [In Chapter 3, this statement will follow immediately from the 
multiplicativity of the reduced norm on B; here, the formula is derived easily 
from multiplicativity of the determinant.] 


Let B be an F-algebra. Show that if B is a quaternion algebra over F, then B is 
central. 
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> 8. 


> 9, 


> 10. 


11. 


12. 


13. 
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Let A, B be F-algebras, and let 6: A — B be a surjective F-algebra homomor- 
phism. Show that ¢ restricts to an F-algebra homomorphism Z(A) —> Z(B). 


Prove the following partial generalization of Exercise 2.4(b). Let B be a finite- 
dimensional algebra over F. 


(a) Show that every element œ € B satisfies a unique monic polynomial of 
smallest degree with coefficients in F. 

(b) Suppose that B = D is a division algebra. Show that the minimal poly- 
nomial of œ € D is irreducible over F. Conclude that if F = F” is 
algebraically closed, then D = F. 


Prove Proposition 2.3.1: show directly that the map 
A: B > M(F(Va)) 
rac. [va 0 0 b 
a | 0 -va}’\1 0 
extends uniquely to an injective F-algebra homomorphism. /Hint: check that the 
relations are satisfied. ] 


a) Show explicitly that every quaternion algebra B = (a, b | F) is isomorphic 
to an F-subalgebra of M4(F) via the left (or right) regular representation 
over F: write down 4 x 4-matrices representing i and j and verify that 
the relations i? = a, j? = b, ji = —ij hold for these matrices. Note the 
2 x 2-block structure of these matrices. 


b 


wm 


With respect to a suitable such embedding in (a) for B = H, verify that 
the quaternionic conjugation map œ + @ is the matrix transpose, and the 
matrix determinant is the square of the norm lll? = aa. 


In certain circumstances, one may not want to “play favorites” in the left regular 
representation (Proposition 2.3.1) and so involve i and j on more equal footing. 
To this end, show that the map 


B= ($ ) > M(F(Va, Vb) 


oT w P j. ” (2.4.25) 


0 -va]’\Vvb 0 


is an injective F-algebra isomorphism. How is it related to the left regular 
representation? 


Let B = (a,b | F) be a quaternion algebra over F. For a nonzero element 
a@=t+xit+yj+zk € B, show that the following are equivalent: 


(i) t = 0; and 
(ii) a ¢ F ando? €F. 


[So the notion of “pure quaternion” is not tethered to a particular basis.] 


EXERCISES 33 


14. 


15. 


16. 


> 17. 


18. 


19. 


20. 


Verify that (2.3.7) is an isomorphism of F’-algebras, and interpret this map as 
arising from the left regular representation via a map B => M2(FXF) > M2(F). 


Show that every rotation A € SO(3) fixes an axis. /Hint: Consider the eigenvalues 


of A.] 


For v € H? and 6 € H®°\{0}, consider the map v œ> B"'VB = —B"!vB e HP. 
Show that this map is the reflection across the plane {w € H? : Bw = 0}. (For 
example, taking 6 =i, the map is xi + yj + zk œ —xi + yj + zk.) 


In Corollary 2.4.21, we showed that SU(2) ~ H! has a 2-to-1 map to SO(3), 
where H! acts on H? = R? by conjugation: quaternions model rotations in 
three-dimensional space, with spin. Quaternions also model rotations in four- 
dimensional space, as follows. 


(a) Show that the map 


(H! x H')OH > H 
see (2.4.26) 
x ax 


defines a (left) action of H! xH! on H = R4, giving a group homomorphism 
¢: H! x H! > O(4). 


(b) Show that ø surjects onto SO(4) < O(4). [Hint: If A € SO(4) fixes 
1 € H, then A restricted to HP is a rotation and so is given by conjugation. 
More generally, if Al = a, consider x œ> a~'Ax.] 


(c) Show that the kernel of ¢ is {+1} embedded diagonally, so there is an exact 
sequence 
1 > {+1} > SU(2) x SU(2) > SO) > 1. 


[More generally, the universal cover of SO(n) for n => 3 is a double cover called 
the spin group Spin(n), and so Corollary 2.4.21 shows that Spin(3) ~ SU(2) 
and this exercise shows that Spin(4) ~ SU(2) x SU(2). For further reading, see 
e.g. Fulton—Harris [FH91, Lecture 20].] 


Let puo: R? > R? be the counterclockwise rotation by the angle 8 about the 
axis u € R? ~ H®, with ||u|| = 1. Prove Rodrigues’s rotation formula: for all 
ve R’, 


Pu,e(v) = (cos 8)v + (sin 8)(u X v) + (1 — cos 0)(u - v)u 
where u X v and u - v are the cross and dot product, respectively. 


Verify that the map (2.4.24) is a trilinear alternating form on H, i.e., show the 
form is linear when any two of the three arguments are fixed and zero when two 
argument are equal. 


Let B be a quaternion algebra over F and let M2(B) be the ring of 2 x 2- 
matrices over B. (Be careful in the definition of matrix multiplication: B is 
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noncommutative!) Consider the Cayley determinant: 


Cdet: M2(B) > B 


Cdet - A = ad -yB 

(a) Show that Cdet is F-multilinear in the rows and columns of the matrix. 

(b) Show that Cdet is not left B-multilinear in the rows of the matrix. 

(c) Give an example showing that Cdet is not multiplicative. 

(d) Find a matrix A € M2(H) that is invertible (i.e., having a two-sided inverse) 
but has Cdet(A) = 0. Then find such an A with the further property that its 
transpose has nonzero determinant but is not invertible. 


[Moral: be careful with matrix rings over noncommutative rings! For more on 
quaternionic determinants, including the Dieudonné determinant, see Aslaksen 
[As196].] 
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Chapter 3 


Involutions 


In this chapter, we define the standard involution on a quaternion algebra. In this 
way, we characterize division quaternion algebras as noncommutative division rings 
equipped with a standard involution. 


3.1 > Conjugation 


The quaternion conjugation map (2.4.6) defined on the Hamiltonians H arises naturally 
from the notion of real and pure (imaginary) parts, as defined by Hamilton. This 
involution has a natural generalization to a quaternion algebra B = (a,b | F) over a 
field F with char F + 2: we define 


~: B>B 
æa =t+xi+yj+zij=e @=t-—(xi+yj+zij) 
Multiplying out, we then verify that 
ad = wa = t? — ax? — by? + abz’ € F. 


The way in which the cross terms cancel, because the basis elements i, j,k skew 
commute, is a calculation that never fails to enchant! 

But this definition seems to depend on a basis: it is not intrinsically defined. 
What properties characterize it? Is it unique? We are looking for a good definition 
of conjugation ~ : B —> B on an F-algebra B: we will call such a map a standard 
involution. 

The involutions we consider should have basic linearity properties: they are F-linear 
(with 1 = 1, so they act as the identity on F) and have order 2 as an F-linear map. 
An involution should also respect the multiplication structure on B, but we should not 
require that it be an F-algebra isomorphism: instead, like the inverse map (or transpose 
map) reverses order of multiplication, we ask that a8 = $ @ for all a € B. Finally, we 
want the standard involution to give rise to a trace and norm (a measure of size), which 
is to say, we want a + w € F and œg = aa € F for all œ € B. The precise definition 
is given in Definition 3.2.1, and the defining properties are rigid: if an algebra B has a 
standard involution, then it is necessarily unique (Corollary 3.4.4). 
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The existence of a standard involution on B implies that every element of B satisfies 
a quadratic equation: by direct substitution, we see that a € B is aroot of the polynomial 
x? —tx +n € F[x] where t := œ + @ and n := a@ = aa, since then 


a’ -(a+@%at+aa=0 


identically. Accordingly, we define the reduced trace trd: B — F by trd(a) = œ + œ 
and reduced norm nrd: B — F by nrd(a) = œw. We observe that trd is F-linear and 
nrd is multiplicative on B*. 

Motivated by this setting, we say that B has degree 2 if every element a € B 
satisfies a (monic) polynomial in F'[x] of degree 2 and, to avoid trivialities, that B + F 
(or equivalently, at least one element of B satisfies no polynomial of degree 1). The 
final result of this section is the following theorem (see Theorem 3.5.1). 


Theorem 3.1.1. Let B be a division F-algebra of degree 2 over a field F with char 
F + 2. Then either B = K is a quadratic field extension of F or B is a division 
quaternion algebra over F. 


As a consequence, division quaternion algebras are characterized as noncommu- 
tative division algebras with a standard involution, when char F # 2. 


3.2 Involutions 


Throughout this chapter, let B be an F-algebra. For the moment, we allow F to be of 
arbitrary characteristic. We begin by defining involutions on B. 


Definition 3.2.1. An involution” : B — B is an F-linear map which satisfies: 


@ 1=1; 
(ii) @ = «a for all a € B; and 
(iii) aß = Ba for all a, B € B (the map ~ is an anti-automorphism). 


3.2.2. We define the opposite algebra of B by letting B°? = B as F-vector spaces but 
with multiplication @ ‘op £ = 6-a@ fora, p € B. 

One can then equivalently define an involution to be an F-algebra isomorphism 
B æ B? whose underlying F-linear map has order at most 2. 


Remark 3.2.3. What we have defined to be an involution is known in other contexts as 
an involution of the first kind. An involution of the second kind is a map which acts 
nontrivially when restricted to F, and hence is not F-linear; although these involutions 
are interesting in other contexts, they will not figure in our discussion (and anyway one 
can consider such an algebra over the fixed field of the involution). 


Definition 3.2.4. An involution ~ is standard if aa € F for alla € B. 


Remark 3.2.5. Standard involutions go by many other names. The terminology stan- 
dard is employed because conjugation on a quaternion algebra is the “standard” exam- 
ple of such an involution. Other authors call the standard involution the main involution 
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for quaternion algebras, but then find situations where the “main” involution is not stan- 
dard by our definition. The standard involution is also called conjugation on B, but 
this can be confused with conjugation by an element in B*. We will see in Corollary 
3.4.4 that a standard involution is unique, so it is also called the canonical involution; 
however, there are other circumstances where involutions can be defined canonically 
that are not standard (like the map induced by g + g7! on the group ring F[G]). 


3.2.6. If T is a standard involution, so that a@ € F for all œ € B, then 
(a+ 1)(@at+1l)=(a7+1)\(@+1)=ae@+a+a+1EF 
and hence a + w € F for all a € B as well; it then also follows that qaw = @a, since 


(a + w)a = ala +0). 


Example 3.2.7. The identity map is a standard involution on B = F as an F-algebra. 
The R-algebra C has a standard involution, namely, complex conjugation. 


Example 3.2.8. The adjugate map 


a=(2 oari E 
c d — a 
is a standard involution on M2(F) since AAt = ATA = ad — bc = det A € F. 


Matrix transpose is an involution on M,,(F) but is a standard involution (if and) 
only ifn = 1. 


3.2.9. Suppose char F + 2 and let B = (a, b | F). Then the map 
~: B>B 
œ =t+xi+yj+zij=> =t- xi- yj-—zij 
defines a standard involution on B and @ = 2t — a. The map is F-linear with 1 = 1 and 


@ = a, So properties (i) and (ii) hold. By F-linearity, it is enough to check property 
(iii) on a basis (Exercise 3.1), and we verify for instance that 


ij =ij=—ij = ji=(-/(-) = ji 
(see Exercise 3.3). Finally, the involution is standard because 


(t+ xit+ yj +zij(t-— xi- yj- zij) = -ax — by? + abz? € F. (3.2.10) 


Remark 3.2.11. Algebras with involution play an important role in analysis, in particu- 
lar Banach algebras with involution and C*-algebras (generally of infinite dimension). 
A good reference is the text by Dixmier [Dix77] (or the more introductory book by 
Conway [Con2012]). 
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3.3 Reduced trace and reduced norm 


Let” : B — B bea standard involution on B. We define the reduced trace on B by 


trd: B > F 
= (3.3.1) 
«a+ 
and similarly the reduced norm 
nrd: Bo F 
= (3.3.2) 
ar aa. 


Example 3.3.3. For B = M2(F), equipped with the adjugate map as a standard invo- 
lution as in Example 3.2.8, the reduced trace is the usual matrix trace and the reduced 
norm is the determinant. 


3.3.4. The reduced trace trd is an F-linear map, since this is true for the standard 
involution: 
trd(a + B) = (a + B) + (æ + P) = (@ + T) + (B + B) = trd(a) + trd(B) 
for a, p € B. The reduced norm nrd is multiplicative, since 
nrd(@B) = (apap) = aBBa@ = anrd(B)a@ = nrd(a) nrd(f) 
for alla, 8 € B. 


It will be convenient to write 

B® := {a € B : trd(a) = 0} 
i (3.3.5) 
B` := {a € B* : nrd(a) = 1} 


for the F-subspace B? C B of elements of reduced trace 0 and for the subgroup 
B! < B* of elements of reduced norm 1. We observe that B! < B* is normal, by 
multiplicativity, indeed we have an exact sequence of groups 


1> B! > ee 


(noting that the reduced norm map need not be surjective). 


Lemma 3.3.6. If B is not the zero ring, then a € B is a unit (has a two-sided inverse) 
if and only if nrd(q) + 0. 


Proof. Exercise 3.5. m 


Lemma 3.3.7. For all a, B € B, we have trd(Ga) = trd(@ 8). 
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Proof. We have 
trd(@B) = trd(a(trd(B) — B)) = trd(a) trd(B) — trd(a@B) 


and so => 
trd(aB) = trd(a@B) = trd(Ba) = trd(a) trd(B) — trd(Ba) 


therefore trd(@) = trd(8a). o 


Remark 3.3.8. The maps trd and nrd are called reduced for the following reason. 
Let A be a finite-dimensional F-algebra, and consider the left regular representation 
A: A => Endr (A) given by left multiplication in A (cf. Proposition 2.3.1, but over F). 
We then have a (left) trace map Tr: A — F and (left) norm map Nm: A —> F given 
by mapping a € B to the trace and determinant of the endomorphism Ae € Endr (A). 
When A = M2(F), a direct calculation (Exercise 3.13) reveals that 


Tr(a) = 2trd(a) = 2 tr(@) 


(algebra trace, reduced trace, and matrix trace, respectively; there is no difference 
between left and right), and 


Nm(a) = nrd(a)” = det(a)* 


for all a € A, whence the name reduced. (To preview the language of chapter 7, this 
calculation can be efficiently summarized: as a left A-module, A is the sum of two 
simple A-modules—acting on the columns of a matrix—and the reduced trace and 
reduced norm represent ‘half’ of this action.) 


3.3.9. Since 
a’ ~(a+@at+aa=0 (3.3.10) 


identically we see that œ € B is a root of the polynomial 
x? — trd(a)x + nrd(a) € F[x] (3.3.11) 


which we call the reduced characteristic polynomial of a. The fact that æ satisfies its 
reduced characteristic polynomial is the reduced Cayley-Hamilton theorem for an 
algebra with standard involution. When œ ¢ F, the reduced characteristic polynomial 
of æ is its minimal polynomial, since if œ satisfies a polynomial of degree 1 thena € F. 


3.4 Uniqueness and degree 


Definition 3.4.1. An F-algebra K with dimp K = 2 is called a quadratic algebra. 


Lemma 3.4.2. Let K be a quadratic F-algebra. Then K is commutative and has a 
unique standard involution. 
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Proof. Leta € KNF. Then K = F @ Fa = F[a], so in particular K is commutative. 
Then a? = ta —n for unique t,n € F, since 1, a is a basis for K. 

If : K — K is any standard involution, then from (3.3.10) and uniqueness we 
conclude t = a + w (and n = œw), and so any involution must have w = t — a. On 
the other hand, there is a unique standard involution x: B — B with w = t — a: the 
verification is straightforward (see Exercise 3.2). oO 


Example 3.4.3. The reduced trace and norm on a quadratic algebra are precisely the 
usual algebra trace and norm. If char F + 2 and K 2 F is a quadratic field extension 
of F, then the standard involution is just the nontrivial element of Gal(K | F). 


Corollary 3.4.4. If B has a standard involution, then this involution is unique. 


Proof. For any a € BNF, we have from (3.3.10) that dimp F[a] = 2, so the restriction 
of the standard involution to F'[@] is unique. Therefore the standard involution on B is 
itself unique. o 


We have seen that the equation (3.3.10), implying that if B has a standard involution 
then every a € B satisfies a quadratic equation, has figured prominently in the above 
proofs. To further clarify the relationship between these two notions, we make the 
following definition. 


Definition 3.4.5. The degree of B is the smallest m € Zo such that every element 
a € B satisfies a monic polynomial f(x) € F[x] of degree m, if such an integer exists; 
otherwise, we say B has degree ov. 


3.4.6. If B has finite dimension n = dimp B < œ, then every element of B satis- 
fies a polynomial of degree at most n: if a € B then the elements 1, œ, ..., œ” are 
linearly dependent over F. Consequently, every finite-dimensional F-algebra has a 
(well-defined) integer degree, at most n. 


Example 3.4.7. By convention, we interpret Definition 3.4.5 as defining the degree 
of the zero ring to be 0 (since 1 = 0, the element 0 satisfies the monic polynomial 
0x)—whatever! 

If B has degree 1, then B = F. If B has a standard involution, then either B = F or 
B has degree 2 by (3.3.11). 


3.5 Quaternion algebras 


We are now ready to characterize division algebras of degree 2 when char F + 2. (For 
the case char F = 2, see Chapter 6.) 


Theorem 3.5.1. Suppose char F + 2 and let B be a division F-algebra. Then B has 
degree at most 2 if and only if one of the following hold: 


(i) B= F; 
(ii) B = K is a quadratic field extension of F; or 


(iii) B is a division quaternion algebra over F. 
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Proof. From Example 3.4.7, we may suppose that B # F and B has degree 2. 

Let i € BNF. Then F[i] = K is a (commutative) quadratic F-subalgebra of the 
division ring B, so K = F(i) is a field. If K = B, we are done. Completing the square 
(since char F + 2), we may suppose that i? = a € F™. 

Let ¢: B > B be the map given by conjugation by i, i.e., 6(a@) = i !ai. Then ¢ 
is a K-linear endomorphism of B, thought of as a (left) K-vector space, and ¢7 is the 
identity on B. Therefore ¢ is diagonalizable, and we may decompose B = B* @ B- 
into eigenspaces for ¢: explicitly, we can always write 


_ a+ $a) | a-ga) 
~  - g 2 

We now prove dimx B* = 1. Let a € B*. Then L = F(a,i) is a field. Since 
char F + 2, and L is a compositum of quadratic extensions of F, the primitive element 
theorem implies that L = F(£) for some £ € L. But by hypothesis £ satisfies a 
quadratic equation so dimp L = 2 and hence L = K. (For an alternative direct proof 
of this claim, see Exercise 3.10.) 

If B = B* = K, we are done. So suppose B7 + {0}. We will prove that dimx B™ = 1. 
If0 + j € B then i"! ji = —j, soi = —j~'ij and hence all elements of B~ conjugate i 
to —i. Thus if 0 # jı, j2 € B then j; j2 centralizes i and j)j2 € Bt = K. Thus any two 
nonzero elements of B~ are K-multiples of each other. 

Finally, let j € B-\{0}; then B = Bt ẹ® B7 = K @Kj so B has F-basis 
1,i,j,ij and ji = —ij. We claim that trd(j) = 0: indeed, both j and i™tji = -j 
satisfy the same reduced characteristic (or minimal) polynomial of degree 2, so 
trd(j) = trd(—j) = —trd(j) so trd(j) = 0. Thus j? = b € F*, and B is a quater- 
nion algebra by definition. o 


€ B+ B7. 


Remark 3.5.2. We need not assume in Theorem 3.5.1 that B is finite-dimensional; 
somehow, it is a consequence, and every division algebra over F (with char F + 2) of 
degree < 2 is finite-dimensional. 

There are algebras of arbitary (finite or infinite) dimension over F of degree 2: see 
Exercise 3.15. Also, a boolean ring (see Exercise 3.12) has degree 2 as an F2-algebra, 
and there are such rings of arbitrary dimension over F2. Such algebras are quite far 
from being division rings, of course. 


Remark 3.5.3. The proof of Theorem 3.5.1 has quite a bit of history, discussed by van 
Praag [vPr2002] (along with several proofs). See Lam [Lam2005, Theorem III.5.1] for 
a parallel proof of Theorem 3.5.1. Moore [Moore35, Theorem 14.4] in 1915 studied 
algebra of matrices over skew fields and in particular the role of involutions, and gives 
an elementary proof of this theorem (with the assumption char F # 2). Dieudonné 
[Die48, Die53] gave another proof that relies on structure theory for finite-dimensional 
division algebras. 


Corollary 3.5.4. Let B be a division F-algebra with char F + 2. Then B has degree at 
most 2 if and only if B has a standard involution. 


Proof. In each of the cases (i)—-(Giii), B has a standard involution; and conversely if B 
has a standard involution, then B has degree at most 2 (Example 3.4.7). Oo 
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Remark 3.5.5. The statement of Corollary 3.5.4 holds more generally—even if B is 
not necessarily a division ring—as follows. Let B be an F-algebra with char F + 2. 
Then B has a standard involution if and only if B has degree at most 2 [Voi201 1b]. 
However, this is no longer true in characteristic 2 (Exercise 3.12). 


Corollary 3.5.6. Let B be a division F-algebra with char F + 2. Then the following 
are equivalent: 


(i) B is a quaternion algebra; 
(ii) B is noncommutative and has degree 2; and 


(iii) B is central and has degree 2. 


Definition 3.5.7. An F-algebra B is algebraic if every a € B is algebraic over F (i.e., 
a satisfies a polynomial with coefficients in F). 


If B has finite degree (such as when dimp B = n < oo), then B is algebraic. 


Corollary 3.5.8. (Frobenius). Let B be an algebraic division algebra over R. Then 
either B = Ror B= C or B = Has R-algebras. 


Proof. If a € BNR then R(q@) = C, so a satisfies a polynomial of degree 2. Thus if 
B + R then B has degree 2 and either B ~ C or B is a division quaternion algebra over 
R, and hence B ~ H by Exercise 2.4(c). o 


Example 3.5.9. Division algebras over R of infinite dimension abound. Transcendental 
field extensions of R, such as the function field R(x) or the Laurent series field R((x)), 
are examples of infinite-dimensional division algebras over R. Also, the free algebra in 
two (noncommuting) variables is a subring of a division ring B (its “noncommutative 
ring of fractions”) with center R and of infinite dimension over R. 


Remark 3.5.10. The theorem of Frobenius (Corollary 3.5.8) extends directly to fields 
F akin to R, as follows. A field is formally real if —1 cannot be expressed in F as a 
sum of squares and real closed if F is formally real and has no formally real proper 
algebraic extension. The real numbers R and the field of all real algebraic numbers are 
real closed. A real closed field has characteristic zero, is totally ordered, and contains 
a square root of each nonnegative element; the field obtained from F by adjoining a 
root of the irreducible polynomial x? + 1 is algebraically closed. For these statements, 
see Rajwade [Raj93, Chapter 15]. Every finite-dimensional division algebra over a real 
closed field F is either F or K = F(V-1) or B = (-1,-1| F). 


Remark 3.5.11. Algebras of dimension 3, sitting somehow between quadratic exten- 
sions and quaternion algebras, can be characterized in a similar way. If B is an R-algebra 
of dimension 3, then either B is commutative or B has a standard involution and is 
isomorphic to the subring of upper triangular matrices in M2(R). A similar state- 
ment holds for free R-algebras of rank 3 over a (commutative) domain R; see Levin 
[Lev2013]. 
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Exercises 


Throughout these exercises, let F be a field. 


> l. 


> 2. 


> 3. 


> 5. 


10. 


11. 


Let B be an F-algebra and let ~: B > B be an F-linear map with 1 = 1. Show 
that ~ is an involution if and only if (ii)-Gii) in Definition 3.2.1 hold for a basis 
of B (as an F-vector space). 

Let K = F[a] be a quadratic F-algebra, with a” = ta -n for (unique) t,n € F. 
Extending linearly, show that there is a unique standard involution ~: K —> K 
with the property that œ = t — «œ, and show that 


trd(x + ya) = 2x + ty 
nrd(x + y@) = x? + txy + ny” 


for all x + ya € F[a]. 
Verify that the map ~ in Example 3.2.9 is a standard involution. 


. Determine the standard involution on K = F x F (with F — K under the 


diagonal map). 


Let B be an F-algebra with a standard involution. Show that 0 + a € B is a left 
zerodivisor if and only if œ is a right zerodivisor if and only if nrd(@) = 0. In 
particular, if B is not the zero ring, then a € B is (left and right) invertible if and 
only if nrd(@) + 0. 


. Suppose char F + 2, let B be a division quaternion algebra over F, and let 


K,, K2 C B be subfields with Kı N Kz = F. Show that the F-subalgebra of B 
generated by Kı and K3 is equal to B. Conclude that if 1, a, 6 € B are F-linearly 
independent, then 1, a, 6, af are an F-basis for B. [Hint: use the involution.] By 
way of counterexample, show that these results need not hold for B = Mo(F). 


. Show that B = M,,(F) has a standard involution if and only if n < 2. 
. Let G be a finite group. Show that the F-linear map induced by g > g`! 


for g € G is an involution on the group ring F[G] = B zee F8- Determine 
necessary and sufficient conditions for this map to be a standard involution. 


. Let B be an F-algebra with a standard involution ~ : B — B. In this exercise, 


we examine when ~ is the identity map. 


(a) Show that if char F + 2, then x € B satisfies x = x if and only x € F. 

(b) Suppose that dimp B < oo. Show that the identity map is a standard 
involution on B if and only if (i) B = F or (ii) char F = 2 and Bisa 
quotient of the commutative ring F[x,..., tlie —Aj,.+.5 x2 — an) with 
aE F. 

Let K 2 F be a field which has degree m as an F-algebra in the sense of 
Definition 3.4.5. Suppose that char F + m. Show that [K : F] = m, i.e., K has 
degree m in the usual sense. (What happens when char F | m?) 

Let B be an F-algebra with standard involution. Suppose that ¢: B —> B is 
an F-algebra automorphism. Show for œ € B that (a) = #(q@), and therefore 
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12. 


> 13. 


14. 


15. 


> 16. 
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that trd(¢(@)) = trd(@) and nrd(¢(a@)) = nrd(@). [Hint: consider the map a |> 
¢ '(¢(a)).] 

In this exercise, we explore further the relationship between algebras of degree 
2 and those with standard involutions (Remark 3.5.5). 


(a) Suppose char F + 2 and let B be a finite-dimensional F-algebra. Show that 
B has a standard involution if and only if deg, B < 2. 

(b) Let F = F; and let B be a Boolean ring, a ring such that x? = x for all 
x € B. (Verify that 2 = 0 in B, so B is an F -algebra.) Prove that B does not 
have a standard involution unless B = F> or B = Fz x Fo, but nevertheless 
any Boolean ring has degree at most 2. 


Let B = M,,(F), and consider the map 2: B —> Endr (B) by a > Ag defined 
by left-multiplication in B. Show that for all a € M,(F), the characteristic 
polynomial of Ag is the nth power of the usual characteristic polynomial of æ. 
Conclude when n = 2 that tr(œ) = 2 trd(A) and det(a) = nrd(a)’. 


Considering a slightly different take on the previous exercise: let B be a quater- 
nion algebra over F. Show that the characteristic polynomial of left multiplication 
by a € B is equal to that of right multiplication and is the square of the reduced 
characteristic polynomial. [Hint: if a direct approach is too cumbersome, con- 
sider applying the previous exercise and the left regular representation as in 
2.3.8.] 

Let V be an F-vector space and let t : V — F be an F-linear map. Let B = F@V 
and define the binary operation x - y = t(x)y for x, y € V. Show that - induces 
a multiplication on B, and that the map x + X = t(x) — x for x € V induces 
a standard involution on B. [Such an algebra is called an exceptional algebra 
[GrLu2009, Voi201 1b].] Conclude that there exists a central F-algebra B with a 
standard involution in any dimension n = dimp B > 3. 


In this exercise, we mimic the proof of Theorem 3.5.1 to prove that a quaternion 
algebra over a finite field of odd cardinality is not a division ring, a special case 
of Wedderburn’s little theorem: a finite division ring is a field. 


Assume for purposes of contradiction that B is a division quaternion algebra 
over F = F4 with q odd. 


(a) Leti € BNF. Show that the centralizer Cgx(i) = {œ € B* : ia = ai} of i in 
B* satisfies Cgx(i) = F(i)*. 

(b) Conclude that any noncentral conjugacy class in B™ has order q? + 1. 

(c) Derive a contradiction from the class equation q^ — 1 = q — 1 + m(q? + 1) 
(where m € Z). 


[For the case q even, see Exercise 6.16; for fun, the eager reader may wish to 
prove Weddernburn’s little theorem for F = F, directly.] 
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17. Derive Euler’s identity (1.1.6) that the product of the sum of four squares is 
again the sum of four squares as follows. Let F = Q(x,...,%4, y1,---, Y4) 
be a function field over Q in 8 variables and consider the quaternion algebra 
(-1,-1 | F). Show (by an explicit universal formula) that if R is any commutative 
ring and x, y € R are the sum of four squares in R, then xy is the sum of four 
squares in R. 

18. Suppose char F + 2. For an F-algebra B, let 


V(B) = {a € BNF : a” € F} U {0}. 


Let B be a division ring. Show that V(B) is a vector space (closed under addition) 
if and only if B = F or B = K is a quadratic field extension of F or B is a 
quaternion algebra over F. 

19. Let B be an F-algebra with F-basis e1, e2,...,e@,. Let” : B — Bbeaninvolution. 
Show that ~ is standard if and only if 


eie; € F and (e; + e; )(e; + e;) € F for alli, j =1,...,n. 
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Chapter 4 


Quadratic forms 


Quaternion algebras, as algebras equipped with a standard involution, are intrinsically 
related to quadratic forms. We develop this connection in the next two chapters. 


4.1 > Reduced norm as quadratic form 


Let F be a field with char F + 2 and let B = (a,b | F) be a quaternion algebra over 
F. We have seen (3.2.9) that B has a unique standard involution and consequently a 
reduced norm map, with 


nrd(t + xi + yj + zij) = t° — ax? — by? + abz? (4.1.1) 


for t, x, y, z € F. The reduced norm therefore defines a quadratic form, a homogeneous 
polynomial of degree 2 in F[t, x, y, z] (thought of as a function of the coefficients of 
an element respect to the basis 1, i, j, k). It should come as no surprise, then, that the 
structure of the quaternion algebra B is related to properties of the quadratic form nrd. 

Let Q: V — F be a quadratic form. Then Q can be diagonalized by a change of 
variables: there is a basis e1,...,@, of V such that 


9 7) 
Ol(xyey +++ + Xn€n) = Q(X, . «5 Xn) = AX] +++ + nx; 


with a; € F. We define the discriminant of Q to be the (well-defined) product 
disc(Q) := a, +++ an/2” € F/F”. (The factor 2” is for consistency with more general 
notions; it is harmless if a bit annoying.) We say that a quadratic form is nondegenerate 
if its discriminant is nonzero. The reduced norm quadratic form (4.1.1) is already 
diagonal in the basis 1,i, j, k, and it is nondegenerate because a, b + 0. 

A similarity from Q to another quadratic form Q’: V’ > F is a pair (f, u) where 
f:vV Z V’ is an F-linear isomorphism and u € F™ satisfy Q’(f(x)) = uQ(~) for all 
x € V. An isometry is a similarity with u = 1. The orthogonal group of Q is the 
group of self-isometries of Q, i.e., 


O(Q)(F) := {f € Autr (V) : O(f(x)) = Q(x) for all x € V}. 
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An isometry f € O(Q)(F) is special if det f, and the special orthogonal group of Q 
is the group of special isometries of Q. 

More generally, we have seen that any algebra with a standard involution has a 
quadratic form nrd. We say that the standard involution is nondegenerate whenever 
the quadratic form nrd is so. Generalizing Theorem 3.1.1, we prove the following (see 
Main Theorem 4.4.1 for the proof). 


Main Theorem 4.1.2. Let B be an F-algebra. Then B has a nondegenerate standard 
involution if and only if one of the following holds: 


(i) B= F; 
(ii) B = K has dimp K = 2 and either K ~= F x F or K is a field; or 
(iii) B is a quaternion algebra over F. 


This theorem gives another way of characterizing quaternion algebras: they are 
noncommutative algebras with a nondegenerate standard involution. 

In Section 2.4, we saw that the unit Hamiltonians H! act on the pure Hamiltonians 
H? (Section 2.4) by rotations: the standard Euclidean quadratic form (sum of squares) 
is preserved by conjugation. This generalizes in a natural way to an arbitrary field, and 
so we can understand the group of linear transformations that preserve a ternary (or 
quaternary) form in terms of the unit group of a quaternion algebra B (Proposition 
4.5.10): there is an exact sequence 


1 => F* > B* > SO(ard|p0)(F) > 1 


where SO(Q)(F) is the group of special (or oriented) isometries of the quadratic form 


4.2 Basic definitions 


In this section, we summarize basic definitions and notation for quadratic forms over 
fields. The “Bible for all quadratic form practitioners” (according to the MathSciNet 
review by K. Szymiczek) is the book by Lam [Lam2005]; in particular, Lam gives 
a very readable account of the relationship between quadratic forms and quaternion 
algebras over F when char F + 2 [Lam2005, Sections HI.1-II.2] and many other 
topics in the algebraic theory of quadratic forms. Also recommended are the books by 
Cassels [Cas78], O’ Meara [O’Me73], and Scharlau [Scha85], as well as the book by 
Grove [Grov2002], who treats quadratic forms from a geometric point of view in terms 
of the orthogonal group. For reference and further inspiration, see also the hugely 
influential book by Eichler [Eic53]. 
Let F be a field. (For now, we allow char F to be arbitrary.) 


Definition 4.2.1. A quadratic form Q is a map Q: V —> F on an F-vector space V 
satisfying: 


(i) O(ax) = a?Q(x) for all a € F and x € V; and 
(ii) The map T: V x V — F defined by 


T(x, y) = Q(x + y) — Q(x) - QO) 


is F-bilinear. 
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We call the pair (V, Q) a quadratic space and T the associated bilinear form. 


We will often abbreviate a quadratic space (V, Q) by simply V. If Q is a quadratic 
form then the associated bilinear form T is symmetric, satisfying T(x, y) = T(y, x) 
for all x, y € V; in particular, T(x, x) = 2Q(x) for all x € V, so when char F + 2 we 
recover the quadratic form from the symmetric bilinear form. 


For the remainder of this section, let Q: V — F bea quadratic form with associated 
bilinear form T. 


4.2.2. Suppose dimp V = n < œ. Let e),..., en be a basis for V, giving an isomor- 
phism V = F”. Then Q can be written 


Olnier +++- + Xnen) = X Olei)x? + X Tlen ej)xixj € FIX.. Xn] 
i i<j 
as a homogeneous polynomial of degree 2. 
The Gram matrix of Q in the basis e; is the (symmetric) matrix 


[T] := (T (ei, e;)i j € Mr (F). 


We then have T(x, y) = x*[T]y for x, y € V ~ F” as column vectors. Under a change 
of basis A € GL, (F) with e; = Ae;, the Gram matrix [T] in the basis e; has 


[TY = A*[T]A. (4.2.3) 


Definition 4.2.4. A similarity of quadratic forms from Q: V > F to Q’: V’ > F 
is a pair (f, u) where f: V — V’ is an F-linear isomorphism and u € F™ satisfy 
Q’ (f(x)) = uQ(x) for all x € V, i.e., such that the diagram 


v->F 


| P | (4.2.5) 
Vv’ KeS F 


commutes. In a similarity (f, u), the scalar u is called the similitude factor of the 
similarity. An isometry of quadratic forms (or isomorphism of quadratic spaces) is a 
similarity with similitude factor u = 1; we write in this case Q = Q’. 


Definition 4.2.6. The general orthogonal group (or similarity group) of the quadratic 
form Q is the group of self-similarities of Q under composition 
GO(Q)(F) := {(f,u) € Autr(V) x F* : O(f(x)) = uQ(x) for all x € V}; 


the orthogonal group of Q is the group of self-isometries of Q, i.e., 


O(Q)(F) := {f € Autr (V) : O(f(x)) = Q(x) for all x € V}. 
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Remark 4.2.7. A similarity allows isomorphisms of the target F (as a one-dimensional 
F-vector space). The notion of isometry comes from the connection with measuring 
lengths, when working with the usual Euclidean norm form on a vector space over R: 
similarity allows these lengths to scale uniformly (e.g., similar triangles). 


There is a canonical exact sequence 
1 > O(Q)(F) > GO(Q)(F) > F* 
(f,u) hu 


realizing O(Q)(F) < GO(Q)(F) as the subgroup of self-similarities with similitude 
factor u = 1. 


(4.2.8) 


4.2.9. Returning to 4.2.2, suppose dimp V = n < œ and char F + 2. Then one can 
understand the orthogonal group of Q quite concretely in matrix terms as follows. 
Choose a basis e1,...,@, for V and let [T] be the Gram matrix of Q with respect to 
this basis, so that 20(x) = x*[T]x for all x € V ~ F”. Then Autz (V) ~ GL,(F) and 
A € GL,,(F) belongs to O(Q) if and only if 


(Ax)'[T](Ax) = x'(A'[T]A)x = x" [T]x 
for all x € V, and therefore 
O(O)(F) = {A € GL, (F) : A‘[T]A = [T]} (4.2.10) 
and 


GO(Q)(F) = {(A,u) € GL,(F) x F* : A*[T]A = u(T]} (4.2.11) 


From now on, let Q: V — F be a quadratic form and let T : V x V —> F be the 
symmetric bilinear form associated to Q. 


Definition 4.2.12. Let x, y € V. We say that x is orthogonal to y (with respect to Q) 
if T(x, y) =0. 


Since T is symmetric, x is orthogonal to y if and only if y is orthogonal to x for 
x, y € V, and so we simply say x, y are orthogonal. If S C V is a subset, we write 


S+ := {x €V:T(v,x) = 0 for all v € S} 


for the subspace of V which is orthogonal to (the span of) S. 


4.2.13. Let B be an algebra over F with a standard involution. Then nrd: B > F isa 
quadratic form on B. Indeed, nrd(aq) = a*a for all œ € B, and the map T given by 


T(a, B) = (æ + Ba + B) — aÑ — BB = aß + BU = aß + aß =trd(@B) (4.2.14) 
for a, p € B is bilinear, and 


T(a, P) = trd(aB) = trd(a(trd(B) — B)) = trd(@) trd(B) — trd(@). (4.2.15) 
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So a, 8 € B are orthogonal with respect to nrd if and only if 
trd(aß) = aß + Ba =0 


if and only if 
trd(a p) = trd(q) trd(f). 


Thus 1 anda € B are orthogonal if and only if trd(@) = 0 if and only if a” = —nrd(a). 
Moreover, rearranging (4.2.14), 


ap + Ba = trd(B)a + trd(a)B — T(a, B). (4.2.16) 


In particular, if 1, œ, 6 € B are linearly independent over F, then by (4.2.16) they are 
pairwise orthogonal if and only if Ba = —a@f. 

In this way, we see that the multiplication law in B is governed in a fundamental 
way by the reduced norm quadratic form. 


Definition 4.2.17. Let Q: V — F be a quadratic form. We say that Q represents an 
element a € F if there exists x € V such that Q(x) = a. A quadratic form is universal 
if it represents every element of F. 


Definition 4.2.18. A quadratic form Q (or a quadratic space V) is isotropic if Q 
represents 0 nontrivially (there exists 0 + x € V such that Q(x) = 0) and otherwise Q 
is anisotropic. 


Remark 4.2.19. The terminology isotropic is as least as old as Eichler [Eic53, p. 3], 
and perhaps it goes back to Witt. The word can be used to mean “having properties 
that are identical in all directions’, and so the motivation for this language possibly 
comes from physics: the second fundamental form associated to a parametrized surface 
z = f(x, y) in R? is a quadratic form, and (roughly speaking) this quadratic form defines 
the curvature at a given point. In this sense, if the quadratic form vanishes, then the 
curvature is zero, and things look the same in all directions. 


4.2.20. Let Q’: V’ — F be another quadratic form. We define the orthogonal direct 
sum 


Qag’: VeV >F 
(Qa Q’)\(x + x’) = Q(x) + Qx’) 


where x € V and x’ € V’; the associated bilinear form T aT’ has 


TaT) + x,y +y) =T, y) +T, y) 


for all x,y € V and x’, y’ € V’. By definition, under the natural inclusion of V, V’ C 
V eV’, we have V’ C V+ (and V c (V’)*). 


4.2.21. For a € F, we write (a) for the quadratic form ax? on F. More generally, for 
dj,...,4n € F, we write 


(a1)... m (an) := (a1, ..., An) 


for the quadratic form on F” defined by Q(x1, .. ., Xn) = ax? +-+ anx. 
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To conclude this introduction, we state an important result due originally to Witt 
which governs the decomposition of quadratic spaces into orthogonal sums up to 
isometry. 


Theorem 4.2.22. Let V ~ V’ be isometric quadratic spaces with orthogonal decom- 
positions V ~ W, a Wz and V’ = W; ma W3. 


(a) fW = Wi then W = W3. 
(b) If g : Wi — W; is an isometry, then there exists an isometry f: V — V’ such 
that flw, = g and f(W2) = W3. 


Proof. The proof is requested in Exercise 4.16. For a proof and the equivalence between 
Witt cancellation (part (a)) and Witt extension (part (b)), see Lam [Lam2005, Proof 
of Theorem 1.4.2, p. 14], Scharlau [Scha85, Theorem 1.5.3], or O’Meara [O’Me73, 
Theorem 42:17]. o 


Theorem 4.2.22(a) is called Witt cancellation and 4.2.22(b) is called Witt exten- 
sion. 


4.3 Discriminants, nondegeneracy 


For the remainder of this chapter, we suppose that char F + 2. (We take up the case 
char F = 2 in section 6.3.) Throughout, let Q: V — F be a quadratic form with 
dimp V =n < œ and associated symmetric bilinear form T. 

The following result (proven by induction) is a standard application of Gram- 
Schmidt orthogonalization (Exercise 4.1); working with a quadratic form as a polyno- 
mial, this procedure can be thought of as iteratively completing the square. 


Lemma 4.3.1. There exists a basis of V such that Q = (ai, .. ., an) with a; € F. 


A form presented with a basis as in Lemma 4.3.1 is called normalized (or diag- 
onal). For a diagonal quadratic form Q, the associated Gram matrix [T] is diagonal 
with entries 2a),...,2dy. 


4.3.2. The determinant det([7]) of a Gram matrix for Q depends on a choice of basis for 
V, but by (4.2.3), a change of basis matrix A € GL,,(F) operates on [T] by A*[T]A, and 
det(At[T]A) = det(A} det([T]), so we obtain a well-defined element det(T) € F/F”? 
independent of the choice of basis. 


Definition 4.3.3. The discriminant of Q is 
disc(Q) := 2” det(T) € F/F”. 
The signed discriminant of Q is 


sgndisc(Q) := (-1)""~)? disc(Q) € F/F. 
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When it will cause no confusion, we will represent the class of the discriminant in 
F/F” simply by a representative element in F. 


Remark 4.3.4. The extra factor 2~” is harmless since char F + 2, and it allows us to 
naturally cancel certain factors 2 that appear whether we are in even or odd dimension— 
it will be essential when we consider the case char F = 2 (see 6.3.1). The distinction 
between even and odd dimensional quadratic spaces is not arbitrary: indeed, this 
distinction is pervasive, even down to the classification of semisimple Lie algebras. 


Example 4.3.5. We have disc((a1,...,@n)) = 41 ++ -an for a; E F. 


Definition 4.3.6. The bilinear form T : V x V — F is nondegenerate if for all 
x € V\{0}, the linear functional Ty: V — F defined by 7;.(y) = T(x, y) is nonzero, 
i.e., there exists y € V such that T(x, y) # 0. We say that Q (or V) is nondegenerate if 
the associated bilinear form T is nondegenerate. 


4.3.7. The bilinear form T induces a map 
V — Hom(V, F) 
xr (yh T(x, y)) 


and T is nondegenerate if and only if this map is injective (and hence an isomorphism) 
if and only if det(T) + 0. Put another way, Q is nondegenerate if and only if disc(Q) + 0, 
and so a diagonal form (a1, . . ., an} is nondegenerate if and only if a; + 0 for all i. 


Example 4.3.8. Let B = (a, b | F) be a quaternion algebra. Then by 3.2.9, the quadratic 
form nrd: B —> F is normalized with respect to the basis 1, i, j, ij. Indeed, 


nrd = (1, —a, —b, ab). 


We have disc(nrd) = (ab)* + 0, so nrd is nondegenerate. 


If B is an F-algebra with a standard involution, then the reduced norm defines a 
quadratic form on B, and we say that the standard involution is nondegenerate if nrd 
is nondegenerate. 


4.3.9. One can often restrict to the case where a quadratic form Q is nondegenerate by 
splitting off the radical, as follows. We define the radical of Q to be 


rad(Q) := V+ = {x € V : T(x, y) = 0 for all y € V}. 


The radical rad(Q) € V is a subspace, so completing a basis of rad(Q) to V we can 
write (noncanonically) V = rad(Q) & W, as the direct sum is an orthogonal direct sum 
by definition of the radical. In this decomposition, Ql|raaço) is identically zero and Qļw 
is nondegenerate. 
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4.4 Nondegenerate standard involutions 


In this section, we follow Theorem 3.5.1 with a characterization of quaternion algebras 
beyond division algebras. 


Main Theorem 4.4.1. Suppose char F + 2 and let B be an F-algebra. Then B has a 
nondegenerate standard involution if and only if one of the following holds: 


(i) B= F; 
(ii) B = K is a quadratic F-algebra and either K ~= F x F or K is a field; or 
(iii) B is a quaternion algebra over F. 


Case (ii) in Main Theorem 4.4.1 is equivalent to requiring that K be a quadratic 
F-algebra that is reduced (has no nonzero nilpotent elements). 


Remark 4.4.2. By Exercise 3.15, there exist F-algebras with standard involution having 
arbitrary dimension, so it is remarkable that the additional requirement that the standard 
involution be nondegenerate gives such a tidy result. 


Proof of Main Theorem 4.4.1. If B = F, then the standard involution is the identity and 
nrd is nondegenerate. If dimp K = 2, then after completing the square we may write 
K = F[x]/(x* — a) and in the basis 1, x we find nrd = (1, a). By Example 4.3.5, nrd is 
nondegenerate if and only if a € F* if and only if K is a quadratic field extension of F 
orK=FXF. 

Suppose that dimp B > 2. Let 1,i,7 be a part of a normalized basis for B with 
respect to the quadratic form nrd. Then T(1,i) = trd(@i) = 0, so i? = a € F*, since 
nrd is nondegenerate. Note in particular that i = —i. Similarly j? = b € F*, and 
by (4.2.16) we have trd(ij) = ij + ji = 0. We have T(1,ij) = trd(ij) = 0, and 
T(ij,i) = trd(i(ij)) = —a trd(j) = 0 and similarly T(ij, j) = 0, hence ij € {1,i, j}+. If 
ij = 0 then i(ij) = aj = 0 so j = 0, a contradiction. Since nrd is nondegenerate, it 
follows then that the set 1, i, j, ij is linearly independent. 

Therefore, the subalgebra A of B generated by i, j satisfies A = (a,b | F), and if 
dimp B = 4 we are done. So let k € A+; then trd(k) = 0 and k? = c € F*. Thus 
k € BX, with k~! = c7!k. By 4.2.13 we have ka = @k for any a € A since k = —k. 
But then 


k(ij) = (Jk = jik = jki = k(ji). (4.4.3) 


But k € B* so ij = ji = —ij, and this is a contradiction. 


Main Theorem 4.4.1 has the following corollaries. 


Corollary 4.4.4. Let B be an F-algebra with char F + 2. Then B is a quaternion alge- 
bra if and only if B is noncommutative and has a nondegenerate standard involution. 


Proof. Immediate. o 


Corollary 4.4.5. Let B have a nondegenerate standard involution, and suppose that 
K C Bis a commutative F-subalgebra such that the restriction of the standard involu- 
tion is nondegenerate. Then dimp K < 2. Moreover, if K + F, then the centralizer of 
K* in B* is K*. 
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Proof. The first statement is immediate; the second follows by considering the algebra 
generated by the centralizer. o 


Remark 4.4.6. Algebras with involutions come from quadratic forms, and the results 
of this chapter are just one special case of a much more general theory. More precisely, 
there is a natural bijection between the set of isomorphism classes of finite-dimensional 
simple F-algebras equipped with an F-linear involution and the set of similarity classes 
of nondegenerate quadratic forms on finite-dimensional F-vector spaces. More gener- 
ally, for involutions that act nontrivially on the base field, one looks at Hermitian forms. 
Consequently, there are three broad types of involutions on central simple algebras, 
depending on the associated quadratic or Hermitian form: orthogonal, symplectic, and 
unitary. Accordingly, algebras with involutions can be classified by the invariants of the 
associated form. This connection is the subject of the tome by Knus—Merkurjev—Rost— 
Tignol [KMRT98]. In this way the theory of quadratic forms belongs to the theory of 
algebras with involution, which in turn is a part of the theory of linear algebraic groups, 
as expounded by Weil [Weil60]: see the survey by Tignol [Tig98] for an overview and 
further references. 


4.5 Special orthogonal groups 


In this section, we revisit the original motivation of Hamilton (Section 2.4) in a more 
general context, relating quaternions to the orthogonal group of a quadratic form. We 
retain our running hypothesis that char F + 2 and Q: V — F is a nondegenerate 
quadratic form with dimp V =n < o. 


Definition 4.5.1. An isometry f € O(Q)(F) is special (or proper) if det f = 1. The 
special orthogonal group of Q is the group of special isometries of Q: 


SO(Q)(F) := {f € O(Q)(F) : det(f) = 1}. 


The condition “det f = 1” is well-defined, independent of the choice of F-basis 
of V; having chosen a basis of V so that O(Q)(F) < GL,(F), we have SO(Q)(F) = 
O(Q)(F) N SL, (F). 


4.5.2. Suppose that V = F” and let f € O(Q) be a self-isometry of Q, represented in 
the standard basis by A € GL, (F). Taking determinants in (4.2.10) we conclude that 
det(A)? = 1 so det(A) = +1. The determinant is surjective (see Exercise 4.15), so we 
have an exact sequence 


1 > SOF) > OOF) SS {41} > 1. 
If n is odd, then either f or — f is special, so the sequence splits and 
O(Q)(F) = {+1} x SO(Q)(F). (4.5.3) 


4.5.4. Similarly, if (f,u) E€ GO(Q)(F) then from (4.2.11) we get u~” det(f)? = Lif 
n = 2m is even, then u“™ det( f) = +1, and we define the general special orthogonal 
group (or special similarity group) of Q to be 


GSO(Q)(F) := {(f,u) E€ GO(Q)(F) : uw" det(f) = 1} 
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giving an exact sequence 
1 — GSO(Q)\(F) > GO(Q)(F) > {+1} > 1. 
If n is odd, we have little choice other than to define GSO(Q)(F) := GO(Q)(F). 


Example 4.5.5. If V = R” and Q is the usual Euclidean norm on V, then 
O(Q)(R) = O(n) = {A € GL, (R): AAt = 1} 


is the group of linear maps preserving length (but not necessarily orientation), whereas 
SO(Q)(R) is the usual group of rotations of V (preserving orientation). Similarly, 
GSO(Q)(R) consists of orientation-preserving similarities, preserving orientation but 
allowing a constant scaling. 

In particular, if n = 2 then O(2) := O(Q)(R) contains 


cos@ sin 
-sin cos@ 


SO(2) := SO(Q)(R) = | :0E R} ~ R/(2xZ) = S! 


(the circle group) with index 2, with a reflection in any line through the origin repre- 
senting a nontrivial coset of SO(2) < O(2). 


4.5.6. More generally, we may define reflections in O(Q)(F) as follows. For x € V 
anisotropic (so Q(x) + 0), we define the reflection in x to be 
hy: VOV 
T(v, x) 
- x. 
Q(x) 


T(v) =v 


We have t(x) = x — 2x = —x, and 


Oma =00)+0 |- x) ) m b T(v, 2} 


Ox) ~ ~ 0G) 
T 2 T(v, 
= Ov) + on o- RATo a= O00 


so Tx E O(Q)(F)\ SO(Q)(F). 


By a classical theorem of Cartan and Dieudonné, the orthogonal group is generated 
by reflections. 


Theorem 4.5.7. (Cartan-Dieudonné). Let (V, Q) be a nondegenerate quadratic space 
with dimp V = n. Then every isometry f € O(Q)(F) is a product ofat most n reflections. 


Proof. See Lam [Lam2005, §1.7], O’Meara [O’Me73, §43B], or Scharlau [Scha85, 
Theorem 1.5.4]. The proof is by induction on n, carefully recording the effect of a 
reflection in an anisotropic vector. oO 
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Since reflections have determinant —1, an isometry f is special (and f € SO(Q)(F)) 
if and only if it is the product of an even number of reflections. 


4.5.8. Now let B be a quaternion algebra over F, and recall (3.3.5) that we have defined 
B® := {v € B : trd(B) = 0}. 


Then there is a (left) action 
B*OB? > B’ 
(4.5.9) 


a-v = aval. 


since trd(ava!) = trd(v) = 0. Moreover, B* acts on V by isometries with respect to 
the quadratic form Q = nrd|gzo: V —> F, since nrd(ava™!) = nrd(v) for all œ € B and 
veV. 


Proposition 4.5.10. Let B be a quaternion algebra over F. Then the action (4.5.9) 
induces an exact sequence 

1 => F* > B* > SO(nrd|p0)(F) > 1. (4.5.11) 
If further nrd(B*) = F*?, then 

1 > {+1} > B! > SO(nrd|g0)(F) > 1, 
where B! := {a € B : nrd(a) = 1}. 
Proof. Let Q = nrd |go. We saw in 4.5.8 that the action of B” is by isometries, so lands 
in O(Q)(F). By the Cartan—Dieudonné theorem (Theorem 4.5.7, the weak version of 
Exercise 4.17 suffices), every isometry is the product of reflections, and by determinants 


an isometry is special if and only if it is the product of an even number of reflections. 
A reflection in x € V = B? with Q(x) = nrd(x) + 0 is of the form 


(v) T(v, x) trd(vx) 
T(v) =v- x=v- x 
Q(x) nrd(x) (4.5.12) 
=v—-—(vx+ xvx! =-xvx | = xvx l, 
the final equality from x = —x as x € B}. The product of two such reflections is thus 


of the form v + ava! with a € B*. Therefore B* acts by special isometries, and 
every special isometry so arises: the map BX — O(Q)(F) surjects onto SO(Q)(F). The 
kernel of the action is given by those a € B* with ava@! = v for all v € B®, i.e., 
a € Z(B*) = F*. 

The second statement follows directly by writing BX = B'F™. o 


Example 4.5.13. If B ~ M2(F), then nrd = det, so det? = (1,-1,-1) and (4.5.11) 
yields the isomorphism PGL2(F) ~ SO((1, —1, -1))(F). 


Example 4.5.14. If F = R and B = H, then det(H) = R>o = R*?, and the second 
exact sequence is Hamilton’s (Section 2.4). 
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To conclude, we pass from three variables to four variables. 


4.5.15. In Exercise 2.17, we showed that there is an exact sequence 
1 > {+1} > H! x H! > SOM > 1 


with H! x H! acting on H = R* by v > avg! = avg for a, B € H!. 
More generally, let B be a quaternion algebra over F. Then there is a left action of 
B* x B* on B: 
BX x BOB > B 
(a, B): v = avg. 


This action is by similarities, since if a = nrd(q@) and b = nrd(£), then 


(4.5.16) 


nrd(avB"!) = nrd(@) nrd(v) nrd(B™!) = ; nrd(v) 


for all v € V, with similitude factor u = a/b. In particular, if nrd(@) = nrd(), then the 
action is by isometries. 


Proposition 4.5.17. With notation as in 4.5.15, the left action (4.5.16) induces exact 


sequences 


1 > F* > B* x B* > GSO(nrd)(F) > 1 
ab (a,a) 


(4.5.18) 


and 
1 > F* > {(a, B) € B* x B* : nrd(a) = nrd(B)} — SO(mrd)(F) > 1. 
If further nrd(B*) = F”?, then the sequence 
1 > {+1} > B! x B! > SO(mrd)(F) > 1 


is exact. 


Proof. For the first statement, we first show that the kernel of the action is the diagonally 
embedded F*. Suppose that «v87! = v for all v € B; taking v = 1 shows £ = a, and 
then we conclude that av = va for all v € B so œ € Z(B) = F. 

Next, the map BX x B* — GSO(nrd)(F) is surjective. If f € GSO(nrd)(F) then 
nrd(f(x)) = unrd(x) for all x € B, so in particular u € nrd(B*). Every such similitude 
factor occurs, since the similitude factor of (a, 1) is nrd(q@). So it suffices to show that 
the map 

{(a, B) € B* x B* : nrd(q@) = nrd(B)} > SO(nrd)(F) 


is surjective. We again appeal to the Cartan—Dieudonné theorem; by the same compu- 


tation as in (4.5.12), we calculate that a reflection in x € B% is of the form 
tv) = -x7 |. 


The product of two reflections for x, y € B* is thus of the form 


vie yaar Dy! = xO = avg! 
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where a = yx7! and 8 = g, and in particular the action is by special similarities. We 
conclude that (4.5.18) and the second sequence are both exact. 

The final statement again follows by writing BX = B! F*, and seeing the kernel as 
F* A B! = {+1}. m 


Example 4.5.19. When B = M>(F), then nrd(B*) = det(GL2(F)) = F*, giving the 
exact sequence 


1 > GL (F) > GL2(F) x GL2(F) —> GSO(det)(F) > 1. 


Exercises 


Let F be a field with char F # 2. 


> 1. Give an algorithmic proof that every finite-dimensional quadratic space has a 
normalized basis (Lemma 4.3.1). 


2. Let F = R and let 


y= [ann : an € R for all n > 0 and > a? converges}. 
n=0 


Show that V is an R-vector space, and the map Q: V > R by Q((an)n) = D0 a? 
is a quadratic form, and so V is an example of an infinite-dimensional quadratic 
space. [This example generalizes to the context of Hilbert spaces.] 

3. Let B be a quaternion algebra over F. Let N : B > F and A: B —> F be defined 
by N(a) = trd(@?) and A(a) = trd(a)? — 4 nrd(a). Show that N, A are quadratic 
forms on B, describe their associated bilinear forms, and compute a normalized 
form (and basis) for each. 

4. Generalize Exercise 2.15 as follows. Let Q: V — F be a quadratic form with 
dimp V = n and let y € O(Q). 


(a) If n is odd and det y = 1, then y has a nonzero fixed vector (and therefore 
restricts to the identity on a one-dimensional subspace of V). 
(b) If n is even and det y = —1, then y has both eigenvalues —1 and 1. 


5. Generalizing part of Exercise 4.3, let B be an F-algebra with a standard involu- 
tion. Show that the discriminant form 


A: B >F 
Ala) = trd(a)* — 4nrd(a) 
is a quadratic form. 


6. Let Q: V > F be a quadratic form with dimp V < œo and associated bilinear 
form T. The map 


V — Homp(V, F) 
x (y > T(x, y)) 
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is F-linear. Show that Q is nondegenerate if and only if this map is an isomor- 
phism. 

7. Write out the action (4.5.9) explicitly, as follows. Let B = (a,b | F) and let 
a@=t+xityjt Zij. 


(a) Show that the matrix of the action v © ava! in the F-basis B= i,j,k} 
for B° is [a] where nrd(a)[a] is equal to 


t? — ax? + by? — abz* —2a(tz+ xy) —2(ty + axz) 
2b(tz — xy) 1? + ax? — by? — abz* 2(tx — byz) 
2b(axz — ty) 2a(tx + byz) t? + ax? + by? + abz? 


and nrd(@) = £ — ax? — by? + abz’. 

(b) Let Q = nrd|go and let T be the associated bilinear form. Show that 
the Gram matrix [T] in the basis 6 is the diagonal matrix with entries 
—2a, —2b, 2ab. Then confirm by direct calculation that 


[a] € SO(Q)(F) = {A € SL3(F) : A[T]A* = [T]}. 


8. In this exercise, we prove the chain lemma. Let B := (a, b | F) be a quaternion 
algebra. 


(a) Show that if i’ is orthogonal to 1, j, then (i’)* = a’ € F* and ï’, j are 
standard generators for B, so B = (a’,b | F). 


(b) Let B’ := (a’,b’ | F), and suppose that B is isomorphic to B’. Show that 
there exists c € F* such that 


B= (=) ` (<$) 2 Ga z (22) 
F F F F 
[Hint: let ġ: B’ — B be the isomorphism, and take an element orthogonal 
to 1, j, ġQ’).] 
9. In this exercise, we develop some of the notions mentioned in Remark 3.3.8 in 
the context of quadratic forms. 


Let B be a finite-dimensional F-algebra (not necessarily a quaternion algebra), 
and let Tr: B — F be the left algebra trace (the trace of the endomorphism 
given by left multiplication). 


(a) Show that the map B —> F defined by x œ> Tr(x?) is a quadratic form on 
B; this form is called the (left) trace form on B. 

(b) Compute the trace form of A x B and A 8r B in terms of the trace form of 
A and B. 

(c) Show that if K > F is a inseparable field extension of finite degree, then 
the trace form on K (as an F-algebra) is identically zero. On the other hand, 
show that if K/F is a finite separable field extension (with char F + 2) then 
the trace form is nondegenerate. 

(d) Compute the trace form on Qv5) and Q(q@) where œ = 2 cos(27/7), so 
that a? +a*-2a-1=0. 
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> 10. 


11. 


12. 


> 13. 


14. 


Let Q: V > F and Q’: V’ — F be quadratic forms over F with dimp V = 
dimp V’ = n < œ, and let T,T’ be the associated bilinear forms. Suppose 
that there is a similarity Q ~ Q’ with similitude factor u € F*. Show that 
det T’ = u” detT € F/F*. 


Let Q: V — F be a nondegenerate quadratic form with dimp V =n < œ. 


(a) A subspace W C V is totally isotropic if Qļw = 0 is identically zero. 
The Witt index v(Q) of Q is the maximal dimension of a totally isotropic 
subspace. Show that if v(Q) = m then 2m < n. 

(b) A Pfister form is a form in 2” variables defined inductively by Ka = 
(1, —a} and 


«a, »++,Am-1, am) = «a, e.’ am-1) = Ama, oe) Am-1)- 


„b 
Show that the reduced norm nrd on = is the Pfister form ((a, b)). 


(c) The hyperbolic plane is the quadratic form H: F 2 > F with H(x, y) = xy. 
A quadratic form Q is a hyperbolic plane if Q ~ H. A quadratic form Q 
is totally hyperbolic if Q ~ H H where H is a hyperbolic plane. 
Show that if Q is an isotropic Pfister Taim, then Q is totally hyperbolic. 

(d) Suppose that Q is an isotropic Pfister form with n > 4. Let W c V be 
a subspace of dimension n — 1. Show that Q|w is isotropic. [This gives 
another proof of Main Theorem 5.4.4 (iii) > (iv).] 


(a) Let B be a quaternion algebra over F. Show that the reduced norm is 
the unique nonzero quadratic form Q on B that is multiplicative, i.e., 
O(aB) = O(a)Q(B) for all a, B € B. 

(b) Show that (a) does not necessarily hold more generally, for B an algebra 
with a standard involution. /Hint: consider upper triangular matrices. ] 


In this exercise, we pursue some geometric notions for readers some background 


in algebraic geometry (at the level of Hartshorne [Har77, Chapter 1]). 
Let Q be nonzero quadratic form on V with dimp V = n. The vanishing locus 


of Q(x) = 0 defines a projective variety X C P(V) ~ P” of degree 2 called a 
quadric. Show that the quadratic form Q is nondegenerate if and only if the 
projective variety X is nonsingular. [For this reason, a nondegenerate quadratic 
form is also synonymously called nonsingular. ] 


In this exercise, we work out from scratch Example 4.5.13: we translate the 
results on rotations in section 2.4 to B = M2(R), but with respect to a different 
measure of ‘length’. 


Let 


M: (R)? = {v € M>(R) : tr(v) = 0} = (i a x,y, ZE RÌ . 


For v € M2(R)?, we have det(v) = —x? — yz. Show that the group 
M2(R)! = SL2(R) = {a € Mo(R): det(æ) = 1} 
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> 15. 


> 16. 


> 17. 
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acts linearly on M2(R)° by conjugation (the adjoint representation) preserving 
the determinant, giving rise to an exact sequence 


1 > {+1} > SL)(R) > SO(det) - 1. 


Let Q: V — F be a quadratic form with V finite-dimensional over F. Show 
that SO(Q) < O(Q) is a (normal) subgroup of index 2. What can you say about 
GSO(Q) < GO(Q)? 


In this exercise, we prove Theorem 4.2.22. Let Q: V — F be a quadratic form 
with dimp V < œ and let T be its associated bilinear form. 


(a) Let v € V be anisotropic. Define the reflection along v by 


w:V >V 
nee e 2, 


Observe that T, is F-linear, and then show that t, € O(V) with det m, = —1. 
[Hint: extend v to a basis of the orthogonal complement of V.] Why is Ty 
called a reflection? 


(b) If x,y € V are anisotropic with Q(x) = Q(y), show that there exists 
f € O(V) such that f(x) = y. [Hint: reflect along either v = x + y or 
v = x — y as at least one is anisotropic, in the former case postcomposing 
with reflection along x.] 


(c) Let Q’: V’ — F be another quadratic form, and let f: V — V’ be an 
isometry. For W C V, show that f(W+) = f(W)+. 


(d) Prove Theorem 4.2.22(a). [Hint: reduce to the case where dimp W, = 
dimp Wi = 1; apply parts (b) and (c).] 


(e) Prove Theorem 4.2.22(b). [Hint: compare the isometry V =~ V’ with the 
isometry g.] 


Prove the following weakened version of the Cartan—Dieudonné theorem (The- 
orem 4.5.7): Let (V, Q) be a nondegenerate quadratic space with dimp V = n. 
Show that every isometry f € O(Q)(F) is a product of at most 2n — 1 reflections. 
[Hint: in the proof of Exercise 4.16(b), note that f can be taken to be a product 
of at most 2 reflections, and finish by induction. ] 
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Chapter 5 


Ternary quadratic forms and 
quaternion algebras 


Continuing our treatment of quadratic forms, in this chapter we connect quaternion 
algebras to ternary quadratic forms. 


5.1 > Reduced norm as quadratic form 


Let F be a field with char F + 2 and let B = (a,b | F) be a quaternion algebra over 
F. We saw in the previous chapter (4.1.1) that the reduced norm defines a quadratic 
form. But always have scalar norms nrd(t) = t for t € F, so the form carries the same 
information when restricted to the space of pure quaternions 


B® := {a € B : trd(a) = 0} 
with basis i, j, ij. This quadratic form restricted to B° is 
nrd(xi + yj + zij) = —ax* — by? + abz* 


with discriminant (—a)(—b)(ab) = (aby’, so the trivial class in F*/F*?. 

We might now try to classify quaternion algebras over F up to isomorphism in 
terms of this quadratic form. Recall as in the previous chapters that for morphisms 
between quadratic forms, one allows either isometries, an invertible change of basis 
preserving the quadratic form, or similarities, which allow a rescaling of the quadratic 
form by a nonzero element of F. Our main result is as follows (Corollary 5.2.6). 


Theorem 5.1.1. The map B |> nrd|go induces a bijection: 


with discriminant 1 € F*/F*? 
up to isometry 


Nondegenerate ternary 


a algebras over F 


Ternary quadratic forms over F 
up to isomorphism | | | 


quadratic forms over F 
up to similarity 
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The map B +> nrd|go in Theorem 5.1.1 has inverse defined by the even Clifford 
algebra (see section 5.3). The similarity class of a nondegenerate ternary quadratic 
form cuts out a well-defined plane conic C C P? over F, so one also has a bijection 
between isomorphism classes of quaternion algebras over F and isomorphism classes 
of conics over F. Finally, keeping track of an orientation allows one to fully upgrade 
this bijection to an equivalence of categories (Theorem 5.6.8). 

The classification of quaternion algebras over F is now rephrased in terms of 
quadratic forms, and a more detailed description depends on the field F. In this vein, 
the most basic question we can ask about a quaternion algebra B is if it is isomorphic 
to the matrix ring B ~ M2(F): if so, we say that B is split over F. For example, every 
quaternion algebra over C (or an algebraically closed field) is split, and a quaternion 
algebra (a, b | R) is split if and only if a > 0 or b > 0. 

Ultimately, we will identify six equivalent ways (Main Theorem 5.4.4) to check if 
a quaternion algebra B is split; in light of Theorem 5.1.1, we isolate the following. 


Proposition 5.1.2. B is split if and only if the quadratic form nrd|go represents O 
nontrivially. 


In later chapters, we will return to this classification problem, gradually increasing 
the “arithmetic complexity” of the field F. 


5.2 Isomorphism classes of quaternion algebras 


In Section 2.4, we found that the unit Hamiltonians act by conjugation on the pure 
quaternions H? =~ R? as rotations, preserving the standard inner product. In this 
section, we return to this theme for a general quaternion algebra, and we characterize 
isomorphism classes of quaternion algebras in terms of isometry classes of ternary 
quadratic forms. 

Throughout this chapter, let F be a field with char F + 2, and let B = (a,b | F) be 
a quaternion algebra over F. 


Definition 5.2.1. a € B is scalar if œ € F and pure if trd(@) = 0. 


5.2.2. Recalling (3.3.5), we have the F-vector space of pure (trace 0) elements of B 
given by B° = {1}+. The standard involution restricted to B? is given by @ = —a@ for 
a € B®, so equivalently B® is the —1-eigenspace for ~. We have B? = Fi @ Fj ® Fij 
and in this basis 

nrd |go = (—a, —b, ab) (5.2.3) 


so that disc(nrd|go) = (ab)? = 1 € F*/F*? (cf. Example 4.3.8). 
Proposition 5.2.4. Let B, B’ be quaternion algebras over F. Then the following are 
equivalent: 
(i) B = B’ are isomorphic as F-algebras; 
(ii) B = (B^)? are isomorphic as F-algebras; 
(iii) B ~ B’ are isometric as quadratic spaces; and 
(iv) B? = (B’)° are isometric as quadratic spaces. 
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If f: B? © (B’) is an isometry, then f extends uniquely to either an isomorphism 
f: B > B’ or an isomorphism f : B œ (B’)°? of F-algebras. 


Proof. We follow Lam [Lam2005, Theorem HI.2.5]. The equivalence (i) = (ii) follows 
from postcomposing with the standard involution : B’ © (B’)°?. 

The implication (i) > (iii) follows from the fact that the standard involution on an 
algebra is unique and the reduced norm is determined by this standard involution, so 
the reduced norm on B is identified with the reduced norm on B’. 

The implication (iii) > (iv) follows from Witt cancellation (Theorem 4.2.22); and 
(iv) > (iii) is immediate, since B = (1) @B° and B’ = (1) 8(B’)° so the isometry 
extends by mapping 1 +> 1. (Or use Witt extension, Theorem 4.2.22(b).) 

So finally we prove (iv) > (i). Let f: B® —> (B^? be an isometry of quadratic 
spaces. Suppose B = (a,b | F). Since f is an isometry, nrd(f(i)) = nrd@) = —a and 


nrd(f(i)) = FOFO = -fO 


so f(i)* = a. Similarly f(j)* = b. Finally, ji = —ij since i, j are orthogonal (as in 
the proof of Main Theorem 4.4.1), but then f(i), f(j) are orthogonal as well and so 
FOFO = -—FOFY. 

Similarly, we know that ij is orthogonal to i, j, thus f(ij) is orthogonal to both 
f@ and f(j) and so f(ij) = uf@f() for some u € F*; taking reduced norms gives 
nrd(ij) = u? nrd(i) nrd(j) so u? = 1 thus u = +1. If u = 1, then fad = FOSG) 
and f extends via f(1) = 1 to an F-algebra isomorphism B æ B’. Otherwise, 
u = —l and f(ij) = -—f@fYU) = f(D, in which case f extends to an F-algebra 
anti-isomorphism, or equivalently an F-algebra isomorphism B æ (B’)°?; but then 
postcomposing with the standard involution we obtain an F-algebra isomorphism 
BP. Oo 


Main Theorem 5.2.5. Let F be a field with char F + 2. Then the functor B +> nrd |go 
yields an equivalence of categories between 


Quaternion algebras over F, 
under F-algebra isomorphisms and anti-isomorphisms 


and 
Ternary quadratic forms over F with discriminant 1 € F*/F*?, 


under isometries. 


Proof. The association B +> nrd |go gives a functor from quaternion algebras to non- 
degenerate ternary quadratic forms with discriminant 1, by 5.2.2; the map sends iso- 
morphisms and anti-isomorphisms to isometries and vice versa by Proposition 5.2.4. 
Therefore the functor is fully faithful. To conclude, we show that the functor is essen- 
tially surjective. Let V be a nondegenerate ternary quadratic space with discriminant 
1 € F*/F*?. Choose a normalized basis for V, so that Q ~ (—a, —b, cY with a, b,c € F*. 
By hypothesis, we have disc(Q) = abc € F*?, so applying the isometry rescaling the 
third basis vector we may suppose c = ab. We then associate to V the isomorphism 
class of the quaternion algebra (a, b | F). The result follows. o 
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Corollary 5.2.6. The map B > nrd|go yields a bijection 


with discriminant 1 € F*/F*? 
up to isometry 
| Nondegenerate ternary | 
© 


u algebras over F 


Ternary quadratic forms over F 
up to isomorphism | | | 


quadratic forms over F 
up to similarity 


that is functorial with respect to F. 


By the expression functorial with respect to F, we mean that this bijection respects 
(is compatible with) field extensions: explicitly, if F —> K is an inclusion of fields, 
and B is a quaternion algebra with associated ternary quadratic form Q: B® > F, 
then the quaternion algebra Bg = B @ pf K has associated ternary quadratic form 
Ox: B} = Ber K >K. 


Proof of Corollary 5.2.6. Functoriality boils down to the fact that 
(Bx) = (B 8r K)’ = B° 8r K 


for F —> K an inclusion of fields. The first bijection is an immediate consequence of 
Main Theorem 5.2.5. We do not need anti-isomorphisms once we restrict to classes, 
since if there is an anti-isomorphism B — B’ then composing with the standard 
involution gives a straight up isomorphism. 

Next, we examine the natural map from isometry classes to similarity classes and 
show it is surjective. Every nondegenerate ternary quadratic form (or any quadratic 
form in odd dimension) is similar to a unique isometry class of quadratic forms with 
trivial discriminant: if Q = (a, b,c) with a, b,c € F*, then disc((a, b, c)) = abc and 


Q = (a,b,c) ~ abcla, b,c) = (a*be, ab’c, abc’) = (bc, ac, ab) 


and disc((bc, ac, aby) = (abc)? = 1 € F*/F*?. Therefore the map is surjective. 

To conclude, we show this map is injective. Suppose that Q, Q’ are forms of 
discriminant 1, so detT,detT’ € F*?. Suppose there is a similarity Q ~ Q’, so 
QO’ (f(x)) = uQ(x) for some f: V > V’ and u € F*; we show in fact that Q = Q’ are 
isometric. By Exercise 4.10, we have det T” = u? det T, and u = c? € F*?. Therefore 


O'(c! f(x) = € 7 O' (F(x) = uw O'(f(x)) = O(a) 


and c7! f: V -© V’ is the sought after isometry. o 


Remark 5.2.7. We will refine Main Theorem 5.2.5 in section 5.6 by restricting the 
isometries to those that preserve orientation. 
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5.3 Clifford algebras 


In this section, we define a functorial inverse to B + nrd|go = Q in Main Theorem 
5.2.5: this is the even Clifford algebra of Q. The Clifford algebra is useful in many 
contexts, so we define it more generally. Loosely speaking, the Clifford algebra of a 
quadratic form Q is the algebra generated by V subject to the condition x? = Q(x) 
for all x € V, so the multiplication on the Clifford algebra is induced by the quadratic 
form. 

Let Q: V — F be a quadratic form with dimp V = n < ov; in this section, we 
pause our assumption and allow F of arbitrary characteristic. 


Proposition 5.3.1. There exists an F-algebra Clif Q with the following properties: 


(i) There is an F-linear map t: V — Clf Q such that ux) = Q(x) forall x € V; 
and 

(ii) Clf Q has the following universal property: if A is an F-algebra andi, : V —> A 
is a map such that ta(xy = Q(x) for all x € V, then there exists a unique 
F-algebra homomorphism ¢ : Clf Q — A such that the diagram 


v—+cira 
| 
K4 


Y 
A 


LA 


commutes. 
The pair (Clf Q, t) is unique up to unique isomorphism. 


The algebra Clf Q in Proposition 5.3.1 is called the Clifford algebra of Q. 


Proof. Let 
Ten V := B yed (5.3.2) 
d=0 
where 
yel :=V@---@V and V™:=F 
— amm 
d 

so that 


TenV=FOVE(VOV)E.... 


Then Ten V has a multiplication given by tensor product: for x € V®? and y € V®° we 
define 
x-y=x@ye Vode) 


(concatenate, and possibly distribute, tensors). In this manner, Ten V has the structure 
of an F-algebra, and we call Ten V the tensor algebra of V. 
Let 
I(Q) = (x8 x- Q(x): x € V} c Ten V (5.3.3) 
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be the two-sided ideal generated the elements x ® x — Q(x) for all x € V. Let 
Clf Q = Ten V/I(Q). (5.3.4) 


The algebra Clf Q by construction satisfies (i). And if 14 : V —> A is as in (ii), then 
the map (x) > t4(x) for x € V extends to a unique F’-algebra map Ten V — A; since 
further ¢4(x)* = Q(x) for all x € V, this algebra map factors through ¢: ClfQ — A. 
By abstract nonsense (taking A = ClfQ), we see that any other algebra having the 
same property as Clf(Q) is uniquely isomorphic to it, i.e., Clf Q is unique up to unique 
isomorphism. o 


Example 5.3.5. If Q: F — F is the quadratic form Q(x) = ax? with a € F, then 
Clf(F) ~ F[x]/(x? — a) (Exercise 5.6). 


Example 5.3.6. In the extreme case where Q = 0 identically, Clf Q ~ ®%}-o AV is 
canonically identified with the exterior algebra on V. 


5.3.7. Let x, y € V. Then in Clf Q, we have 


(x+y)@(xt+y)-x®x-y@®y=Q(x+ y)-Qx)- Qy) 


(5.3.8) 
x@y+y@x=T(x,y). 
In particular, x, y are orthogonal if and only if x ® y = -y 8 x. 
5.3.9. Lete], . . ., €n be an F-basis for V. Then finite tensors on these elements are an F- 


basis for Ten V. In Clf Q, by 5.3.7 we have e;®e; = O(e;) and e;®e; = T(e;, €;)—e;®e;, 
so an F-spanning set for Clf Q is given by e; ®---@e;,, with 1 <i) < i2 <+: <ia <n 
(including 1 arising from the empty tensor product), and so 


dim; CIQ) < X; 4 = 2", (5.3.10) 
d 
d=0 
It is customary to abbreviate e;, -+ @ eig = ei +++ Cig: 


Example 5.3.11. If Q = (aj, ...,a,) is diagonal in the basis e;, then 


s = (Ea; .. 


2 . za 2 2 
(eitt eia) = sent ... tae; (eitt Cig)” => Aig. 


Example 5.3.12. Suppose char F + 2 and let Q: F? — F be the quadratic form 
Q(x) = (a, b). Then by a direct calculation using 5.3.9, we find 
Clf QO = F @ Fe; @ Fez © Fejez (5.3.13) 


2 
1 


2 


=aand e5 


with multiplication e = band e2e; = —e1 e7, i.e., with i := e; and j := e2 


b 
we have identified CIf Q = (=) when a,b + 0. 
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Example 5.3.12 generalizes as follows. 


Lemma 5.3.14. The map 1: V > CIf Q is injective, and dimp Clf(Q) = 2”. 


Proof. By Lemma 4.3.1, we may choose a basis e,...,@, for V in which Q = 
{a1,**+,@y) is diagonal. Let A be the F-vector space with basis the symbols Zi; ++- Ziq 
for 1 < ij) < i2 < +++ < ig < n. In the same way as what was considered for the 
relations (2.2.2), we verify directly that there is a unique, associative multiplication 
on A such that z? = aj and 7;z; = —z;Z;. (Alternatively, this can be viewed as a 
graded tensor product; see Exercise 5.21.) The map 14: V > A by ei |} zi has 
A(x? = tad; xie = aix? ++ anx? = Q(x), so by the universal property of 


Clf(Q), there exists a unique F-algebra homomorphism ¢: CIf(Q) — A such that 


gu = ta. Since the elements z;, --- Zig are F-linearly independent in A, so too are their 
preimages e; ---e;, in ClfQ, so the spanning set given in 5.3.9 is in fact a basis and 
¢ is an isomorphism. oO 


As it will cause no confusion, we may identify V with its image (V) — Clf Q. 


5.3.15. The reversal map, given by 
rev: ClfQ > CIf Q 


(5.3.16) 
XI: OX HX, OW: Oxy 


on pure tensors (and extended F-linearly) is well-defined, as it maps the ideal /(Q) to 
itself, and so it defines an involution on Clf Q that we call the reversal involution. 


Lemma 5.3.17. The association Q +» Clf Q induces a faithful functor from the cate- 
gory of 

quadratic forms over F, under isometries 
to the category of 


finite-dimensional F-algebras with involution, under isomorphisms. 


Proof. Let Q’: V’ — F be another quadratic form and let f: V — V’ be an isometry. 
Then f induces an F-algebra map Ten V > Ten(V’) and 


f(x ®x — O(x)) = f(x) @ f(x) - OG) = f(x) FA - OF) 


so f also induces an F-algebra map Clf Q — Clf(Q’). Repeating with the inverse 
map, and applying the universal property, we see that these maps are inverse, so define 
isomorphisms. The functor is faithful because V c Clf Q, so if f: V — V acts as the 
identity on Clf Q then it acts as the identity on V, so f itself is the identity. (This can 
be rephrased in terms of the universal property: see Exercise 5.13.) o 
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5.3.18. The tensor algebra Ten V has a natural Z>ọ grading by degree, and by con- 
struction (5.3.4), the quotient Clf Q = Ten V/J(Q) retains a Z/2Z-grading 


CIf Q = cif’ Q ecif! Q 


where CIf? Q c CIfQ is the F-subalgebra of terms of even degree and Cif! Q the 
Clf° Q-bimodule of terms with odd degree. The reversal involution 5.3.15 preserves 
Clf° Q and so descends to an involution on Clf° Q. 

We call CIf? Q the even Clifford algebra and Cif! Q the odd Clifford bimodule 
of Q. The former admits the following direct construction: let 


Ten? V := B yo 
d=0 


and let 19(Q) := 1(Q) N Ten? V; then Clf? Q ~ Ten? V/1°(Q). 


5.3.19. Referring to 5.3.9, the elements e1e2,...,€n-1€n generate cif? Q as an F- 
algebra, and Clf° Q has basis €i,°** i, Where d is even (including the empty product 
1), so dimp Clf°(Q) = 2”"!. 


Example 5.3.20. Continuing Example 5.3.12, we see that the reversal involution fixes 
i, j and acts as the standard involution on Clf° Q. So the algebra Clf Q is not just a 
quaternion algebra, but one retaining a Z/2Z-grading. 


Lemma 5.3.21. The association Q ++ Clf° Q defines a functor from the category of 
quadratic forms over F, under similarities 
to the category of 
finite-dimensional F-algebras with involution, under isomorphisms. 


Proof. Let Q’: V’ — F be another quadratic form and let (f, u) be a similarity, with 
f:V-V’ and u € F*, so that uQ(x) = Q’(f(x)) for all x € V. We modify the proof 
in Lemma 5.3.17: we define a map 


Ten? V > Ten°(V’) 
x18 8 xa H UDIP f(x1) @ ++ @ faa). 
Then under this map, we have 
x@x-O(x) bu !(f(x)®@ f(x) - OX) = uw '(F(x) @ FE) - (FD) 


so 19(Q) maps to 1°(Q’), and the induced map cif? Q > cfo’) is an F-algebra 
isomorphism. o 


5.3.22. Note that unlike the Clifford functor, the even Clifford functor need not be 
faithful: for example, the map —1: F? = F? has eje |œ (—e1)(—e2) = e1e2 so acts by 
the identity on CIf? Q. 


We now come to the important immediate application. 
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5.3.23. Suppose that char F + 2 and let Q(x) = (a, b,c) be a nondegenerate ternary 
quadratic form. Then the even Clifford algebra Clf° Q is given by 


Cf Q = F ® Fi Fj ® Fij 
where i = eje2, j = e263, subject to the multiplication 


P=-ab, j?=-bc, ij +ji=0. 


cfo = (== =), 


Letting k = e3e1, we obtain symmetrically with the other two pairs of generators j, k 
= —bc, —ac —ac, —ab 
ate (2) (4) 
The reversal involution is the standard involution on CIf? Q. Letting B = cif’ Q, 
nrd|go = (ab, bc, ac) = (abc?,a?bc, abc) = abcla, b, c). 
So if disc Q(x) = abc € F”?, then nrd |go is isometric to Q. In a similar way, if 


d, 
B= 


, then in Main Theorem 5.2.5 we associate Q = nrdgo = (—a, —b, ab), and 


A —ab, ab? E a,b 
CIf Q ~ [e ) = F } (5.3.24) 


This gives another tidy proof of the bijection in Corollary 5.2.6. 
Remark 5.3.25. The even Clifford map does not furnish an equivalence of categories 


for the same reason as in 5.3.22; one way to deal with this issue is to restrict the 
isometries to those that preserve orientation: we carry this out in section 5.6. 


5.4 Splitting 


The moral of Main Theorem 5.2.5 is that the problem of classifying quaternion algebras 
depends on the theory of ternary quadratic forms over that field (and vice versa). We 
now pursue the first consequence of this moral, and we characterize the matrix ring 
among quaternion algebras. Suppose that char F + 2, but still Q: V —> F a quadratic 
form with dimp V < œ. 


Definition 5.4.1. The hyperbolic plane is the quadratic form H: F? — F defined by 
H(x, y) = xy. A quadratic form is a hyperbolic plane if it is isometric to H. 


A hyperbolic plane H is universal, its associated bilinear form has Gram matrix 


í a in the standard basis, and H has normalized form H ~ (1, -1). 
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Lemma 5.4.2. Suppose Q is nondegenerate. Then Q is isotropic if and only if there 
exists an isometry Q ~ H@Q’ with Q’ nondegenerate and H a hyperbolic plane. 


Proof. For the implication (4), we have an isotropic vector from either one of the two 
basis vectors. For the implication (=), let x € V be isotropic, so x + O and satisfy 
Q(x) = 0. Since Q is nondegenerate, there exists y € V such that T(x, y) # 0; rescaling 
y, we may assume T(x, y) = 1. If T(y, y) = 2Q(y, y) + 0, replacing y — y—2x/T(y, y) 
gives T(y, y) = 0. Thus Q restricted to Fx + Fy is isometric to H, and in particular 
is nondegenerate. Therefore letting V’ := (Fx + Fy)+ and Q’ := Qly, we have 
V = (Fx+ Fy)®V’'andQ = HEQ. o 


Lemma 5.4.3. Suppose Q is nondegenerate and let a € F*. Then the following are 
equivalent: 


(i) Q represents a; 


(ii) Q = (a) HQ’ for some nondegenerate form Q'; and 
(iii) (—a) BQ is isotropic. 


Proof. For (i) > (ii), we take Q’ = Qlw and W = {v}" c V where Q(v) = a. For 
(ii) > (iii), we note that (—a) BQ = (a, —a) Œ Q’ is isotropic. For (iii) > (i), suppose 
((-a) 8 O)(v) = 0, so O(v) = ax? for some x € F. If x = 0, then Q is isotropic and 
by Lemma 5.4.2 represents a; if x + 0, then by homogeneity Q(v/x) = a and again Q 
represents a. o 


We now come to a main result. 


b 
Main Theorem 5.4.4. Let B = (27) be a quaternion algebra over F (with char F + 2). 


Then the following are equivalent: 


1,1 
G) B = | z ) = M2(F); 


(ii) B is not a division ring; 
(iii) The quadratic form nrd = (1, —a, —b, ab) is isotropic; 


(iv) The quadratic form nrd |go = (—a, —b, ab) is isotropic; 
(v) The binary form (a, b) represents 1; 

(vi) b e Nmgır(K*) where K = F[i]; and 

(vi’) b € Nmx\p(K*) where K = F(ya). 


Condition (vi) holds if and only if there exist x, y € F such that x? — ay? = b; if K 
is nota field then K =~ F x F and Nmx\r(K*) = F™. In condition (vi’), we take F(va) 
to be a splitting field for x? — a over F, so equal to F if a € F*. (Depending on the 
circumstances, one of these formulations may be more natural than the other.) 


Proof. We follow Lam [Lam2005, Theorem 2.7]. The isomorphism (1, 1 | F) ~ Mo(F) 
in (i) follows from Example 2.2.4. The implication (i) => (ii) is clear. The equivalence 
(ii) © (iii) follows from the fact that a € B* if and only if nrd(@) € F™ (Exercise 3.5). 
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We now prove (iii) > (iv). LetO + æ € B be such that nrd(q@) = 0. Iftrd(@) = 0, then 
we are done. Otherwise, trd(a) + 0. Let 6 be orthogonal to 1, œ, so that trd(@B) = 0. 
We cannot have both af = 0 and af = (trd(@) — a) 8 = 0, so we may suppose af + 0. 
But then nrd(@f) = nrd(a) nrd(f) = 0 as desired. 

To complete the equivalence of the first four we prove (iv) > (i). Let B € B? 
satisfy nrd(B) = 0. Since nrd|go is nondegenerate, there exists 0 + œ € B® such 
that trd(@B) + 0. Therefore, the restriction of nrd to Fa @ FB is nondegenerate 
and isotropic. By Lemma 5.4.2, we conclude there exists a basis for B° such that 
nrd|go = (1,-1) Bc) = (1,-1,c); but disc(nrd|p0) = —c € F”? by 5.2.2; rescaling, 
we may suppose c = —1. But then by Proposition 5.2.4 we have B ~ (1, 1 | F). 

Now we show (iv) > (v). For a € B®, 


nrd(@) = nrd(xi + yj + zij) = —ax? — by” + abz? 


as in 5.2.2. Suppose nrd(q@) = 0. If z = 0, then the binary form (a, b} is isotropic so is 
a hyperbolic plane by Lemma 5.4.2 and thus represents 1. If z # 0 then 


y 2 p 2 
+] ele) =1. 
alee) elz) 


Next we prove (v) > (vi). If a € F”? then K ~ F x F and Nmxjr(K*) = F* 3 b. 
If a ¢ F*?, then given ax? + by? = 1 we must have y # 0 so 


(eG) aree (5) 


In the equivalence (vi) © (vi’), the two statements are identical if a ¢ F x2 and 
both automatically satisfied if a € F*?. 

To conclude, we prove (vi) > (iii). If b = x -ay € Nmgjr(K*), then a = 
x+ yi + j + 0 has nrd(a) = x? — ay? — b = 0. o 


We give a name to the equivalent conditions in Main Theorem 5.4.4. 


Definition 5.4.5. A quaternion algebra B over F is split if B ~ M2(F). A field K 
containing F is a splitting field for B if B 8r K is split. 


Example 5.4.6. The fundamental example of a splitting field for a quaternion algebra 
is that C splits the real Hamiltonians H: we have H @p C ~ M32(C) as in (2.4.1). 


Lemma 5.4.7. Let K > F be a quadratic extension of fields. Then K is a splitting field 
for B if and only if there is an injective F-algebra homomorphism K — B. 


Proof. First, suppose 1: K <> B. We may suppose that K = F(Vd) with d € F*. Let 
u = (Vd), so x? = d. Then 1 @ Vd — u @ 1 is a zerodivisor in B @p K: 


(1@ Vd -@1)1@ Vd +@1)=1@d-de@l1=0. 
By Main Theorem 5.4.4, we conclude that B @r K ~ M2(K). 
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Next we prove the converse. If B ~ Mo2(F) already, then any quadratic field K 
embeds in B (take a matrix in rational normal form) and B &r K ~ M2(K) for any K. 
So by Main Theorem 5.4.4, we may suppose B is a division ring. Let K = F(Vd). We 
have B @r K ~ M2(K) if and only if (—a, —b, ab) is isotropic over K, which is to say 
there exist x, y, z, u, v, w € F such that 


-a(x + uVd)? — b(y + vyd}? + ab(z + wVdy* = 0. (5.4.8) 


Let œ = xi + yj + zij and B = ui + vj + wij. Then trd(q@) = trd(£) = 0. Expansion 
of (5.4.8) (Exercise 5.14) shows that œ is orthogonal to 8, so trd(« £) = 0, and that 
nrd(a) + dnrd(8) = 0. Since B is a division ring, if nrd(@) = c = O then £ = 0 
so nrd(@) = 0 as well and «œ = 0, a contradiction. So nrd(8) + 0, and the element 
y= ap! = clap € B has nrd(y) = -d and trd(y) = c7! trd(a@B) = 0 so y =das 
desired. im 


b 1,b 
Example 5.4.9. If B = (33), then either a € F*? and B = (=) = M (F) is split, 


or a ¢ F”? and K = F(Ņa) splits B. 


Example 5.4.10. Let p be an odd prime and let a be a quadratic nonresidue modulo p. 


We claim that =) is a division quaternion algebra over Q. By Main Theorem 5.4.4, 


it suffices to show that the quadratic form (1, —a, —p, ap) is anisotropic. So suppose that 
t? —ax* = p(y? —az?) with t, x, y, z € Q not all zero. The equation is homogeneous, so 
we can multiply through by a common denominator and suppose that t, x, y, z € Z with 
gcd(t, x, y, z) = 1. Reducing modulo p we find £? = ax? (mod p); since a is a quadratic 
nonresidue, we must have t = x = 0 (mod p). Plugging back in and cancelling a factor 
of p we find y? = az? = 0 (mod p), and again y = z = 0 (mod p), a contradiction. 


5.5 Conics, embeddings 


Following Main Theorem 5.2.5, we are led to consider the zero locus of the quadratic 
form nrd |,o up to scaling; this gives a geometric way to view the preceding results. 


Definition 5.5.1. A conic C c P? over F is a nonsingular projective plane curve of 
degree 2. An isomorphism of conics C,C’ over F is an element f € PGL3(F) = 
Aut(P?)(F) that induces an isomorphism of curves f: C > C’. 


If we identify 
P(B?) := (B°\{0})/F* = PYF) 


with (the points of) the projective plane over F, then the vanishing locus C = V(Q) of 
Q = nrd|go defines a conic over F: if we take the basis i, j, ij for B®, then the conic C 
is defined by the vanishing of the equation 


(x, y, zZ) = nrd(xi + yj + zij) = ax = by? +abz* =0. 
y yJ J 


Here, nondegeneracy of the quadratic form is equivalent to the nonsingularity of the 
associated plane curve (Exercise 4.13). 
The following corollary is then simply a rephrasing of Main Theorem 5.2.5. 
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Corollary 5.5.2. The map B +> C = V(nrd |go) yields a bijection 


Quaternion algebras over F = Conics over F 
up to isomorphism up to isomorphism 


that is functorial with respect to F. 


Main Theorem 5.4.4 also extends to this context. 


Theorem 5.5.3. The following are equivalent: 


© B = Ma(F); 


(vii) The conic C associated to B has an F-rational point. 


By Lemma 5.4.7, a quadratic field K over F embeds in B if and only if the ternary 
quadratic form nrd |go represents 0 over K. We can also rephrase this in terms of the 
values represented by nrd | go. 


Lemma 5.5.4. Let K be a quadratic extension of F of discriminant d. Then K > B if 
and only if nrd |go represents —d over F. 


Proof. Write K = F(Vd). Then K > B if and only if there exists œ € B such that 
a? = d if and only if there exists a € B with trd(@) = 0 and nrd(q@) = —d, as claimed. 
o 


Remark 5.5.5. Two conics over F are isomorphic (as plane curves) if and only if their 
function fields are isomorphic (Exercise 5.23). 


5.6 Orientations 


To conclude, we show that the notion of orientation underlying the definition of special 
isometries (as in Example 4.5.5) extends more generally to isometries between two 
different quadratic spaces by keeping track of one bit of extra information, refining 
Main Theorem 5.2.5. We follow Knus—Murkurjev—Rost-Tignol [KMRT98, Theorem 
15.2]. We retain our hypothesis that char F + 2. 

Let Q: V > F be a quadratic space with dimp V = n odd. 


Lemma 5.6.1. Suppose Q has signed discriminant sgndiscQ = d € F*/F*?. Let 
A := CIf Q be the Clifford algebra of Q, and let K = Z(A) be the center of A. Then 
K = F[x]/(x? - d). 


The signed discriminant gives a simpler statement; one could equally well work 
with the usual discriminant and keep track of the sign. 


Proof. We do the case n = 3. We may suppose V = F? with standard basis e1, e2, e3 
and that Q ~ (a, b,c) is diagonal, with sgndisc(Q) = —abc = d. We have the relation 
ee; = —e;e; fori + j; for alli = 1,2,3, conjugation by e; acts by —1 on e; and e;e; 
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for j + i. This implies Z(A) © F + Fe e2e3. Let 6 := e1e2e3 = e2€3e1 = €3e 12; then 
ôe; = e;ô fori = 1,2,3, so Z(A) = F[6]. We compute 


O° = (e1e2€3)(e1e2€3) = e;(€2€3)(e2€3) = —abc = sgndisc(Q) = d. (5.6.2) 
Therefore K ~ F[x]/(x? — d). 

The general case is requested in Exercise 5.19: with a basis e),...,e, for V, the 

center is generated over F by ô = e1 ++- en. oO 


From now on, suppose sgndisc(Q) = d = 1. 
Definition 5.6.3. An orientation of Q is a choice of ¢ € Z(Clf Q)\F with 2? = 1. 


5.6.4. Q has exactly two choices of orientation ¢, differing by sign, by Lemma 5.6.1: 
under an isomorphism Z(Clf Q) = F x F, the two orientations are (—1, 1) and (1, —1). 
More intrinsically, given an orientation ¢, we have a projection K > K/(¢ - 1) = F, 
and conversely given a projection 7: K — F, there is a unique orientation ¢ with 
m(Z) = 1 (the other maps to —1, by F-linearity). 


Definition 5.6.5. Let 2, 2’ be orientations on Q,Q’. An isometry f: V > V’ is 
oriented (with respect to ¢, ¢’) if in the induced map f: Z(CIf Q) => Z(Clf Q’) we 
have f) = ¢. 


5.6.6. An oriented isometry is the same as a special isometry (Definition 4.5.1) when 
V = F” (n still odd), as follows. Let A = Clf Q. Let e1, . . ., en be a basis for V adapted 
as in the proof of Lemma 5.6.1 and ô = e;...e,. Then Z(A) is generated by 6 and 
& = 1. If f € O(Q)(F), then f(6) = (det f), so ¢ = +ô is preserved if and only if 
det(f) = 1, and this is independent of the choice of orientation. 

So we define the oriented or special orthogonal group of a quadratic space by 
choosing an orientation and letting 


SO(Q)(F) := {f € O(Q)(F) : f is oriented}; 


the resulting group is independent of the choice, and we recover the same group as in 
Definition 4.5.1. 


„b 
5.6.7. Let B = (5 ) be a quaternion algebra over F. In previous sections, we took 


nrd|go : B? > F, a nondegenerate ternary quadratic space of discriminant 1. Since 
we are working with the signed discriminant, we take instead —nrd|po : B? > F 
with sgndisc(— nrd|,o) = 1; this map has a nice description as the squaring map, since 
a? = —nrd(a) fora € B®. 

We claim that B° has a canonical orientation. We have an inclusion 1: B? — B 
with (x)? = —nrd(x) for all x € B°. By the universal property of Clifford algebras, 
we get an F-algebra homomorphism ø: Clf(B°) —> B. We see that ¢ is surjective so 
it induces an F-algebra map ~ : Z(Clf(B°)) > Z(B) = F (Exercise 2.8). This defines 
a unique orientation ¢g = ¢ with ¢ — 1 € kerz, by 5.6.4. 

Explicitly, let i,j,k be the standard basis for B with k = ij. Then 
nrd(k) = ab, andi, j, k is abasis for B®. Let f= ijk"! = -ijk/(ab) € Z(Clf(B°)). Then 
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& = —ab(-ab)/(ab} = 1 as in (5.6.2). Multiplying out in B, we get (2) = 1 € B, so 
é is the same orientation as in the previous paragraph. 


The following theorem then refines Main Theorem 5.2.5. 


Theorem 5.6.8. Let F be a field with char F + 2. Then the functors 


(Q, 6) Cif? o 
(—nrd|z0, Zp) GB 


yield an equivalence of categories between 


Oriented ternary quadratic forms over F with signed discriminant 1 € F*/F*?, 
under oriented isometries. 


and 


Quaternion algebras over F, under F-algebra isomorphisms. 


Proof. Let B be a quaternion algebra. As in 5.6.7, the inclusion 1: B? —> B gives 
an F-algebra homomorphism Clf(— nrd|go) — B which restricts to a canonical F- 
algebra homomorphism Clf°(— nrd|go) > B. In fact, in coordinates, this map is the 
isomorphism (5.3.24): choosing the standard basis i, j, k for B = (a,b | F), and letting 
e} =1,e2 = j,e3 = k, we have 


0 0 —ab, ab? 
Clf*(— nrd|z0) = Clf (<a, b, —ab)) = | ———— 


F 
with the standard generators iọ := e;e2 = ij and jọ := e2e3 = jk. We define the 


isomorphism 
—ab, ab? a,b 
» 
F F 


io jo > ij, jk. 


Therefore, the canonical isomorphism Clf°(—nrd|0) © B yields a natural isomor- 
phism between these composed functors and the identity functor, giving an equivalence 
of categories. 

Conversely, let (Q,£) be an oriented ternary quadratic space, let B = Clf°Q, 
and consider (— nrd |g0, ¢g). We define a natural oriented isometry between these two 
spaces. We have a natural inclusion V <> Clf Q, and we define the linear map 


me: VB 
vee ve; 
since v, € CIf! Q, we have vé € Clf?Q = B. We now show that mg induces an 


oriented isometry mz : V > B®. To do so, we let V = F? by choosing an orthogonal 
basis e1, e2, e3 in which Q = (a, b,c) and —abc = 1. We identify B ~ (—ab, —bc | F) as 
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in 5.3.23, with i = eez and j = e2e3, and we let k = ij = —be3e so k? = b(-ac) = b. 
Then ¢ = €e)e2e3 with € = +1, and 

emg(e1) = e1(@1@2€3) = aez2e3 = aj 

eme(e2) = e2(e1€2€3) = —bee3 = -k (5.6.9) 

eme(e3) = €3(€1€2€3) = ci; 
so in particular mz(V) S B®. The map is an isometry, because 

~nrd(mz(v)) = —nrd(vé) = (vő? = v? = - nrd(v) (5.6.10) 
since £* = 1 and ć is central. Finally, the map is oriented: 
mg(¢) = mz(€e1e2€3) = E(e1 f)(e20 (E32) 
= e(€aj)(—ek)(eci) = (—ac)(ijk) = (-abc)ijk"! = ijk" = fp. 


This natural oriented isometry gives a natural transformation between these composed 
functors and the identity functor, and the statement follows. o 


Remark 5.6.11. Theorem 5.6.8 can be seen as a manifestation of the isomorphism 
of Dynkin diagrams A; ~ Bı (consisting of a single node e), corresponding to the 
isomorphism of Lie algebras sl = so3. This is just one of the (finitely many) exceptional 
isomorphisms—the others are just as beautiful, with deep implications, and the reader 
is encouraged to read the bible by Knus—Merkurjev—Rost-Tignol [KMRT98, § 15]. 


We record the following important consequence. 
Corollary 5.6.12. We have Aut(B) =~ B*/F*. 


Proof. We take stabilizers of objects on both sides of the equivalence of categories in 
Theorem 5.6.8; we find Aut(B) ~ SO(Q)(F) if B corresponds to Q. But by Proposition 
4.5.10, there is an isomorphism B*/F* ~ SO(Q)(F), and the result follows. o 


Remark 5.6.13. We will return to Corollary 5.6.12 in the Skolem—Noether theorem in 
section 7.7, generalizing to the context of embeddings into a simple algebra. 


To conclude, we extend the notion of oriented isometry to similarities. 
5.6.14. Let ¢,¢’ be orientations on quadratic spaces V, V’ and suppose dim V = 
dim V’ = n = 2m is even. Then a similarity (f, u) from V to V’ induces an F-linear 


map uA" f: A"(V) > A"(V’), and we say (f, u) is oriented if the map u™ A" f 
preserves orientations. We define 


GSO(Q)(F) := {(f, u) € GO(Q)(F) : (f, u) is oriented} 


and recover the same group as in 4.5.4. If n is odd, we declare that every similarity is 
oriented and let GSO(Q)(F) := GO(Q)(F). 
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Exercises 


Throughout, let F be a field with char F + 2. 


1. 


10. 


. Show that (—1, 26)g = 1, i.e., 


Let B, B’ be quaternion algebras over F. Show that if the quadratic forms nrdg 
and nrdpg are similar, then they are isometric. 


. Consider the hyperbolic quaternions Hpac of Macfarlane (1.2.1). 


(a) Show that Hac is the Clifford algebra of (1, 1, 1) over R. 
(a) Show that Hmac is isomorphic as an algebra over C = R(V-—1) to the even 
Clifford algebra of the ternary quadratic form —VY—1{1, 1, 1). 


. Prove the implication (vi) > (v) of Main Theorem 5.4.4 directly. 


. Use Main Theorem 5.4.4(vi) to give another proof that there is no division 


quaternion algebra B over a finite field F = F, (with q odd). 


(a) Show that the quadratic form Q(x, y, z) = x? + y? + z? is isotropic over Fp 
for all odd primes p. Conclude that (—1, —1 | Fp) ~ M2(Fp). [Hint: count 
squares and nonsquares. ] 

(b) More generally, show that every ternary quadratic form over a finite field 
F4 (with q odd) is isotropic. /Hint: Reduce to the case of finding a solution 
to y? = f(x) where f is a polynomial of degree 2.] Use Main Theorem 
5.4.4(iv) to give yet another proof that there is no division quaternion 
algebra B over F4. 

(c) Show that over a finite field F4 with q odd, there is a unique anisotropic 
binary quadratic form up to isometry. 


. Show that if Q: F —> F is the quadratic form Q(x) = ax? with a € F, then 


Clf(F) = F[x]/(x? - a). 
-1,26 


= MQ). 


-1 
. Let p be prime. Show that =| = M)(Q) if and only if p = 2 orp =1 


(mod 4). 


. Show that 


-2,-3\ (-1,-1 -2, -5 =Í,=1 
[ - Jel - ) put tnar | > |- (== } 


Let B = (a, b | F) be a quaternion algebra over F. Give a constructive (algorith- 
mic) proof of the implication (iv) > (i) in Main Theorem 5.4.4, as follows. 


Let € = xi + yj + zij € B satisfy nrd(e) = —ax? — by” + abz? = -e° = 0. 


(a) Show that there exists k € {i, j, ij} such that trd(ek) = s + 0. 
(b) Let ¢ := trd(k) and n := nrd(k), and let e’ := s~'e. Let 


i = ek-—(k +t) 
j :=k+(-tk+n+1)e. 
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11. 


12. 


13. 


> 14. 


15. 


16. 


17. 


18. 


> 19 
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Show that i’, j’ generate B as an F-algebra, and that (i’)* = (j^)? = 1 and 
ji’ = -i’j’. Conclude that B ~ M2(F). 

(c) Show that J := Fe’+Fke’ is a left ideal of B with dimp J = 2, and interpret 
(b) as arising from the left multiplication map B — Endr(/) =~ M2(F). 


Let B be a quaternion algebra over F. Let Q be the reduced norm on B, and for 
clarity write eg = 1, ey =i, e2 = j, e3 = k as a basis for the domain of Q. 


(a) Let C? = Clf° Q be the even Clifford algebra of the reduced norm Q. Show 
that Z(C°) ~ F x F. [Hint: Z(C°) is generated by ene, e2€3.] 

(b) Show that C° ~ B x B°?(~ B x B) as F-algebras. 

(c) Prove that if B’ is a quaternion algebra over F then B ~ B’ are isomorphic 
as F-algebras if and only if the reduced norms Q ~ Q’ are similar as 
quadratic spaces. 


Let Q: V — F be a nondegenerate quadratic form. Show that the reversal map 
T; CI? Q — CIf? Q on the Clifford algebra has the property that xx € F for all 
pure tensors x = e1é2:--- eq, but defines a standard involution on Clf Q if and 
only if V = {0} and on cif? Q if and only if dimp V < 3. 

Give another proof of Lemma 5.3.17 using the universal property of the Clifford 
algebra. 


Expand (5.4.8) and prove as a consequence that if a = xi + yj + zij and 
B = ui + vj + wij, then trd(a@B) = 0 (so a is orthogonal to 8) and moreover 
nrd(q@) + d nrd(8) = 0. 


Show that the Hilbert symbol is Galois equivariant, in the following sense: for 
all field automorphisms o € Aut(F) and all a,b € F*, we have (o (a), o(b))F = 
(a, b)F. 

Let a,b, b’ € F*. Show that there exists an F-linear isomorphism ¢: (a,b | 
F) > (a,b' | F) with ¢@ = i’ if and only if b/b’ € Nmxjr(K*) where 
K = F(Ņa). [More generally, see Corollary 7.7.6.] 

Let a € Q*\Q*?. Show that there are infinitely many distinct isomorphism 
classes of conics x? — ay” = bz? for b € Q*. 


Let K = F(a,b) with a, b algebraically independent, transcendental elements. 
„b 
Show that the generic quaternion algebra (=) is a division algebra. [Hint: 


show the associated ternary quadratic form is anisotropic. ] 


Prove Lemma 5.6.1 for general odd n as follows. 


_with ij <- < 


r 


(a) For a subset J = {ij,...,i,} G {1,... 7}, let ez = e;, --- e; 
i,. Then for subsets J, J C {1,..., n}, show that 


#I-#J-#(IOJ 
ejej = ejer(-1) Con, 


(b) Show that Z(CIf Q) = F[ô] = F[x]/(x? — d) where 6 = ejez ... en and 
d = sgndisc(Q). [Hint: Argue on bases and choose #J = 2 with IAJ = 1.] 
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20. Let Q: V > F be a quadratic form. Show that the even Clifford algebra Clf° Q 
with its map ı: V@V > Clf? Q has the following universal property: if A is an 
F-algebra and t4 : V @ V —> A is an F-linear map such that 


(i) ta(x ® x) = Q(x) for all x € V, and 
(ii) ta(x 8 yla (y 8 z) = Q(y)ta(X 8 z) for all x, y,z € V, 


then there exists a unique F-algebra homomorphism ¢ : Clf° Q — A such that 
the diagram 
vev —= fo 
| 
K4 


Y 
A 


LA 


commutes. Conclude that the pair (CIF? Q, 1) is unique up to unique isomorphism. 


21. In this exercise, we consider graded tensor products, giving an alternate verifi- 
cation of Lemma 5.3.14. 
Let A = Ao © A; and B = Bo © B, be finite-dimensional F-algebras equipped 
with a Z/2Z-grading. We define the graded tensor product A@B to be the usual 
tensor product as an F-vector space but with multiplication law defined on 
simple tensors by 


(a@b)-(a' 8b’) = (-1) 88 OH) aa’ @ bb’). 


(a) Show that AQB is an F -algebra of dimension (dimp A)(dim p B). 

(b) Let Qı: Vi > F and Q2: Vz —> F, and let Q := Q1 E Q2 be the orthogonal 
direct sum on the quadratic space V := Vi & V2. Show there is a canonical 
isomorphism of Clifford algebras 


Clf(Q) = Cl£(Q1)@ Cl£(Q2). 
(c) Observe that (b) gives another proof of Lemma 5.3.14. 


22. For i = 1,2, let Q;: V; —> F be quadratic forms over F. 


(a) Prove that there exists a canonical R-algebra isomorphism 


Clf°(Q1 Œ Q2) > (Clf°(Q1) 8 Clf°(Q2)) & (CIF! (Q1) $ Clf!(Q»)) 


where Clf! (Q,)®CIf ; (Q2)has multiplication induced from the full Clifford 
algebras Clf(Q1) and Clf(Q2). 
(b) Prove that there is a clf°(O, Q>)-bimodule isomorphism 


Clf'(Q Œ Q2) > (CIf°(Q1) 8 Clf!(Q2)) & (CIF! (Q1) 8 Clf°(Q»)) 


with bimodule structure induced by multiplication in the full Clifford 
algebra. 
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23. In this exercise, we assume background in algebraic curves. Show that two 
conics over F are isomorphic (as projective plane curves) if and only if their 
function fields are isomorphic. [ Hint: conics are anticanonically embedded—the 
restriction of @p2(—1) to the conic is a canonical sheaf—so an isomorphism of 
function fields induces an linear isomorphism of conics.] 


This chapter is licensed under the terms of the Creative Commons Attribution- 
NonCommercial 4.0 International License (http://creativecommons.org/licenses/by- 
nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and 
reproduction in any medium or format, as long as you give appropriate credit to the 
original author(s) and the source, provide a link to the Creative Commons license and 
indicate if changes were made. 

The images or other third party material in this chapter are included in the chapter’s 
Creative Commons license, unless indicated otherwise in a credit line to the material. If 
material is not included in the chapter’s Creative Commons license and your intended 
use is not permitted by statutory regulation or exceeds the permitted use, you will need 
to obtain permission directly from the copyright holder. 
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Chapter 6 


Characteristic 2 


In this chapter, we extend the results from the previous four chapters to the neglected 
case where the base field has characteristic 2. Throughout this chapter, let F be a field 
with algebraic closure F”. 


6.1 Separability 


To get warmed up, we give a different notation (symbol) for quaternion algebras that 
holds in any characteristic and which is convenient for many purposes. 


Definition 6.1.1. Let A be a commutative, finite-dimensional algebra over F. We say 
A is separable if 
A@p F” = F” x... x F”; 


otherwise, we say A is inseparable. 


Example 6.1.2. If A =~ F[x]/(f()) with f(x) € F[x], then A is separable if and only 
if f has distinct roots in F”. 


6.1.3. If char F + 2, and K is a quadratic F-algebra, then after completing the square, 
we see that the following are equivalent: 


(i) K is separable; 

(ii) K = F[x]/(x? — a) with a + 0; 
(iii) K is reduced (K has no nonzero nilpotent elements); 
(iv) K isa field or K = F x F. 


6.1.4. If char F = 2, then a quadratic F-algebra K is separable if and only if 
K = F[x]/(x? + x +a) 


for some a € F. A quadratic algebra of the form K = F[x]/(x? + a) with a € F is 
inseparable. 


Now we introduce the more general notation. 
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6.1.5. Let K be a separable quadratic F-algebra, and let b € F*. We denote by 
K,b 
—]:=Ke@Kj 


the F-algebra with basis 1, j as a left K-vector space and with the multiplication rules 
j? = band ja = gj for a € K, where “ is the standard involution on K (the nontrivial 
element of Gal(K | F) if K is a field). We will also write (K, b | F) for formatting. 


From 6.1.3, if char F + 2 then writing K = F[x]/(x? — a) we see that 


ig 


is a quaternion algebra over F. The point is that we cannot complete the square in 
characteristic 2, so the more general notation gives a characteristic-independent way 
to define quaternion algebras. In using this symbol, we are breaking the symmetry 
between the standard generators i, j, but otherwise have not changed anything about 
the definition. 


6.2 Quaternion algebras 


Throughout the rest of this chapter, we suppose that char F = 2. (We will occasionally 
remind the reader of this supposition, but it is meant to hold throughout.) 


Definition 6.2.1. An algebra B over F (with char F = 2) is a quaternion algebra if 
there exists an F-basis 1, i, j, k for B such that 
i +i=a, j =b, and k=ij=j(i+1) (6.2.2) 
witha € F and b € F*. 
Just as when char F + 2, we find that the multiplication table for a quaternion 
algebra B is determined by the rules (6.2.2), e.g. 
jk = j(ij)=(ij+j)j=bi+b=kj+b. 


a,b 


We denote by =| or [a,b | F) the F-algebra with basis 1,7, j,ij subject to the 


,b 
multiplication rules (6.2.2). The algebra s$) is not symmetric in a, b (explaining the 


choice of notation), but it is still functorial in the field F. 
If we let K = F[i] = F[x]/(x? + x + a), then 


FF) 


and our notation extends that of Section 6.1. 
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Example 6.2.3. The ring M2(F) of 2 x 2-matrices with coefficients in F is again a 
quaternion algebra over F, via the isomorphism 


1,1 
[=] > mw 


oP 1) (0 1 
BIT ly all. of" 


Lemma 6.2.4. An F-algebra B is a quaternion algebra if and only if there exist F- 
algebra generators i, j € B satisfying 


P+iza,j?=b, and ij = j(i +1). (6.2.5) 
Proof. Proven the same way as Lemma 2.2.5. oO 


6.2.6. Let B = [a, b | F) be a quaternion algebra over F. Then B has a (unique) standard 
involution” : B —> B given by 


@=tt+xityjtujeoea=xtaz=(t+x)+xityj +z 


since 

aÑ = (t+ xi + yj +zij((t+x)+xi+yj+zij) 
i 3 5 5 (6.2.7) 
=t +tx+ax* + byf + byz+abz° € F. 


Consequently, one has a reduced trace and reduced norm on B as in Chapter 3. 


We now state a version of Theorem 3.5.1 in characteristic 2; the proof is similar 
and is left as an exercise. 


Theorem 6.2.8. Let B be a division F-algebra with a standard involution that is not 
the identity. Then either B is a separable quadratic field extension of F or B is a 
quaternion algebra over F. 


Proof. Exercise 6.9. (This theorem is also implied by Theorem 6.4.1.) o 


6.3 =» Quadratic forms 


We now turn to the theory of quadratic forms over F with char F = 2. The basic 
definitions from section 4.2 apply. For further reference, Grove [Grov2002, Chapters 
12-14] treats quadratic forms in characteristic 2, and the book by Elman-Karpenko- 
Merkurjev [EKM2008, Chapters I-II] discusses bilinear forms and quadratic forms in 
all characteristics. 

Let Q: V > F be a quadratic form with dimp V = n < œ and associated bilinear 
form T. Then T(x, x) = 2Q(x) = 0 for all x € V, so one cannot recover the quadratic 
form from the symmetric (equivalently, alternating) bilinear form. 


6.3.1. We begin with the definition of the discriminant. When n is even, we simply 
define disc(Q) = det(T) € F/F**—this is equivalent to Definition 4.3.3 when char F # 
2, having absorbed the square power of 2. 
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When n is odd, the symmetric matrix T always has determinant 0 (Exercise 6.8); 
we need to “divide this by 2”. So instead we work with a generic quadratic form, as 
follows. Consider the quadratic form 


univ PEE = 2 2 
Qh (X1,.--, Xn) = >} AijXiXj = A11 X] + 412X1X2 + +++ + AnnXp, (6.3.2) 
lsi<j<n 


over the field FU’ := Q(aij)i,j=1,....n (now of characteristic zero!) with a;; transcen- 
dental elements. We compute its universal determinant 


2a a\2 Lae ain 
det((T"™)) = detl e re laili; (6.3.3) 
= : : oa : 1J44,J a 
ain an feo 2ann 


as a polynomial with integer coefficients. We claim all of these coefficients are even: 
indeed, reducing modulo 2 and computing the determinant over F2(a;;);,;, we recall 
that the determinant of an alternating matrix of odd size is zero (over any field). 
Therefore, we may let 


6(a11,-++,@nn) := det(T™™)/2 € Zlaisli; (6.3.4) 
be the universal (half-)discriminant. We then define 
disc(Q) := 6(Q(e1), T(e1, e2), -.-,O(en)) € F/F? 
by specialization. Repeating the argument in 4.3.2, if t;; € F and e; := J; tije; then 
5(O(e4), Tle}, e), -- - CE) = Elei), T(e1, €2), «- -, Olen) det(tij)” 


(verified universally!) so disc(Q) is well-defined. Moreover, this definition agrees with 
Definition 4.3.3 when char F # 2. 


Example 6.3.5. For example, disc((a)) = a for a € F, and if 
Q(x, y, z) = ax? + by? + cz? + uyz + vxz + wxy 
with a, b, c, u, v,w € F, then 
disc(Q) = 4abc + uvw — au? — bv? — cw? 
in all characteristics. 


Definition 6.3.6. We say Q is nondegenerate if disc(Q) + 0. 


Next, even though not every quadratic form over F can be diagonalized, so we will 
also make use of one extra form: for a,b € F, we write [a, b] for the quadratic form 
ax? + axy + by? on F?. 
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Lemma 6.3.7. There exists a basis of V such that 


Q ~ [a;, bı] m ... 8[am, bm] ®© (C1, . - -p Cr) (6.3.8) 


with ai, bi, cCjE& F. 


Proof. Exercise 6.11. o 


We say that a quadratic form Q is normalized if Q is presented with a basis as in 
(6.3.8). 


Example 6.3.9. The quadratic forms Q(x, y, z) = x? + yz + z? and Q(x, y, z) = x? + 
y? + z? are normalized over F>, but the quadratic form Q(x, y, z) = xz + yz + z? is not. 


Example 6.3.10. For a normalized quadratic form as in (6.3.8), 


disc(Q) = disc([a1, b1] 8 -++ Bam, bm]) disc(<c1, ..., ¢-)) 


= (a1 +- am}? disc((c},...,¢,)). 


In F/F”, we have 


0, ifr > 2; 
disc({c1, ae ope) = \ C], ifr = l; 
l, ifr=0. 


Therefore, Q is nondegenerate if and only if a, ---a,c,---c,; + 0Oandr <1. 


a,b 


Example 6.3.11. Let B = 
basis for B, and by (6.2.7), 


be a quaternion algebra. Then 1, i, j, ij is anormalized 


nrd = [1, a] e [b, ab], 


so disc(nrd) = b? so nrd is nondegenerate. 


6.4 » Characterizing quaternion algebras 


We now consider the characterization of quaternion algebras as those equipped with 
a nondegenerate standard involution (revisiting Main Theorem 4.4.1, but now with 
char F = 2). 


Theorem 6.4.1. Let B be an F-algebra (with char F = 2). Then B has a nondegenerate 
standard involution if and only if one of the following holds: 


(i) B= F; 
(ii) B = K is a separable quadratic F-algebra; or 


(iii) B is a quaternion algebra over F. 
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Proof. If B = F, then the standard involution is the identity, and nrd is nondegenerate 
on F because the reduced (half-)discriminant of the quadratic form nrd(x) = x? is 1. 

If dimp B = 2, then B = K has a unique standard involution (Lemma 3.4.2). By 
6.1.4, we see that the involution is nondegenerate if and only if K is separable. 

So suppose dimp B > 2. Since B has a nondegenerate standard involution, there 
exists an element i € B such that T(i, 1) = trd(i) + 0. We have i ¢ F since trd(F’) = {0}. 
Rescaling we may suppose trd(i) = 1, whence i? = i +a for some a € F, and 
nrd|rF+ri = [1,a]. (We have started the proof of Lemma 6.3.7, and 1,i is part of a 
normalized basis, in this special case.) 

By nondegeneracy, there exists j € {1,i}+ such that nrd(j) = b + 0. Thus trd(j) = 0 
so j = j and j? = b € F*. Furthermore, 


0 =trd(ij)=ij+ji=ij=+j(i+1) 
soij = j(i + 1). Therefore i, j generate an F-subalgebra A ~ [a,b | F). 


The conclusion of the proof follows exactly as in (4.4.3): if k € {1,i, j,ij}+ then 
k(ij) = k(ji), a contradiction. o 


Corollary 6.4.2. Let B be a quaternion algebra over F, and suppose that K £ B is a 
commutative separable F-subalgebra. Then dimp K < 2. Moreover, if K + F, then the 
centralizer of K* in B* is again K*%. 


Next, we characterize isomorphism classes of quaternion algebras in characteristic 
2 in the language of quadratic forms. 
6.4.3. Let B be a quaternion algebra over F. We again define 
B® := {a € B : trd(a) = 0} = {1}+. (6.4.4) 
But now B° = F 6 Fj ® Fk and in this basis 
nrd(x + yj + zij) = x? + by? + byz + abz? (6.4.5) 


so nrd |go ~ (1) æ [b, ab]. The discriminant is therefore 


disc(nrd |go) = b* = 1 € F*/F*?. (6.4.6) 


Theorem 6.4.7. Let F be a field with char F = 2. Then the functor B |> nrd|po yields 
an equivalence of categories between 


Quaternion algebras over F, 
under F-algebra isomorphisms 


and 


Ternary quadratic forms over F with discriminant 1 € F*/F”?, 
under isometries. 
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Proof. We argue as in Theorem 5.6.8 but with char F = 2. The argument here is easier, 
because all sign issues go away and there is no orientation to chase: by Exercise 6.12, 
there is a unique ¢ € Clf! Q\F such that ¢? = 1. The inclusion :: B? <> B induces 
a surjective F-algebra homomorphism Clf°(nrd|g0) —> B, so by dimensions it is an 
isomorphism; this gives one natural transformation. In the other direction, the map 
mg: V > B® by v & vć is again an isometry by (5.6.10), giving the other. 

Here is a second direct proof. By 6.4.3, the quadratic form nrd | go has discriminant 
1. To show the functor is essentially surjective, let Q: V — F be a ternary quadratic 
form with discriminant 1 € F*/F**. Then Q = (u) a [b,c] for some u, b,c € F. We 
have disc(Q) = ub? = 1 € F”? so b € F* and u € F°. Rescaling the first variable, 
we obtain Q ~ (1) æ [b, c]. Thus by 6.4.3, Q arises up to isometry from the quaternion 


b 
=| with a = cb"!. 
F 


algebra 


For morphisms, we argue as in the proof of Proposition 5.2.4 but with char F = 2. In 
one direction, an F-algebra isomorphism B ~> B’ induces an isometry B? ~> (B^)? by 
uniqueness of the standard involution. Conversely, let f: B? > (B’)® be an isometry. 

a,b 


Let 8 = | 
F 


} Extend f to an F-linear map B > B’ by mapping i b™! f(ij) f(j). 


The map f preserves 1: it maps F to F by Exercise 6.15, since F = (B°)+ = (B’)), 
and 1 = nrd(1) = nrd(f(1)) = far so f(1) = 1. We have fUY = nrd(f(j)) = 
nrd(j) = b and similarly f(ij)* = ab since j,ij € B’. Thus 
1 = trd(i) = b7! tra((ij)j) = Tj, j) = 

= DTEC), FG) = edb" FSG) = td FO) 

and similarly nrd( f (i)) = nrd(i) = a, thus HOK + f(i) +a = 0. Finally, 
FOLD =D FDSID = fG) 
and 
FOFO = OT FDALDFD = O'FDADLFG) + TEO, FED) 
= fij) + fO) = (FO + DID = FOLD 


so f isan isomorphism of F-algebras. Therefore the functor is full and faithful, yielding 
an equivalence of categories. o 


Corollary 6.4.8. The maps B +> Q = nrd |go }> C = V (Q) yield bijections 


Nondegenerate ternary 
Quaternion algebras over F quadratic forms over F 
| up to isomorphism | with discriminant 1 € F*/F*? 
up to isometry 
Nondegenerate ternary 
e | quadratic forms over F | 
up to similarity 
Conics over F 
| up to isomorphism | 
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that are functorial with respect to F. 


Proof. The remaining parts of the bijection follow as in the proof of Corollary 5.2.6. 
o 


We now turn to identifying the matrix ring in characteristic 2. 
Definition 6.4.9. A quadratic form H: V > F is a hyperbolic plane if H ~ [1,0]. 


Recall that [1,0] : F? — F is given by the quadratic form x? + xy = x(x + y), 
visibly isometric to the quadratic form xy. Definition 6.4.9 agrees with Definition 5.4.1 
after a change of basis. 


Lemma 6.4.10. If Q is nondegenerate and isotropic then Q ~ H a Q’ with H a hyper- 
bolic plane. 


Proof. We repeat the proof of Lemma 5.4.2. o 


We may again characterize division quaternion algebras by examination of the 
reduced norm as a quadratic form as in Main Theorem 5.4.4 and Theorem 5.5.3. 


a, b 


Theorem 6.4.11. Let B = | F 


) (with char F = 2). Then the following are equivalent: 


1,1 
G) B = | T ) = MxF); 


(ii) B is not a division ring; 


(iii) The quadratic form nrd is isotropic; 
(iv) The quadratic form nrd | go is isotropic; 
(v) The binary form [1, a] represents b; 
(vi) b € Nmx\r(K*) where K = F[i]; and 
(vii) The conic C := V(nrd| go) C P? has an F-rational point. 


Proof. Only condition (v) requires significant modification in the case char F = 2; see 
Exercise 6.13. oO 


Lemma 6.4.12. Let K > F be a quadratic extension of fields. Then K is a splitting 
field for B if and only if there is an injective F-algebra homomorphism K — B. 


Proof. lft: K —> Band K = F(a), then 1 ®a@ — (a) @ 1 is a zerodivisor in B 8f K, 
since 


(@a-Ua)@1)\1®a-Ula)@l1)=0, (6.4.13) 


and so B 8r K ~ Mo(K) and K is a splitting field. 
Conversely, let K = F(q@) and suppose B @r K =~ M2(K). If B ~ Mo(F), we can 
take the embedding mapping « to a matrix with the same rational canonical form. So 


a, 


we suppose that B = is a division ring. By Theorem 6.4.1 1(iv) (over K) and 
6.4.3, there exist x, y, z, u,v, w € F not all zero such that 


(x + ua) + by + va)’ +b + va)(z + wa) + ab(z + wa)? = 0; (6.4.14) 
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expanding and rewriting into the powers of œ gives 
(u? + by? + byw + abw°)a? + (vz + wy)ba + (x? + by? + byz + abz”) = 0. (6.4.15) 


Let B:= x + yj + zij and y := u + vj + wij. Then y € B*, since y = 0 implies 
nrd(£) = 0 and yet B is a division ring. Then the equation (6.4.15) can be written 


nrd(y)a? + trd(By)a + nrd(B) = 0; 
then a direct calculation shows that the element 
u = By! = nrd(y) ‘By 
satisfies the same equation as (6.4.15) in the variable œ, so there is an embedding 


K => B defined bya > wu. o 


Exercises 


Throughout these exercises, we let F be a field (of any characteristic, unless specified). 


1. Recall the primitive element theorem from Galois theory: if K 2 F is a separable 
field extension of finite degree, then there exists œ € K such that K = F(a)— 
and hence K ~ F[x]/(f(x)) where f(x) € F[x] is the minimal polynomial of a. 
Extend this theorem to algebras as follows. Let B be a separable, commutative, 
finite-dimensional F-algebra. Show that B ~ F[x]/(f(x)) for some f(x) € F[x]. 


> 2. Let B be a quaternion algebra over F and let K C B be a separable quadratic 


K,b 
F-algebra. Show that there exists b € F* such that B = (=) (as in 6.1.5). 


3. Let F5? be a separable closure of F and let B be a quaternion algebra over F. 
Show that B @r FP ~ M2 (FP). [More generally, see Exercise 7.24.] 


K,b 
> 4. Let K be a separable quadratic F-algebra and let u, b € F*. Show that F ) ~ 


K,ub 
| A ) if and only if u € nrd(K*) = Nmxyp(K®). 


5. Let B be a quaternion algebra over F, and let Ko 2 F be a quadratic field. Prove 
that there exists a separable extension K 2 F linearly disjoint from Kọ over F 
(i.e., K 8r Ko is a domain) such that K splits F. 

6. Suppose char F = 2 and let a € F and b € F”. 


a,ab 
r) 
a,b S 
2) = 
a,b 


F ) be a quaternion algebra over F. Compute the 


ifa #0. 


b 
(a) Show that $ ) ~ 
F 


(b) Show that if t € F and u € F*, then 


a+ (t +t”), bu? 
F . 


7. Let char F = 2 and let B = 


left regular representation 2: B — Endx(B) ~ Mo(K) where K = F[i] as in 
2.3.8. 
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> 8. Suppose char F = 2. Let M € M,,(F) be a symmetric matrix with n odd, and 
suppose that all diagonal entries of M are zero. Show that det M = 0. 


> 9. Let char F = 2 and let B be a division F-algebra with a standard involution. 
Prove that either the standard involution is the identity (and so B is classified by 
Exercise 3.9), or that the conclusion of Theorem 3.5.1 holds for B: namely, that 
either B = K is a separable quadratic field extension of F or that B is a quaternion 
algebra over F. [Hint: Replace conjugation by i by the map $(x) = ix + xi, and 
show that ¢ = ¢. Then diagonalize and proceed as in the case char F + 2.] 


> 10. Let char F = 2. Show that the even Clifford algebra Clf° Q of a nondegenerate 
ternary quadratic form Q: V — F is a quaternion algebra over F. 


> 11. Prove Lemma 6.3.7, that every quadratic form over F with char F = 2 has a 
normalized basis. 


> 12. Let char F = 2 and let Q: V — F be a quadratic form over F with discriminant 
d € F*/F*? and dimp V = n odd. Show that Z(Clf Q) ~ F[x]/(x? — d) and that 
there is a unique ¢ € Z(Clf Q) N Clf! Q such that £7 = 1. 


> 13. Prove Theorem 6.4.11. 


> 14. Let Q := Q’ m Q” be an orthogonal sum of two anisotropic quadratic forms over 
F (with F of arbitrary characteristic). Show that Q is isotropic if and only if 
there exists c € F* that is represented by both Q’ and —-Q”. 


> 15. Let B be a quaternion algebra over F (with F of arbitrary characteristic). Show 
that F = (B®). 


> 16. Prove Wedderburn’s little theorem in the following special case: a quaternion 
algebra over a finite field with even cardinality is not a division ring. [Hint: See 
Exercise 3.16.] 


This chapter is licensed under the terms of the Creative Commons Attribution- 
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nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and 
reproduction in any medium or format, as long as you give appropriate credit to the 
original author(s) and the source, provide a link to the Creative Commons license and 
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Chapter 7 


Simple algebras 


In this chapter, we return to the characterization of quaternion algebras. We initially 
defined quaternion algebras in terms of generators and relations in Chapter 2; in the 
chapters that followed, we showed that quaternion algebras are equivalently noncom- 
mutative algebras with a nondegenerate standard involution. Here, we pursue another 
approach, and we characterize quaternion algebras in a different way, as central simple 
algebras of dimension 4. 


7.1 > Motivation and summary 


Consider now the “simplest” sorts of algebras. Like the primes among the integers or 
the finite simple groups among finite groups, it is natural to seek algebras that cannot 
be “broken down” any further. Accordingly, we say that a ring A is simple if it has no 
nontrivial two-sided (bilateral) ideals, i.e., the only two-sided ideals are {0} and A. To 
show the power of this notion, consider this: if ø: A — A’ is a ring homomorphism and 
A is simple, then ¢ is either injective or the zero map (since ker ¢ C B is a two-sided 
ideal). 

A division ring A is simple, since every nonzero element is a unit and therefore 
every nonzero ideal (left, right, or two-sided) contains | so is equal to A. In particular, 
a field is a simple ring, and a commutative ring is simple if and only if it is a field. 
The matrix ring M,,(F) over a field F is also simple, something that can be checked 
directly by multiplying by matrix units (Exercise 7.5). 

Moreover, quaternion algebras are simple. The shortest proof of this statement, 
given what we have done so far, is to employ Main Theorem 5.4.4 (and Theorem 
6.4.11 in characteristic 2): a quaternion algebra B over F is either isomorphic to 
M2(F) or is a division ring, and in either case is simple. One can also prove this 
directly (Exercise 7.1). 

Although the primes are quite mysterious and the classification of finite simple 
groups is amonumental achievement in group theory, the situation for algebras is quite 
simple, indeed! Our first main result is as follows (Main Theorem 7.3.10). 
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Theorem 7.1.1. (Wedderburn—Artin). Let F be a field and B be a finite-dimensional 
F-algebra. Then B is simple if and only if B ~ M,(D) where n > 1 and D is a 
finite-dimensional division F-algebra. 


As a corollary of Theorem 7.1.1, we give another characterization of quaternion 
algebras. 


Corollary 7.1.2. Let B be an F-algebra. Then the following are equivalent: 


(i) B is a quaternion algebra; 
(ii) B 8r F” ~ M>(F*), where F” is an algebraic closure of F; and 
(iii) B is a central simple algebra of dimension dimp B = 4. 


Moreover, a central simple algebra B of dimension dimp B = 4 is either a division 
algebra or has B ~ M2(F). 


This corollary has the neat consequence that a division algebra B over F is a 
quaternion algebra over F if and only if it is central of dimension dimp B = 4. 

For the reader in a hurry, we now give a proof of this corollary without invoking the 
Wedderburn—Artin theorem; this proof also serves as a preview of some of the ideas 
that go into the theorem. 


Proof of Corollary 7.1.2. The statement (i) > (ii) was proven in Exercise 2.4(d). 

To prove (ii) > (iii), suppose B is an algebra with B”?! := B @p F” ~ M2(F”). The 
F*!_algebra B” is central simple, from above. Thus Z(B) = Z(B”) N B = F. And if I 
is a two-sided ideal of B then /*! := I &p F” is a two-sided ideal of B”, so J?! = {0} 
or /*! = B* is trivial, whence J = J”! N F is trivial. Finally, dimp B = dimpa B® = 4. 

Finally, we prove (iii) > (i). Let B a central simple F-algebra of dimension 4. If 
B is a division algebra we are done; so suppose not. Then B has a nontrivial left ideal 
(e.g., one generated by a nonunit); let {0} © Z Ç B be a nontrivial left ideal with 
0 < m = dimp I minimal. Then there is a nonzero homomorphism B — Endr (I) = 
M,(F) which is injective, since B is simple. By dimension, we cannot have m = 1; 
if m = 2, then B ~ M)(F) and we are done. So suppose m = 3. Then by minimality, 
every nontrivial left ideal of B has dimension 3. But for any a € B, we have that Ja is 
a left ideal, so the left ideal J N Ja is either {0} or J; in either case, Ja C J and T is a 
right ideal as well. But this contradicts the fact that B is simple. o 


The Wedderburn—Artin theorem is an important structural result used throughout 
mathematics, so we give in this chapter a self-contained account of its proof. More 
generally, it will be convenient to work with semisimple algebras, finite direct products 
of simple algebras. When treating ideals of an algebra we would be remiss if we did 
not discuss more generally modules over the algebra, and the notions of simple and 
semisimple module are natural concepts in linear algebra and representation theory: 
a semisimple module is one that is a direct sum of simple modules (“completely 
reducible”), analogous to a semisimple operator where every invariant subspace has 
an invariant complement (e.g., a diagonalizable matrix). 

The second important result in this chapter is a theorem that concerns the simple 
subalgebras of a simple algebra, as follows (Main Theorem 7.7.1). 
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Theorem 7.1.3. (Skolem—Noether). Let A, B be simple F-algebras and suppose that 
B is central. Suppose that f, g: A — B are homomorphisms. Then there exists B € B 
such that f(a) = B-'g(a)B for alla € A. 


Corollary 7.1.4. Every F-algebra automorphism of a simple F -algebra B is inner, i.e., 
Aut(B) = BX/F™. 


Just as above, for our quaternionic purposes, we can give a direct proof. 


Corollary 7.1.5. Let B be a quaternion algebra over F and let K4, Ky C B be quadratic 
subfields. Suppose that 6: Kı — Ka is an isomorphism of F-algebras. Then ¢ lifts to 
an inner automorphism of B, i.e., there exists B € B such that œz = ¢(a1) = p'ap 
for all a, € Ky. In particular, Ky = p~! K; B. 


Proof. Write K; = F(a) with a; € B and let an = (œ) € K2 C B, so K) = F(a). 

We want to find 6 € B* such that a2 = B~'a)f. In the special case B ~ M>(F), then 

a1, @2 E€ M2(F) satisfy the same irreducible characteristic polynomial, so by the theory 

of rational canonical forms, a2 = B~!a) where 8 € B* ~ GL2(F) as desired. 
Suppose then that B is a division ring. Then the set 


W ={BEB: Bar = qf} (7.1.6) 


is an F’-vector subspace of B. Let F**P be a separable closure of F. (Or, apply Exercise 
6.5 and work over a splitting field K linearly disjoint from Kı ~ K2.) Then we have 
B @p FP ~ Mo(F*?), and the common characteristic polynomial of a1, a2 either 
remains irreducible over F? (if K > F is inseparable) or splits as the product of two 
linear factors with distinct roots. In either case, the theory of rational canonical forms 
again applies, and there exists 8 € B @p FP ~ GLo(F**) that will do; but then by 
linear algebra dim psp W 8r F5? = dimp W > 0, so there exists B € B \ {0} = B* with 
the desired property. o 


As shown in the above proof, Corollary 7.1.5 can be seen as a general reformulation 
of the rational canonical form from linear algebra. 


7.2 Simple modules 


Basic references for this section include Drozd—Kirichenko [DK94, §1-4], Curtis— 
Reiner [CR81, §3], Lam [Lam2001, §2—3], and Farb—Dennis [FD93, Part I]. An ele- 
mentary approach to the Weddernburn—Artin theorem is given by BreSar [Bre2010]. 
An overview of the subject of associative algebras is given by Pierce [Pie82] and 
Jacobson [Jacn2009]. 

Throughout this chapter, let B be a finite-dimensional F-algebra. 

To understand the algebra B, we look at its representations. A representation of 
B (over F) is a vector space V over F together with an F-algebra homomorphism 
B — Endr (V). Equivalently, a representation is given by a left (or right) B-module V: 
this is almost a tautology. Although one can define infinite-dimensional representations, 
they will not interest us here, and we suppose throughout that dimp V < ov, or 
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equivalently that V is a finitely generated (left or right) B-module. If we choose a 
basis for V, we obtain an isomorphism Endr (V) ~ M,,(F) where n = dimp V, so a 
representation is just a homomorphic way of thinking of the algebra B as an algebra 
of matrices. 


Example 7.2.1. The space of column vectors F” is a left M,(F’)-module; the space of 
row vectors is a right M,,(F’)-module. 


Example 7.2.2. B is itself a left B-module, giving rise to the left regular representa- 
tion B — Endr (B) over F (cf. Remark 3.3.8). 


Example 7.2.3. Let G be a finite group. Then a representation of F[G] (is the same as 
an F[G]-module which) is the same as a homomorphism G — GL(V), where V is an 
F-vector space (Exercise 3.8). 


Definition 7.2.4. Let V be a left B-module. We say V is simple (or irreducible) if 
V + {0} and the only B-submodules of V are {0} and V. 

We say V is indecomposable if V cannot be written as V = V; ®V2 with V1, V2 + {0} 
left B-modules. 


A simple module is indecomposable, but the converse need not hold, and this is a 
central point of difficulty in understanding representations. 


a 


Example 7.2.5. If B = Ile 


J : abcer] C M2(F), then the space V = F? of 


column vectors is not simple, since the subspace spanned by r is a B-submodule; 
nevertheless, V is indecomposable (Exercise 7.4). 


The importance of simple modules is analogous to that of simple groups. Arguing 
by induction on the dimension of V, we have the following lemma analogous to the 
Jordan—H6lder theorem on composition series. 


Lemma 7.2.6. A (finite-dimensional) left B-module V admits a filtration 


V=W2V 2V2 2 V = {0} 


such that V; /V;+; is simple for each i. 


This filtration is not unique, but up to isomorphism and permutation, the quotients 
V; /Vi+1 are unique. 


Lemma 7.2.7. If I is a maximal left ideal of B, then B/I is a simple B-module. 
Conversely, if V is a simple B-module, then V ~ B/I for a maximal left ideal I: more 
precisely, for any x € V\{0}, we may take 


I = ann(x) := {a € B : ax = 0}. 


Proof. For the first statement, a submodule of B/I corresponds to a left ideal containing 
I, so B/I is simple if and only if J is maximal. Conversely, letting x € V\{0} we have 
{0} # Bx C V a B-submodule and so Bx = V; and consequently V ~ B/I where 
I = ann(x) and again J is a maximal left ideal. o 
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Having defined the notion of simplicity for modules, we now consider simplicity 
of the algebra B. 


Definition 7.2.8. An F-algebra B is simple if the only two-sided ideals of B are {0} 
and B. 


Equivalently, B is simple if and only if any F-algebra (or even ring) homomorphism 
B = Ais either injective or the zero map. 


Example 7.2.9. A division F-algebra D is simple. In fact, the F-algebra M,,(D) is 
simple for any division F-algebra D (Exercise 7.5), and in particular M,,(F) is simple. 


Example 7.2.10. Let F”! be an algebraic closure of F. If B &p F” is simple, then B 
is simple. The association J +> I &p F” is an injective map from the set of two-sided 
ideals of B to the set of two-sided ideals of B @p F”. 


7.2.11. If B is a quaternion algebra over F, then B is simple. We have B @p F” ~ 
M2(F*), which is simple by Example 7.2.9, and B is simple by Example 7.2.10. 


Example 7.2.9 shows that algebras of the form M,,(D) with D a division F-algebra 
yield a large class of simple F-algebras. In fact, these are all such algebras, a fact we 
will now prove. First, a few preliminary results. 


Lemma 7.2.12. (Schur). Let B be an F-algebra. Let Vi, Vz be simple B-modules. Then 
any homomorphism ġġ: Vi — V2 of B-modules is either zero or an isomorphism. 


Proof. We have that ker ġ and img ¢ are B-submodules of V; and V2, respectively, so 
either ¢ = 0 or ker d = {0} and img ¢ = Vo, hence V; = V2. o 


Corollary 7.2.13. If V is a simple B-module, then Endg(V) is a division ring. 


7.2.14. Let B be an F-algebra and consider B as a left B-module. Then there is a map 
p: B? — Endg(B) 
a (Pa : BR Ba), 


where B°? is the opposite algebra of B defined in 3.2.2. The map p is injective since 
Pa = 0 implies pa(1) = a = 0; it is also surjective, since if @ € Endg(B) then letting 
a = ¢(1) we have 6(8) = Bġ(1) = Ba for all 8 € B. Finally, it is an F-algebra 
homomorphism, since 


Paglu) = ulap) = (ua)B = (pg ° Pa), 


and therefore p is an isomorphism of F-algebras. 

One lesson here is that a left module has endomorphisms that act naturally on the 
right; but the more common convention is that endomorphisms also act on the left. In 
order to make this compatible, the opposite algebra intervenes. 
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7.2.15. More generally, the decomposition of modules is determined by idempotent 
endomorphisms as follows. Let V be a left B-module. Then V is indecomposable if and 
only if Endg(V) has no nontrivial idempotents: that is to say, if e € Endg(V) satisfies 
e? = e, then e € {0, 1}. Given a nontrivial idempotent, we can write V = eV (1 -—e)V, 
and conversely if V = V; ® V, then the projection V — V, C V gives an idempotent. 


7.2.16. Many theorems of linear algebra hold equally well over division rings as they 
do over fields, as long as one is careful about the direction of scalar multiplication. For 
example, let D be a division F-algebra and let V be a left D-module. Then V ~ D” 
is free, and choice of basis for V gives an isomorphism Endp (V) ~ M,,(D°?). When 
n = 1, this becomes Endp(D) = D®, as in 7.2.14. 


Lemma 7.2.17. Let B be a (finite-dimensional) simple F-algebra. Then there exists a 
simple left B-module which is unique up to isomorphism. 


Proof. Since B is finite-dimensional over F, there is a nonzero left ideal J of B of 
minimal dimension, and such an ideal J is necessarily simple. Moreover, if v € I is 
nonzero then By = I, since By C I is nonzero and / is simple. Let J = By with v € J. 

Now let V be any simple B-module; we will show J ~ V as B-modules. Since B 
is simple, the natural map B — Endp(V) is injective (since it is nonzero). Therefore, 
there exists x € V such that vx + 0, so Zx + {0}. Thus, the map J > V by Bh px isa 
nonzero B-module homomorphism, so it is an isomorphism by Schur’s lemma. o 


Example 7.2.18. The unique simple left M,,(/')-module (up to isomorphism) is the 
space F” of column vectors (Example 7.2.1). 


7.2.19. Every algebra can be decomposed according to its idempotents 7.2.15. Let B 
be a finite-dimensional F-algebra. Then we can write B = J © --- @ I, as a direct 
sum of indecomposable left B-modules: this follows by induction, as the decomposing 
procedure must stop because each factor is a finite-dimensional F'-vector space. This 
means we may write 

l=e,+-::+e 


with e; € I;. For each a € I; we have a = 9); we; whence ae; = œ and ae; = 0 for 
j +i, which implies that 


e? = ĉj, ere; = 0 for j x i, and I; = Bei. 


Thus each e; is idempotent; we call {e;, . . ., e} a complete set of primitive orthogonal 
idempotents: the orthogonal is because e;e; = 0 for j + i, and the primitive is because 
each e; is not the sum of two other orthogonal idempotents (by 7.2.15). 


Remark 7.2.20. The tight connection between F and M,,(F) is encoded in the fact that 
the two rings are Morita equivalent: there is an equivalence of categories between 
F-vector spaces and left M,,(F)-modules. For more on this rich subject, see Lam 
[Lam99, §18], Reiner [Rei2003, Chapter 4], and Curtis—Reiner [CR81, §35]. 
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7.3 Semisimple modules and the Wedderburn—Artin theorem 


We continue our assumptions that B is a finite-dimensional F'-algebra and a B-module 
V is finite-dimensional. 


Definition 7.3.1. A B-module V is semisimple (or completely reducible) if V is 
isomorphic to a (finite) direct sum of simple B-modules V ~ @, Vj. 
B is a semisimple F-algebra if B is semisimple as a left B-module. 


Remark 7.3.2. More precisely, we have defined the notion of left semisimple and 
could equally well define right semisimple; below we will see that these two notions 
are the same. 


Example 7.3.3. If B = F, then simple F-modules are one-dimensional vector spaces, 
and as F is simple these are the only ones. Every F-vector space has a basis and so is 
the direct sum of one-dimensional subspaces, thus every F-module is semisimple. 


Example 7.3.4. A finite-dimensional commutative F-algebra B is semisimple if and 
only if B is the product of field extensions of F, i.e., B ~ Kı xX---x K, with K; 2 Fa 
finite extension of fields. 


Lemma 7.3.5. The following statements hold. 


(a) A B-module V is semisimple if and only if it is the sum of simple B-modules. 
(b) A submodule or a quotient module of a semisimple B-module is semisimple. 
(c) If B is a semisimple F-algebra, then every B-module is semisimple. 


Proof. For (a), let V = >); V; be the sum of simple B-modules. Since V is finite- 
dimensional, we can rewrite it as an irredundant finite sum; and then since each V; is 
simple, the intersection of any two distinct summands is {0}, so the sum is direct. 

For (b), let W C V be a submodule of the semisimple B-module V. Every x € W 
with x + 0 is contained in a simple B-submodule of W by minimality, so W = >); W; 
is a sum of simple B-modules. The result now follows from (a) for submodules. For 
quotient modules, suppose ¢: V — Z is a surjective B-module homomorphism; then 
¢@ '(Z) C V is a B-submodule, and ¢"!(Z) = 5; W; is a sum of simple B-modules, 
and hence by Schur’s lemma Z = >); #(W;) is semisimple. 

For (c), let V be a B-module. Since V is finitely generated as a B-module, there is a 
surjective B-module homomorphism B” — V for some r > 1. Since B” is semisimple, 
so too is V by (b). o 


Remark 1.3.6. Doing linear algebra with semisimple modules mirrors very closely 
linear algebra over a field. We have already seen that every submodule and quotient 
module of a semisimple module is again semisimple. Moreover, every module homo- 
morphism V — W with V semisimple splits, and every submodule of a semisimple 
module is a direct summand. The extent to which this fails over other rings concerns 
the structure of projective modules; we take this up in Chapter 20. 


Lemma 7.3.7. If B is a simple F-algebra, then B is a semisimple F-algebra. 
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Proof. Let I C B be a minimal nonzero left ideal, the unique simple left B-module up 
to isomorphism as in Lemma 7.2.17. For all œ € B, the left ideal Ja is a homomorphic 
image of J, so by Schur’s lemma, either Ja = {0} or Ja is simple. Let A := Viger Ia. 
Then A is a nonzero two-sided ideal of B, so since B is simple, we conclude A = B. 
Thus B is the sum of simple B-modules, and the result follows from Lemma 7.3.5(a). 

o 


Corollary 7.3.8. A (finite) direct product of simple F-algebras is a semisimple F- 
algebra. 


Proof. If B ~ B, x--- X B, with each B; simple, then by Lemma 7.3.7, each B; is 
semisimple so B; = Ð; I;i; is the direct sum of simple B;-modules J;;. Each J;; has 
the natural structure of a B-module (extending by zero), and with this structure it is 
simple, and B = p; j Ty is semisimple. o 


The converse of Corollary 7.3.8 is true and is proven as Corollary 7.3.14, a conse- 
quence of the Wedderburn—Artin theorem. 
In analogy to 7.2.16, we have the following corollary. 


Corollary 7.3.9. Let B be a simple F-algebra and let V be a left B-module. Then 
V = I™ for some n > 1, where I is a simple left B-module. In particular, two left 
B-modules V,, V) are isomorphic if and only if dimp V, = dimp V2. 


Proof. Since B is simple, B is semisimple by Lemma 7.3.7, and V is semisimple by 
Lemma 7.3.5. But by Lemma 7.2.17, there is a unique simple left B-module 7, and the 
result follows. o 


In other words, this corollary says that if B is simple then every left B-module V 
is free over B, so has a left basis over B; if we define the rank of a left B-module V to 
be cardinality of this basis (the integer n such that V = /®” as in Corollary 7.3.9), then 
two such modules are isomorphic if and only if they have the same rank. 

We now come to one of the main results of this chapter. 


Main Theorem 7.3.10. (Wedderburn—Artin). Let B be a finite-dimensional F -algebra. 
Then B is semisimple if and only if there exist integers nı, . . .,n, and division algebras 
D,,...,D, such that 

B ~ Mnp,(D1) X ++- X Mn, (Dr). 


Such a decomposition is unique up to permuting the integers n;, ...,n, and applying 
an isomorphism to the division rings D,,..., Dy. 


Proof. If B ~ |]; Mn, (Dj), then each factor M,,, (D;) is a simple F-algebra by Example 
7.2.9, so by Corollary 7.3.8, B is semisimple. 

So suppose B is semisimple. Then we can write B as a left B-module as the direct 
sum B = I™ ---@1?"" of simple B-modules J, . . ., I, grouped up to isomorphism. 


We have Endg(B) ~ B® by 7.2.14. By Schur’s lemma, 


End,(B) = B End, (12); 
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by 7.2.16, 
Endg (1?) = Mn,(Di) 


where D; = Endg(I;) is a division ring. So 
B = Endg(B)°? = M, (DP) x +++ X Mn, (D;”). 
The statements about uniqueness are then clear. o 


Remark 7.3.11. Main Theorem 7.3.10 as it is stated was originally proven by Wedder- 
burn [Wed08], and so is sometimes called Wedderburn’s theorem. However, this term 
may also apply to the theorem of Wedderburn that a finite division ring is a field; and 
Artin generalized Main Theorem 7.3.10 to rings where the ascending and descending 
chain condition holds for left ideals [Art26]. We follow the common convention by 
referring to Main Theorem 7.3.10 as the Wedderburn—Artin theorem. 


Corollary 7.3.12. Let B be a simple F-algebra. Then B ~ M,,(D) for a unique n € Zs 
and a division algebra D unique up to isomorphism. 


Example 7.3.13. Let B be a division F-algebra. Then V = B is a simple B-module, 
and in Corollary 7.3.12 we have D = Endg(B) = B°, and the Wedderburn—Artin 
isomorphism is just B ~ M,((B°P)°P). 


Corollary 7.3.14. An F-algebra B is semisimple if and only if B is the direct product 
of simple F -algebras. 


Proof. Immediate from the Wedderburn—Artin theorem, as each factor M,,(D;) is 
simple. o 


7.4 Jacobson radical 


We now consider an important criterion for establishing the semisimplicity of an 
F-algebra. Let B be a finite-dimensional F-algebra. 


Definition 7.4.1. The Jacobson radical rad B of B is the intersection of all maximal 
left ideals of B. 


We will in Corollary 7.4.6 see that this definition has left-right symmetry. Before 
doing so, we see right away the importance of the Jacobson radical in the following 
lemma. 


Lemma 7.4.2. B is semisimple if and only if rad B = {0}. 


Proof. First, suppose B is semisimple. Then B as a left B-module is isomorphic to 
the direct sum of simple left ideals of B. Suppose rad B # {0}; then rad B contains 
a minimal, hence simple, nonzero left ideal J C B. Then B = I @ T for some B- 
submodule 7’ and B/I’ = I so I’ is a maximal left ideal. Therefore rad B C I’, but then 
rad BN I = {0}, a contradiction. 

Conversely, suppose rad B = {0}. Suppose B is not semisimple. Let J; be a minimal 
left ideal of B. Since 7} + {0} = rad B, there exists a maximal left ideal J; not 
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containing J1, so Jı N Jı = {0} and B = J; ® Jı. Since B is not semisimple, J; # {0}, 
and there exists a minimal left ideal 2 Ç Jı C B. Continuing in this fashion, we obtain 
a descending chain Jı 2 J2 2 ..., a contradiction. Oo 


Corollary 7.4.3. B/rad B is semisimple. 


Proof. Let J = rad B. Under the natural map B — B/J, the intersection of all maximal 
left ideals of B/ rad B corresponds to the intersection of all maximal left ideals of B 
containing J; but rad B is the intersection thereof, so rad(B/J) = {0} and by Lemma 
7.4.2, B/J is semisimple. o 


We now characterize the Jacobson radical in several ways. 


7.4.4. For a left B-module V, define its annihilator by 
ann V := {æ € B:aV = 0}. 


Every annihilator ann V is a two-sided ideal of B: if œ € ann(V) and £ € B, then 
aBV C aV = {0} so af € ann(V). 


Lemma 7.4.5. The Jacobson radical is equal to the intersection of the annihilators of 
all simple left B-modules: i.e., we have rad B = (\y ann V, the intersection taken over 
all simple left B-modules. Moreover, if œ € rad B, then 1 — «œ € B*. 


Proof. We begin with the containment (2). Let a € (y ann V and let J be a maximal 
left ideal. Then V = B/I is a simple left B-module, so a@ € ann(B/I) whence aB C I 
ande € 1. 

The containment (C) follows with a bit more work. Let œ € rad B, and let V be a 
simple left B-module. Assume for purposes of contradiction that x € V has ax + 0. 
Then as in Lemma 7.2.7, V = B(ax) so x = Bax for some £ € B and (1 — Ba)x = 0. 
Let J be a maximal left ideal containing 1 — Ba. Since a € rad B, we have a € I, and 
thus 1 = (1 — Ba) + Ba € I, a contradiction. Thus «V = {0} and «œ € ann V. 

The final statement follows along similar lines as the previous paragraph (Exercise 
7.10). o 


Corollary 7.4.6. The Jacobson radical rad B is a two-sided ideal of B. 


Proof. The statement follows by combining 7.4.4 and Lemma 7.4.5: rad B is the 
intersection of two-sided ideals and so is itself a two-sided ideal. o 


Example 7.4.7. If B is commutative (and still a finite-dimensional F-algebra), then 
rad B = y(0) is the nilradical of B, the set of all nilpotent elements of B. 


A two-sided ideal J C B is nilpotent if J” = {0} for some n > 1, i.e., every product 
of n elements from J is zero. Every element of a nilpotent ideal is itself nilpotent. 


Lemma 7.4.8. J = rad B contains every nilpotent two-sided ideal, and J itself is 
nilpotent. 
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Proof. If I C B is a nilpotent two-sided ideal, then J + J is a nilpotent two-sided ideal 
of B/J; but rad(B/J) = {0} by Corollary 7.4.3, so B/J is the direct product of simple 
algebras (Corollary 7.3.14) and therefore has no nonzero nilpotent two-sided ideals. 
Therefore 7 C J+J CJ. 

Now we prove that J is nilpotent. Consider the descending chain 


BoJ2 P2... 


There exists n € Zs, such that J” = J*", We claim that J” = {0}. Assume for the 
purposes of contradiction that J ¢ J” is a minimal left ideal such that J"7 + {0}. Let 
a € I be such that J”« + {0}; by minimality J"a@ = J, so a = na for some 7 € J”, thus 
(1 -n)a = 0. But ņ € J” C J = rad B. By Lemma 7.4.5, 1 — 7 € B* is a unit hence 
æ = 0, a contradiction. o 


Example 7.4.9. Suppose B has a standard involution. Then by Lemma 7.4.8 and the 
fact that B has degree 2, we conclude that rad B C {e € B : e = 0}. If char F + 2 
and we define rad(nrd) as in 4.3.9 for the quadratic form nrd, then rad(nrd) = rad B 
(Exercise 7.21). 


Corollary 7.4.10. The Jacobson radical rad B is the intersection of all maximal right 
ideals of B. 


Proof. Lemma 7.4.8 gives a left-right symmetric characterization of the Jacobson 
radical, so rad B = rad B°?. There is a bijection between simple left B-modules and 
simple right B°P-modules, and the result follows. o 


7.5 Central simple algebras 


For more on central simple algebras (and in particular division algebras), see e.g. 
Saltman [Sal99] or Draxl [Dra83]. 
Recall (2.1.1) that the center of B is defined as 


Z(B) := {æ € B : aß = Ba for all a € B}. 


Remark 7.5.1. An F-algebra B is a central Z(B)-algebra when Z(B) is a field. (Under 
a more general definition of algebra, every algebra is an algebra over its center.) 


Example 7.5.2. The center Z(B) of a simple F-algebra is a field, since it is a simple 
commutative F-algebra. One reaches the same conclusion by applying Corollary 7.3.12 
together with Z(M,,(D)) = Z(D) (Exercise 7.5). 

The category of central simple algebras is closed under tensor product, as follows. 


Proposition 7.5.3. Let A, B be F-algebras and suppose that B is central. 


(a) The center of A r B is the image of Z(A) — A 8r B under z œ z@ 1. 
(b) Suppose that A, B are simple. Then A &r B is simple. 
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Proof. First, centrality in part (a). Suppose that y = >); a; ® B; € Z(A 8 B) (a finite 
sum). By rewriting the tensor, without loss of generality, we may suppose that a; are 
linearly independent over F. Then by properties of tensor products, the elements p; € B 
in the representation y = >); a; ® p; are unique. But then for all £ € B, 


Die 8 bbi) = (18 P) » ai 7 = (>: ai 7 (1@B) = Dei ® BiB) 


so BB; = Bi f for each i; thus 6; = b; € Z(B) = F. Hence 


y= dia ®bi = J abio = Jan) on 


since œ 1 also commutes with y for all a € A, we have >); a;b; € Z(A). Thus 
y € Z(A)@F = Z(A). 

Next, simplicity in part (b). Let J be a nontrivial two-sided ideal in A & B, and 
let y = Diy, œi ® B; € IN{0}. Without loss of generality, we may suppose 61 + 0. 
Then B6B = B since B is simple; multiplying on the left and right by elements of 
B C AQB, we may suppose further that 6; = 1. Let y € \{0} be such an element that 
is minimal with respect to m; then in particular the elements £; are linearly independent 
over F. Now for each £ € B, 


(1 ® By — y(1@B) = X (@ 8 (BB; - BiB) € I; 


i=2 
but by minimality of m, the right-hand side is zero, so 68; = pip for all i. Hence 
Pi € Z(B) = F for alli and as above y = a @ 1 for some 0 + a € A. But then 

TD (A® 1\(a@ 1l(A® 1) = (AaA)@1=ABl 


since A is simple, so J D (A @ 1)(1 ® B) = A@B, and thus J = A@B and A 8 B is 
simple. o 


Lemma 7.5.4. If B is a finite-dimensional algebra over F, then B is a central simple 
F-algebra if and only if the map 


$: Brp B? © Endp (B) 
Didi @ Bi > (UH YiaiMBi) 


is an isomorphism. 


Proof. First, the implication (=). Just as in 7.2.14, ġ is a nonzero F-algebra homomor- 
phism. By Proposition 7.5.3, B ®r B® is simple, so ¢ is injective. Since dimp (B @r 
B?) = dimp End (B) = (dimp B}, ¢ is an isomorphism. 

Now the converse (4); suppose ¢ is an isomorphism. If Z is an ideal of B then 
(I ® B°P) C Endr (B) is an ideal; but End; (B) is simple over F, therefore 7 is trivial. 
And if a € Z(B) then ¢(a ® 1) € Z(Endf(B)) = F, so a € F. o 
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7.5.5. Among central simple algebras over a field, quaternion algebras have an espe- 
cially nice presentation because the quadratic norm form can be put into a standard 
form (indeed, diagonalized in characteristic not 2). More generally, one may look at 
algebras with a similarly nice presentation, as follows. 

Let F be a field, let K > F be a cyclic extension of F of degree n = [K : F], let 
o € Gal(K | F) be a generator, and let b € F*. For example, if F contains a primitive 
nth root of unity č € F*%, and a € F*\F*", then we may take K = F(a) and 
o(a) = ¢ Ya. We then define the cyclic algebra 


K,o,b ; .n-1 


to be the left K-vector space with basis 1, j,...,j”~! and with multiplication j” = b 
and ja = o(a@)j for a € K. The definition of a cyclic algebra generalizes that of 6.1.5, 
where there is only one choice for the generator o. A cyclic algebra is a central simple 
algebra over F of dimension n?, and indeed (K, a, b | K) ~ M, (K). (See Exercise 7.12.) 
More generally, we may relax the condition that G be cyclic: there is an analogous 
construction for any finite Galois extension, yielding a central simple algebra called a 
crossed product algebra (and giving an interpretation to a second cohomology group): 
see Reiner [Rei2003, §29-30]. There are significant open problems relating cyclic 
algebras and crossed products to central simple algebras in general [ABGV2006]. 

It is a consequence of the main theorem of class field theory that if F is a global 
field then every (finite-dimensional) central simple algebra over F is isomorphic to a 
cyclic algebra. 


Remark 7.5.6. The theory of central simple algebras and Brauer groups extends to one 
over commutative rings (or even schemes), and this becomes the theory of Azumaya 
algebras: see Saltman [Sal99, §2]. 


7.6 Quaternion algebras 


Having set the stage, we are now ready to prove the following final characterizations 
of quaternion algebras. 


Proposition 7.6.1. Let B be an F-algebra. Then the following are equivalent: 


(i) B is a quaternion algebra; 

(ii) B is a central simple F-algebra with dimp B = 4; 
(iii) B is a central semisimple F-algebra with dimp B = 4; and 
(iv) B @p F” ~ Mo(F”), where F” is an algebraic closure of F. 


Proof. First, (i) > (ii): if B is a quaternion algebra, then B is central simple (7.2.11). 
The equivalence (ii) © (iii) follows from the Wedderburn—Artin theorem: 
1 = dim Z(B) = dimp Z(D;) 2r 
i=l 
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sor=1. 

Next we prove (ii) > (iv). If B is central simple, then B &p F al is a central simple 
F*_algebra by Proposition 7.5.3. But by Exercise 2.9, the only division F*!-algebra is 
F”, so by the Wedderburn—Artin theorem, B 8r F*! ~ M,,(F”); by dimensions, n = 2. 

It remains to prove (iv) > (i). So suppose B F Fl ~ Mo(F*). Then B is simple 
by Example 7.2.10 and dime B = 4. By the Wedderburn—Artin theorem (Corollary 
7.3.12), we have B ~ M,,(D) with n € Zs, and D a division ring. Since 4 = dimp B = 
n? dimp D, either n = 2 and B = M>(F), or n = 1 and B is a division ring. 

In this latter case, the result will follow from Theorem 3.5.1 (and Theorem 6.2.8 
for the case char F = 2) if we show that B has degree 2. But for any a € B we have 
that a € B @p F ~ M2(F%) satisfies its characteristic polynomial of degree 2, so that 
1, a, a” are linearly dependent over F*! and hence linearly dependent over F, by linear 
algebra. o 


Inspired by the proof of this result, we reconsider and reprove our splitting criterion 
for quaternion algebras. 


Proposition 7.6.2. Let B be a quaternion algebra over F. Then the following are 
equivalent: 


(i) B = Mo(F); 

(ii) B is not a division ring; 
(iii) There exists O + € € B such that € = 0; 
(iv) B has a nontrivial left ideal I C B; 


Proof. The equivalence (i) © (ii) follows from the Wedderburn—Artin theorem (also 
proved in Main Theorem 5.4.4 and Theorem 6.4.11). The implications (i) > (iii) > 
(ii) and (i) > (iv) => (ii) are clear. oO 


7.6.3. We showed in Lemma 7.2.17 that a simple algebra B has a unique simple left 
B-module J up to isomorphism, obtained as a minimal nonzero left ideal. If B is a 
quaternion algebra, this simple module Z can be readily identified using the above 
proposition. If B is a division ring, then necessarily Z = B. Otherwise, B ~ M>(F), 
and then J =~ F?, and the map B —> End (J) given by left matrix multiplication is an 
isomorphism. 


7.7 The Skolem—Noether theorem 


We conclude this chapter with a fundamental result that characterizes the automor- 
phisms of a simple algebra—and much more. 


Main Theorem 7.7.1. (Skolem—Noether). Let A, B be simple F-algebras and suppose 
that B is central. Suppose that f,g: A — B are homomorphisms. Then there exists 
B € B* such that f(a) = p~! g(œ)B for alla € A. 


7.7. THE SKOLEM—NOETHER THEOREM 109 


Proof. By Corollary 7.3.12, we have B ~ Endp(V) ~ M,,(D°?) where V is a simple 
B-module and D = Endg(V) is a central F-algebra. Now the maps f, g give V the 
structure of an A-module in two ways. The A-module structure commutes with the 
D-module structure since B ~ Endp(V). So V has two A @r D-module structures via 
f and g. 

By Proposition 7.5.3, since D is central over F, we have that A 8p D is a simple F- 
algebra. By Corollary 7.3.9 and a dimension count, the two A ®r D-module structures 
on V are isomorphic. Thus, there exists an isomorphism £: V —> V of A&F D-modules; 
i.e. (f (@)x) = g(a) B(x) for all a € A and x € V, and B(6x) = p(x) for all 6 € D 
and x € V. We have 6 € Endp(V) = B and so we can write BFD! = g(a) for all 
a € A, as claimed. o 


The following corollaries are immediate consequences (special cases) of the 
Skolem—Noether theorem. 


Corollary 7.7.2. If Aj, Az are simple F-subalgebras of a central simple F-algebra B 
and ġ: Aj — An is an isomorphism of F-algebras, then @ is induced by an inner 
automorphism of B. 


Proof. Let u: Aj — B be the natural inclusions, and apply Main Theorem 7.7.1 to 


f := 4 and g := n o ¢: we conclude there exists £ € B* such that 4(@) = a = 
Bolla) or equivalently ¢(a@) = BaB™! for all a € Aj, as desired. o 


Corollary 7.7.3. If B is a central simple F-algebra and a, a2 € B™, then a, a2 have 

the same irreducible minimal polynomial over F if and only if there exists B € B* such 

that ay = Bay B. 

Proof. The implication (<=) is immediate. Conversely (=), let A; := Fla] = 

F[x]/(f;(x)) where f;(x) € F[x] are minimal polynomials over F. Since these poly- 

nomials are irreducible, A; is a field hence simple, so Corollary 7.7.2 gives the result. 
o 


Corollary 7.7.4. The group of F-algebra automorphisms of a central simple algebra 
B is Aut(B) =~ B*/F*. 


Proof. Taking A = B in Main Theorem 7.7.1, we conclude that every automorphism 
of B as an F-algebra is inner, and an inner automorphism is trivial if and only if it is 
conjugation by an element of the center F*. o 


Example 7.7.5. By Corollary 7.7.4, we have a canonical isomorphism of groups 
Aut(M,(F)) = GL,(F)/F* = PGL, (F). 


As a final application, we extend the splitting criterion of Main Theorem 5.4.4(i) 
© (vi) to detect isomorphism classes of quaternion algebras. 


Corollary 7.7.6. Let K 2 F be a separable quadratic F-algebra, and let b,b’ € F”. 


Then , 
(=) ~ (= ) © b/b’ € Nmxyr(K*). 


F F 
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Taking b’ = 1, we recover the previous splitting criteria. 


Proof. For the implication (=), if b’/b = Nmxjr(q@) with a € K*, then an isomor- 
phism is furnished as left K-vector spaces by sending j > aj. 

For the implication (=), let 6: (K,b| F) — B’ := (K, b’ | F) be an isomorphism 
of F-algebras. If K ~ F x F is nota field, then Nmgjr (K*) = F™ and the result holds. 
So suppose K is a field. Then (K) € B’ isomorphic to K as an F-algebra, but need 
not be the designated one in B’; however, by the Skolem—Noether theorem, we may 
postcompose ¢ with an automorphism that sends ¢(K) to the designated one, i.e., we 
may suppose that ¢ is a K-linear map (taking the algebras as left K-vector spaces). Let 
oj) =a + Bj’ with a, B € K. Then 


{0} = trd(K j) = trd(4(K j)) = trd(K 6()) = trd(Ka) 
and thus œ = 0 since K is separable. Consequently, 
-b = nrd(j) = nrd(9(j)) = - Nmgır (8)b' 
and so b/b’ = NmxjF(f) as desired. o 


In the remainder of this section, we prove an important consequence of the Skolem— 
Noether theorem that compares centralizers of subalgebras to dimensions. 


Definition 7.7.7. Let A be an F-subalgebra of B. Let 
Cpa(A) := {$ € B : aß = Ba for alla € A} 
be the centralizer of A in B. 


The centralizer Cg(A) is an F-subalgebra of B. 


Proposition 7.7.8. Let B be a central simple F-algebra and let A C B a simple 
F-subalgebra. Then the following statements hold: 


(a) Cp(A) is a simple F-algebra. 
(b) dimr B = dimp A- dimp Cg(A). 
(c) Cg(Cg(A)) = A. 


Part (c) of this proposition is called the double centralizer property. 


Proof. First, part (a). We interpret the centralizer as arising from certain kinds of 
endomorphisms. We have that B is a left A& B® module by the action («8 8): u = «uf 
for æ & 8 € A & B” and u € B. We claim that 


Cp(A) = End 4g p(B). (7.7.9) 


Any ¢ € Endagper(B) is left multiplication by an element of B: if y = (1), then 
$W = (1) = yu by 1 & B®-linearity. Now the equality 


ya = ġa) = (l) = ay 
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shows that multiplication by y is A & 1-linear if and only if y € Cg(A), proving (7.7.9). 

By Proposition 7.5.3, the algebra A & B°P is simple. By the Wedderburn—Artin 
theorem, A & BP ~ M,,(D) for some n > 1 and division F-algebra D. Since M,,(D) 
is simple, its unique simple left D-module is V = D”, and Endm,„(D)(V) =~ D®. In 
particular, B ~ V” for some r > 1 as an A & B°P-module. So 


Cp(A) = End4gpr(B) ~ Endy, p)(V") ~ M,(Endm,,(p)(V)) ~ M, (D°). 


Thus Cg(A) is simple. 
For part (b), 
dimp Cg(A) = dimp M,(D®) = r? dimp D 


and 
dimp (A 9 B?) = dimp A- dimp B = n? dimp D 


and finally 
dimp B = dimp V” = r dimp D” = rn dimp D; 


putting these together gives dimp A - dimp Cg(A) = rndimp D = dimr B. 
Finally, part (c) follows from (a) and (b): 


dime B= dimp Cp(A) E dimp Cp(Cp(A)) = dimp A: dimp Cp(A) 
so dimp A = dimp Cg(Cg(A)) and A C Cg(Cg(A)), therefore equality holds. o 


Example 7.7.10. We always have the two extremes A = F and A = B, with Cg(F) = B 
and Cg(B) = F, accordingly. 


We note the following structurally crucial corollary of Proposition 7.7.8. 


Corollary 7.7.11. Let B be a central simple F-algebra and let K be a maximal subfield. 
Then [B : F] = [K : FP. 


Proof. If K is maximal, then Cg(K) = K and [B : F] = [K : FÈ. o 


Corollary 7.7.11 generalizes the comparatively easier statement for quaternion 
algebras: the maximal subfields of a quaternion algebra are quadratic. Returning now 
to quaternion algebras, we conclude with a nice package of consequences of the above 
results concerning embeddings of quadratic fields into quaternion algebras. 


7.7.12. Let B be a quaternion algebra over F and let K C B be a quadratic separable 
F-subalgebra. Then the set of all embeddings of K in B is naturally identified with the 
set K*\B*, as follows. 

By the Skolem—Noether theorem (Corollary 7.1.5, and Exercise 7.11 for the case 
K = F x F), if ġ: K © B is another embedding, then there exists 8 € B* such 
that ¢(æ) = B-'a® for all œ € K, and conversely. Such a conjugate embedding is the 
identity if and only if 8 centralizes K. By Corollary 4.4.5, and Corollary 6.4.2 for 
characteristic 2, the centralizer of K* in B% is K*. Therefore, the set of embeddings of 
K in B is naturally identified with the set K*\B*, with K* acting on the left. 
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7.8 Reduced trace and norm, universality 


In this last section, we consider notions of reduced trace and reduced norm in the 
context of semisimple algebras. Thr 


7.8.1. Let B be a (finite-dimensional) central simple algebra over F, and let F? denote 
a separable closure of F. By Exercise 7.24, we have an F-algebra homomorphism 


$: B@p F°? ~ M,,(F*?) 


for some n > 1. By the Skolem—Noether theorem (Main Theorem 7.7.1), for any 
other isomorphism ¢’: B @r FP ~ M,(FÀ®P), there exists M € GL, (F**?) such that 
¢'(a@) = Mola)M7!, so the characteristic polynomial of an element of B &p F*? is 
independent of the choice of ı. In particular, from the canonical embedding 1: B —> 
B 8f F°? by a + «a 8 1, we define the reduced characteristic polynomial of a € B 
to be the characteristic polynomial of (¢z)(a) as an element of F*P[T] and similarly 
the reduced trace and reduced norm of « to be the trace and determinant of (¢1)(@) 
as elements of F°®P, 

In fact, the reduced characteristic polynomial descends to F, as follows. The abso- 
lute Galois group Galp := Gal(F**? | F) acts on B @p FS? ~ M,,(F*?) by 


ao(a~a@a)=a®a(a) 


foro € Galp, a € B, anda E€ F*®. Leto € Galp. Since o(a @ 1) = a@a(1) =a@l, 
the reduced characteristic polynomials of ı(œ) and o (ı(œ)) are the same. By comparison 
(see e.g. Reiner [Rei2003, Theorem 9.3]), if 


f(a;T) = det(T — (a@)) = T” + an_1T” | + +++ + ao 


is the reduced characteristic polynomial of (œ), then the reduced characteristic poly- 
nomial of (a(z))(q) is 


o(f)(a;T) = det(T — o(u(@))) = T” + O(Gg-1)T | +++ +r (ag). 


And then since f(a@;T) = o(f)(a@;T) for all o € Galp, by the fundamental theorem 
of Galois theory, f(«; T) € F[T]. Therefore, the reduced norm and reduced trace also 
belong to F. 

Alternatively, we may argue as follows. The characteristic polynomial of left mul- 
tiplication by œ on B is the same as left multiplication by (¢)(@) on M, (FP) (by 
extension of basis), and the latter is the nth power of the reduced characteristic poly- 
nomial by Exercise 3.13. Finally, if f(T) € F*°P[T] has f(T)” € F[T] then in fact 
f(T) € F[T]: see Exercise 7.25. 


These definitions extend to a general semisimple algebra over F, but to do so it 
is convenient to give an alternate approach that avoids going to the separable closure 
and works in even more generality using universal elements; for more, see Garibaldi 
[Gar2004]. 

Let B be a (finite-dimensional) F-algebra with n := dimp B, and choose a basis 
€1,..-,én for B. Let F(x, ..., Xn) be a pure transcendental field extension of transcen- 
dence degree n, and let € := xe] ++: + Xnen E€ B 8f F(X,..., Xn); we call £ the 
universal element of B in the given basis. 
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Definition 7.8.2. The universal minimal polynomial of B (in the basis e1, . . ., €n) is 
the minimal polynomial mp(é;T) of £ over F(x1,..., Xn). 

For œ = aje +++: + anen E B (with a; € F), the polynomial obtained from 
mp(&;T) by the substitution x; <— a; is called the specialization of mg(é;T) at a. 


The following example will hopefully illustrate the role of this notion. 


a,b 


Example 7.8.3. For char F + 2 and B = 


} in the basis 1, i, j, ij we have € = 


t + xi + yj + zij (substituting t, x, y,z for x1,...,x4). We claim that the universal 
minimal polynomial is 
mp(é;T) = T? — UT + (t? - ax? — by? + abz’). 
Indeed, we verify that € satisfies mg(&; €) = 0 by considering £ € B @F F(t, x, y, z) = 
b 
—_&° _) and computing that trd(é) = 2t and nrd(€) = t? — ax? — by? + abz*; and 
F(t, x, y, Z) 


this polynomial is minimal because € ¢ F(t, x, y, z) does not satisfy a polynomial of 
degree 1 over F(t, x, y, z). 


7.8.4. If B ~ Bı x--- X B,, then in a basis for B obtained from the union of bases for 
the factors B; with universal elements €;, we have 


mp(E;T) = mp, (€1;T) +++ mp, (ér T). 


In the proofs that follow, we abbreviate by using multi-index notation, e.g. writing 
F[x] := Flx,..., Xn]. 


Lemma 7.8.5. We have mg(€;T) € F[x1,...,Xn][T], i.e., the universal minimal poly- 
nomial has coefficients in F[x,,..., Xn]. 


Proof. For part (a), we consider the map given by left multiplication by € on B8p F(x). 
In the basis e}, . . ., €n, almost by construction we find that the matrix of this map has 
coefficients in F[x] (it is the matrix of linear forms obtained from left multiplication by 
ei). We conclude that é satisfies the characteristic polynomial of this matrix, which is 
a monic polynomial with coefficients in F[x]. Since mg(é; T) divides this polynomial 
(over F(x)) by minimality, by Gauss’s lemma we conclude that mp(é;T) € F[x][T]. 

o 


Proposition 7.8.6. For all a € B, the specialization of mg (£; T) at a is independent of 
the choice of basis e}, . . ., en and is satisfied by the element a. Moreover, if p € Aut(B) 
and a € B, then a and (a) have the same specialized polynomials. 


In view of Proposition 7.8.6, we write mg(a;T) for the specialization of mg(é;T) 
at a € B; from it, we conclude that mg(a; œ) = 0. 


Proof. Since mg(&; €) = 0, by specialization we obtain mg(a;q@) = 0. For the inde- 
pendence of basis, let ej, ..., e, be another F-basis and &’ the corresponding uni- 
versal element. Writing e; in the basis e; allows us to write € = )i/_, €(x)e; where 
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€;(x) € F[x] are linear forms; moreover, writing œ = },;a;e; we have €;(a) = aj. 
The map x; > €(x) extends to an F-algebra automorphism ¢ of F[x] (repeat the 
construction with the inverse, and compose) with ¢(c)(a) = c(a’) for all c(x) € F[x]. 
We let ¢ act on polynomials over F[x] by acting on the coefficients; by uniqueness of 
minimal polynomials, we have ¢(mp(é;T)) = mpg(é’;T). Therefore, looking at each 
coefficient, specializing ¢(mp(é;T)) at a is the same as specializing mg(é’;T) at a’, 
as claimed. 

The second sentence follows by the same argument, as from ġ € Aut(B) we have 
a new basis e; := #(e;) so the specializations again agree. (This argument replaces the 
use of the Skolem—Noether theorem in the special case where B is a central simple 
algebra.) o 


Lemma 7.8.7. For any field extension K 2 F, we have mge, K(E;T) = mg (é; T). 


Proof. First, because an F-basis for B is a K-basis for B @r K, the element é (as 
the universal element of B), also serves as a universal element of B @r K. Since 
K(x1,...,%n) © F(xi,..., Xn), by minimality we have mg, (€;T) | mg(é; T). Con- 
versely, let F(x)[E] © B @pr F(x) be the subalgebra generated by £ over F(x); then 
FLE] ~= FÆLT]/ng(é; T)). Tensoring with K gives K(x)[é] ~ K(a)[T1/(ma (és) 
as the subalgebra of (B 8r K) ®x K(x) generated by £. Thus mg(é;T) | mg, (&;T), so 
equality holds. o 


We conclude by relating this construction to more familiar polynomials. 
Lemma 7.8.8. Let a € B. Then the following statements hold. 


(a) If B = K 2 F is a separable field extension, then mx(a;T) is the characteristic 
polynomial of left multiplication by a. 


(b) If B is a central simple F-algebra, then mg(a;T) is the reduced characteristic 
polynomial of a. 


Proof. For (a), we recall (as in the proof of Lemma 7.8.5 that £ satisfies the character- 
istic polynomial of left multiplication on K, a polynomial of degree n = [K : F]; on 
the other hand, choosing a primitive element «œ, we see the specialization mg (œ; T) is 
satisfied by œ so has degree at least n, so equality holds and mx(a;T) is the charac- 
teristic polynomial. Therefore the universal minimal polynomial is the characteristic 
polynomial, and hence the same is true under every specialization. 

For (b), it suffices to prove this when F = F” is algebraically closed, in which 
case B ~ M,,(F); by Proposition 7.8.6, we may assume B = M,,(F). By 7.8.1, we want 
to show that mg(a;T) for a € M,(F) is the usual characteristic polynomial. But the 
universal element (in a basis of matrix units, or any basis) satisfies its characteristic 
polynomial of degree n, and a nilpotent matrix (with 1s just above the diagonal) has 
minimal polynomial T”, so we conclude as in the previous paragraph. o 


In light of the above, we may make the following definition. 
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Definition 7.8.9. Let B be a semisimple F-algebra. For a € B, the reduced charac- 
teristic polynomial f(a; T) = T” —cT" |! +-+++(-1)"cn € F[T] is the specialization 
of the universal minimal polynomial mg(é;T), and the reduced trace and reduced 
norm are the coefficients c1, Cn, respectively. 


Example 7.8.10. For a semisimple algebra B ~ Bı x--- x B,, with each B; central 
simple, we find that the reduced characteristic polynomial is just the product of the 
reduced characteristic polynomials on each simple direct factor B;; this is well-defined 
(again) by the uniqueness statement in the Wedderburn—Artin theorem (Main Theorem 
Fie 10); 


Proposition 7.8.11. Let B be semisimple. Then the reduced trace trd: B > F is 
F-linear and satisfies trd(aB) = trd(Ba) for all a,B € B; and the reduced norm 
nrd: B > F is multiplicative, satisfying nrd(aB) = nrd(q@) nrd(). 


Proof. Consider (again) V := F(x)[&] € B8&r F(x) the subalgebra generated over F(x) 
by £; then € acts on V = F(x)[T]/(mg(&; T)) by left multiplication with characteristic 
polynomial mp(é; T). It follows that the reduced trace and reduced norm are the usual 
trace and determinant in this representation, so the announced properties follow on 
specialization. o 


Remark 7.8.12. It is also possible to define the reduced characteristic polynomial on 
a semisimple algebra B by writing B =~ Bı x --- x B, as a product of simple algebras; 
for details, see Reiner [Rei2003, §9]. 


7.9 Separable algebras 


For a (finite-dimensional) F'-algebra, the notions of simple and semisimple are sensitive 
to the base field F in the sense that these properties need not hold after extending the 
base field. Indeed, let K 2 F be a finite extension of fields, so K is a simple F- 
algebra. Then K p F” is simple only when K = F and is semisimple if and only if 
K 8p F” = FA x... x F”, i.e., K is separable over F. 

It is important to have a notion which is stable under base change, as follows. 
For further reference, see Drozd—Kirichenko [DK94, §6], Curtis—Reiner [CR81, §7], 
Reiner [Rei2003, §7c], or Pierce [Pie82, Chapter 10]. 


Definition 7.9.1. Let B be a finite-dimensional F'-algebra. We say that B is a separable 
F-algebra if B is semisimple and Z(B) is a separable F'-algebra. 


In particular, a separable algebra over a field F with char F = 0 is just a semisimple 
algebra. 


7.9.2. For a semisimple algebra B ~ My,(D1) X +--+ X Mn, (D+), by Example 7.5.2 we 
have Z(B) =~ Z(D1) X:+: x Z(D,), and B is separable if and only if Z(D;) is separable 
for each į = 1,...,r. 


Lemma 7.9.3. A finite-dimensional simple F-algebra is a separable algebra over its 
center K. 
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Proof. The center of B is a field K = Z(B) and as a K-algebra, the center Z(B) = K 
is certainly separable over K. (Or use Proposition 7.5.3 and Theorem 7.9.4 (iii) below.) 
o 


The notion of separability in this context is quite robust. 


Theorem 7.9.4. Let B be a finite-dimensional F-algebra. Then the following are equiv- 
alent: 


(i) B is separable; 
(ii) There exists a finite separable field extension K of F such that B 8f K ~ 
Mn, (K) X +++ X Mn, (K) for integers ni, ... n, = 1; 
(iii) For every extension K 2D F of fields, the K-algebra B rp K is semisimple; 
(iv) B is semisimple and the bilinear form 


BxB-F 
(a, P) > trd(@B) 
is nondegenerate. 
Moreover, if char F = 0, then these are further equivalent to: 
(v) The bilinear form (a, P) > Trp\r(a@P) is nondegenerate. 


A separable F-algebra is sometimes called absolutely semisimple, in view of 
Theorem 7.9.4 (iii). 


Proof. First we prove (i) = (ii). Let B; be a simple component of B; then Z(B;) is 
separable over F. Let K; 2 F be a separable field extension containing Z(B;) that splits 
Bi, so Bi zB) Ki ~ Mn; (Ki). Let K be the compositum of the fields K;. Then K is 
separable, and 


B; @p K =~ Mn;(Z(Bi) r K) = Mn; (K) X -+ X Mn; (K) 


the number of copies equal to [Z(B;) : F]. 

Next we prove (ii) > (iii). Suppose B &r K ~ []; Mn,(K) and let L 2 F be an 
extension of fields. Let M = KL. On the one hand, B @r M ~ (B 8r K) 8g M = 
I]; Mn; (M), so rad B &r M = {0}; on the other hand, B &F M ~ (B 8f L) 8z M and 
rad(B F L) C rad(B 8f L) 8z M = {0}, so B 8p L is semisimple. 

For the implication (iii) > (i), suppose B is not separable, and we show that there 
exists K 2 F such that B 8r K is not semisimple. If B is not semisimple over F, we 
can just take F = K. Otherwise, Z(B) is not separable as an F-algebra, and there is a 
component of Z(B) which is an inseparable field extension K. Then B 8r K contains a 
nonzero nilpotent element in its center and this element generates a nonzero nilpotent 
ideal, so rad(B &r K) + {0} and B 8r K is not semisimple. 

The implication (iii) > (iv) holds for the following reason. We have B &p F al x 
Mn, (F aly x... x Mn,.(F al) and the reduced trace pairing on each matrix ring factor is 
nondegenerate so the whole pairing is nondegenerate. By linear algebra we conclude 
that the bilinear form on B is nondegenerate. 
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The implication (iv) > (i) holds with char F arbitrary: if € € rad B then we € rad B 
is nilpotent and trd(@e) = 0 for all a € B, and by nondegeneracy e€ = 0. 

The final equivalence (iv) & (v) follows when char F = 0 since the algebra trace 
pairing on each simple factor is a scalar multiple of the reduced trace pairing. o 


Exercises 


Throughout the exercises, let F be a field. 


„b 
> 1. Prove that a quaternion algebra B = (=) with char F + 2 is simple by a direct 


calculation, as follows. 


(a) Let J be a nontrivial two-sided ideal, and let € = t + xi + yj + zij € I. By 
considering ie — ei, show that t + xi € I. 

(b) Arguing symmetrically and taking a linear combination, show that t € J, 
and conclude that t = 0, whence x = y = z = 0. 


> 


b 
Modify this argument to show that an algebra B = E 


) is simple when 


char F = 2. [We proved these statements without separating into cases in 7.2.11.] 


2. Let B be a quaternion algebra over F, and let K C B be an F-subalgebra that is 
commutative. Show that dimp K < 2. 


3. Let B be a quaternion algebra. Exhibit an explicit isomorphism 
B 8r B > M4(F). 
[Hint: see Exercise 2.11.] 


> 4. Let B = Ile ') :a,b,c € F} C Mo(F), and V = F? be the left B-module 
of column vectors. Show that V is indecomposable, but not simple, as a left 


B-module (cf. Example 7.2.5). 


> 5. This exercise proves basic but important facts about two-sided ideals in matrix 
algebras using matrix units. 


(a) Let D be a division F-algebra. Prove that M,,(D) is a simple F-algebra 
with center Z(D) for all n > 1. [Hint: Let E;; be the matrix with 1 in the 
ijth entry and zeros in all other entries. Show that Ex;MEje = mij Exe 
where mj; is the ijth entry of M.] 

(b) More generally, let R be a ring (associative with 1, but potentially non- 
commutative). Show that Z(M,,(R)) = Z(R) and that any two-sided ideal 
of M, (R) is of the form M, (I) E M, (R) where 7 is a two-sided ideal of R. 


> 6. Let F be a field, let B a simple algebra, and let J be a left B-module with 
dimp J = dimp B. Show that J is isomorphic to B as a left B-module, i.e., there 
exists a € I such I = Ba. 
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7. 


> 9, 


> 10. 


11. 


12. 


13. 


14. 
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In this exercise, we consider extensions of the Skolem—Noether theorem. 


(a) Let B be a quaternion algebra over F and let K1, K2 C B be F-subalgebras 
(not necessarily subfields). Suppose that @: Kı — Kz is an isomorphism 
of F-algebras. Show that ¢ lifts to an inner automorphism of B. [Hint: 
repeat the proof of Corollary 7.1.5.] 

(b) Show by example that Corollary 7.7.3 is false if the minimal polynomi- 
als are not supposed to be irreducible. In particular, provide an example 
of isomorphic algebras Kı, K2 C B that are not isomorphic by an inner 
automorphism of B. 


. Let B be a quaternion algebra over F, and let K C B be a separable, quadratic 


F-subalgebra. Show that there exists b € F* such that B = (K | b). [Hint: lift 
the standard involution on K via the Skolem—Noether theorem. ] 


Let B be a finite-dimensional F-algebra. Show that if a € rad B, then 1—Ba € B” 
for all 6 € B. [Hint: if 1 — Ba is not left invertible then it belongs to a maximal 
left ideal; left invertible implies invertible. ] 


Extend Corollary 7.1.5 to the case where K = F x F as follows: show that 
if Kı, K2 C B are F-subalgebras with K; = F x F, and ¢: Kı — Kı is an 
isomorphism of F-algebras, then ¢ lifts to an inner automorphism of B. 


Let n € Zs and let F be a field with char F { n. Let ¢ € F be a primitive nth 


,b 
root of unity. Let a,b € F* and let A = (=) be the algebra over F generated 


’ 


by elements i, j subject to 
Maa, J'"=b Ji={i]. 


(a) Show that dimp A = n’. 

(b) Show that A is a central simple algebra over F. 

(c) Let K = F[i] ~ F[x]/(x” — a). Show that if b € Nmgjr(K*%) then A ~ 
M, (F). [Such algebras are called cyclic algebras or sometimes power 
norm residue algebras. | 


Generalize the statement of Proposition 7.5.3(a) as follows. Let A,B be 
F-algebras, and let A’ C A and B’ C B be F-subalgebras. Prove that 


Caes(A’ ® B’) = CACA’) 8 Cp(B’). 
Let B be a finite-dimensional F-algebra. Show that the following are equivalent: 


(i) B is separable; 
(ii) B is semisimple and the center K = Z(B) is separable; 
(iii) B &r B® is semisimple. 


Let G + {1} be a finite group. Show that the augmentation ideal, the two-sided 
ideal generated by g — 1 for g € G, is a nontrivial ideal, and hence F[G] is not 
simple as an F-algebra. 
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15. 


16. 


17. 


> 18. 


> 19. 
> 20. 


21. 


22. 


Let G be a finite group of order n = #G. Show that F[G] is a separable F- 
algebra if and only if char F { n as follows. [This exercise is known as Maschke’s 
theorem.] 


(a) Suppose first that char F = 0 for a special but quick special case. Compute 
the trace pairing and conclude F[G] is separable. 

(b) If char F | n, show that N = X geg g is a nilpotent element in the center of 
F[G], so F[G] is not semisimple. 

(c) Suppose that char F { n. Let B = F[G]. Define the map of left B-modules 
by 


$: B > B 8r B? =: BS 


1 
D=- 288" 
$ P 2 888 
so that ø(a) = œg(1) for all a € B. Give B the structure of a B°-algebra 
by (œ, @°) : B + aBa®. Show that ¢ is a homomorphism of B°-modules, 
and that the structure map y: B® — B has y o ¢ = idg. Conclude that B 
is separable. 


Let B be an F-algebra, and let F*! be an algebraic closure of F. Show that B is 
simple if and only if B &r F” is simple. 


Let D be a (finite-dimensional) division algebra over F al Show that D = F®. 
Conclude that if B is a simple algebra over F”, then B ~ M,,(F”) for some n > 1 
and hence is central. 


Let B be a (finite-dimensional) F algebra, and let K 2 F be a finite separable 
extension of fields. Show that rad(B @f K) = rad(B) @F K. 
Show that if B is a semisimple F-algebra, then so is M,,(B) for any n € Zs. 


Let B be a (finite-dimensional) F-algebra with standard involution and suppose 
char F # 2. 


(a) Show that radB = radnrd. Conclude B is semisimple if and only if 
rad nrd = {0}. 

(b) Suppose B + F and B is central. Conclude that B is a quaternion algebra 
if and only if radnrd = {0}. 


Compute the Jacobson radical rad B of the F-algebra B with basis 1, i, j,ij 
satisfying 

i? =a, j° =0, and ij =-ji 
for a € F, and compute B/ rad B. In particular, conclude that such an algebra 
is not semisimple, so B is not a quaternion algebra. [Hint: restrict to the case 
char F # 2 first. ] 


Give an example of (finite-dimensional) simple algebras A, B over a field F such 
that A &r B is not simple. Then find A, B such that A p B is not semisimple. 
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> 23. 


24. 


25. 


> 26. 


27. 


28. 


> 29. 


> 30. 
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In Exercise 7.18, we saw that if D is a (finite-dimensional) division algebra over 
F then D 8p F” = M, (F®) for some n > 1. In this exercise, we show the same 
is true if we consider the separable closure. (We proved this already in Exercise 
6.3 for D a quaternion algebra.) 

Let F be a separably closed field, so every nonconstant separable polynomial 
with coefficients in F has a root in F. Let D be a finite-dimensional central 
division algebra over F with char F = p. For purposes of contradiction, assume 
that D + F. 


(a) Prove that dimp D is divisible by p. 
(b) Show that the minimal polynomial of each nonzero d € D has the form 


xP° —a for some a € F ande > 0. 
(c) Choose an F“!-algebra isomorphism ġ: D @p F*! ~ M,,(F*). Show that 


tr ø(x ® 1) = 0 for all x € D. 
(d) Prove that D does not exist. 


Let K 2 F be a separable (possibly infinite) extension, and let f(T) € K[T]. 
Suppose that f(T)” € F[T] for some n € Z3,. Show that f(T) € F[T]. [Hint: 
when p = char F | n, use the fact that a? € F implies a € F.] 


Let B be a finite-dimensional F-algebra, let a € B, and let fi (œ; T) and fr(a;T) 
be the characteristic polynomial of left and right multiplication of œ on B, 
respectively. 


(a) If B is semisimple, show that f.(a@;7) = fr(a;T). 
(b) Give an example where f.(a@;T) + fr(a@;T). 


Use the Skolem—Noether theorem to give another solution to Exercise 6.2: if 
K C Bisa separable quadratic F-algebra then B = (K, b | F) for some b € F™. 
Give a direct proof of Corollary 7.7.4. [Hint: Use the fact that there is a unique 
simple left B-module. ] 


Let B = (K,b | F) be a quaternion algebra. Show that the subgroup of Aut(B) 
that maps K C B to itself is isomorphic to the group 

K*/F* U j(K*/F*). 
Show that the subgroup of Aut(B) that restricts to the identity on K (fixing K 
elementwise) is isomorphic to K*/F*. 


Use the Skolem—Noether theorem and the fact that a finite group cannot be 
written as the union of the conjugates of a proper subgroup to prove Wedderburn’s 
little theorem: a finite division ring is a field. 


Let B be a quaternion algebra over F. In this exercise, we show that the commu- 
tator subgroup 
[B*, BX] = (aba |B"! : a, B € BX) 


is precisely 


[B*, BX] = B! = {y € B® : nrd(y) = 1} = SL, (B). 
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(a) Show that [B*%, BX] < B!. 
(b) Show that [GL2(F),GL2(F)] = SLa(F) if #F > 3. [Hint: choose z € F 


(c 


wa 


such that z? — 1 € F*, let y= (i = , and show that for all x € F we 


0 


bbb i?) 


and analogously for the transpose. See 28.3 for a review of elementary 
matrices. ] 


have 


Suppose that B is a division algebra. Let y € B!. Show that there exists 
a € K = F(y) such that aq! = y. [Hint: This is a special case of Hilbert’s 
theorem 90. Leta = y + 1 if y + -1, and a € B°\{0} if y = -1, with 
appropriate modifications if char F = 2.] Conclude from the Skolem— 
Noether theorem that there exists £ € B* such that BaB"! = @, and thus 
y € [B*, B™]. 


31. Show that every ring automorphism of H is inner. (Compare this with ring 
automorphisms of C!) 
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Chapter 8 


Simple algebras and involutions 


In this chapter, we examine further connections between quaternion algebras, simple 
algebras, and involutions. 


8.1 > The Brauer group and involutions 


An involution on an F-algebra B induces an isomorphism ~ : B —> B°?, for example 
such an isomorphism is furnished by the standard involution on a quaternion algebra B. 
More generally, if B4, Bz are quaternion algebras, then the tensor product Bı &F B2 has 
an involution provided by the standard involution on each factor giving an isomorphism 
to (B1 F B2)? = BYP QF BS>—but this involution is no longer a standard involution 
(Exercise 8.1). The algebra Bı ®f B2 is a central simple algebra over F called a 
biquaternion algebra. In some circumstances, we may have 


By 9p By =~ Mo(B3) (8.1.1) 


where B3 is again a quaternion algebra, and in other circumstances, we may not; 
following Albert, we begin this chapter by studying (8.1.1) and biquaternion algebras 
in detail. 


To this end, we look at the set of isomorphism classes of central simple algebras 
over F, which is closed under tensor product; if we think that the matrix ring is 
something that is ‘no more complicated than its base ring’, it is natural to introduce an 
equivalence relation on central simple algebras that identifies a division ring with the 
matrix ring (of any rank) over this division ring. More precisely, if A, A’ are central 
simple algebras over F we say A, A’ are Brauer equivalent if there exist n,n’ > 1 such 
that M,(A) = M,,(A’). In this way, (8.1.1) reads Bı &F B2 ~ B3. The set of Brauer 
equivalence classes [A] has the structure of a group under tensor product, known as the 
Brauer group Br(F) of F, with identity element [F] and inverse [A]~! = [A°?]. The 
class [B] € Br(F) of a quaternion algebra B is a 2-torsion element, and therefore so is a 
biquaternion algebra. In fact, by a striking theorem of Merkurjev, when char F + 2, all 
2-torsion elements in Br(F) are represented by a tensor product of quaternion algebras 
(see section 8.3). 
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Finally, our interest in involutions in Chapter 3 began with an observation of 
Hamilton: the product of a nonzero element with its involute in H is a positive real 
number (its norm, or square length). We then proved that the existence of such an 
involution characterizes quaternion algebras in an essential way. However, one may 
want to relax this setup and instead consider when the product of anonzero element with 
its involute merely has positive trace. Such involutions are called positive involutions 
and they arise naturally in algebraic geometry: the Rosati involution is a positive 
involution on the endomorphism algebra of an abelian variety, and it is a consequence 
that this algebra (over Q) is semisimple, and unsurprisingly quaternion algebras once 
again feature prominently (see sections 8.4—8.5). 


8.2 Biquaternion algebras 


Let F be a field. All tensor products in this section will be taken over F. 


8.2.1. Let Bı, B2 be quaternion algebras over F. The tensor product Bı ® B2 is a 
central simple algebra over F of dimension 4* = 16 called a biquaternion algebra. A 
biquaternion algebra may be written as a tensor product of two quaternion algebras in 
different ways, so the pair is not intrinsic to the biquaternion algebra. 

By the Wedderburn—Artin theorem (Main Theorem 7.3.10), we have exactly one 
of the three following possibilities for this algebra: 


- Bı ® By is a division algebra; 
e Bı ® B2 ~ M2(B3) where B3 is a quaternion division algebra over F; or 
- Bi 8 B2 = M4(F). 


We could combine the latter two and just say that Bı ® By ~ M2(B3) where B3 is a 
quaternion algebra over F, since M2(M2(F)) ~ Ma(F) as F-algebras. 


Example 8.2.2. By Exercise 8.2, when char F + 2 we have 


b 
(==) 3 (=>) = M2(B3) 


b b 2 
} In particular, (5 Jo ) ~ M4(F), since (=| =x M>)(F). 


,bıb 
where B3 = £ : 


Example 8.2.2 is no accident, as the following proposition indicates. 
Proposition 8.2.3. (Albert). The following are equivalent: 


G) There exists a quadratic field extension K D F that can be embedded as an 
F-algebra in both B, and Bo; 
(ii) Bı and By have a common quadratic splitting field; and 
(iii) Bı ® By is not a division algebra. 


Proof. The equivalence (i) © (ii) follows from Lemma 5.4.7. 
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For the implication (i) > (iii), for i = 1,2 let a; € B; generate K so a? =taj-—n 
with t,n € F. Let 
B:=a81-1@ay. 


Then 
Bla, @14+1@a.-1) =a, @1-1@a5 -#B 


(8.2.4) 
= (ta; —n)@®1-—1®@(ta.-—n)-tB =0. 


Therefore £ is a zerodivisor and Bı ® B? is not a division algebra. 
To finish, we prove (iii) > (i). We have an embedding 


Bi = Bi 8 B2 


arael 


and similarly B2; the images of Bı and Bz in Bı ® B2 commute. Write B2 = (K, b2 | F). 
Consider (B,)x = Bı ®K C Bı 8 By; then (B))x is a quaternion algebra over K (with 
dimr(B1)x = 8). If (Bı)x is not a division algebra, then K splits Bj and K — B; and 
we are done. So suppose that (B))x is a division algebra. Then 


B, @ B2 = (By)x + (Bi) Ki 


is free of rank 2 as a left (B,)x-module. 

Since Bı ® B2 ~ M2(B3) is not a division algebra, there exists € € Bı ® B2 nonzero 
such that e? = 0. Without loss of generality, we can write € = a; ®z+j where aq € Bı 
and z € K. Then 


0=€ = 8z +(a1 @Dj + (a1 @Dj + br. (8.2.5) 
From the basis 1, j over (B,)x, if Z = t — z with t € F, we conclude that 
a @zt+a,@(t-—z)=a, @t=0. 


Therefore t = 0, and z? = c for some c € F*. Then from (8.2.5) ca? +b =0 
so a? = —b/c and Bı contains the quadratic field F(\V-bic). But so does Bp, as 
(zj) = —boc as well. 

(For an alternate proof, see Jacobson [Jacn2009, Theorem 2.10.3].) oO 


Remark 8.2.6. In view of Proposition 8.2.3, we say that two quaternion algebras 
Bı, By over F are linked if they contain a common quadratic field extension K 2 F. 
For further discussion of biquaternion algebras and linkage in characteristic 2 (where 
one must treat separable and inseparable extensions differently), see Knus [Knu93], 
Lam [Lam2002], or Sah [Sah72]. Garibaldi-Saltman [GS2010] study the subfields of 
quaternion algebra over fields with char F # 2. 


From now on, we suppose that char F + 2. (For the case char F = 2, see Chapman- 
Dolphin—Laghbribi [CDL2015, §6].) 
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8.2.7. Motivated by Proposition 8.2.3, we consider the quadratic extensions repre- 
sented by Bı and B2 encoded in the language of quadratic forms (recalling Lemma 
5.5.4). Let 

V = {æ 8 1 — 1 8 a E Bı 8 Bp: trd(@ı) = trd(az)}. 


Then dimp V = 6, and we may identify V = B? @l-1 @ BS. The reduced norm on each 
factor separately defines a quadratic form on V by taking the difference: explicitly, if 
Bı = (aq, bı | F) and Bo = (ap, b2 | F), then taking the standard bases for B1, B2 
Q(B), B2) = (-a1, —b1, a, b1) Æ —(-ap, —b2, arb2) 
= (-a), —b1, a1b1, a2, b2, —a2b2). 


The quadratic form Q(B, B2) : V > F is called the Albert form of the biquaternion 
algebra Bı ® Bo. 


We then add onto Proposition 8.2.3 as follows. 


Proposition 8.2.8. (Albert). Let Bı ® B2 be a biquaternion algebra over F (with 
char F + 2) with Albert form Q(B), Bz). Then the following are equivalent: 


(i) Bı, B2 have a common quadratic splitting field; 
(iv) Q(B1, B2) is isotropic. 


Proof. The implication (ii) > (iv) follows by construction 8.2.7. To prove (iv) > 
(ii), without loss of generality, we may suppose Bj, B2 are division algebras; then 
an isotropic vector of Q corresponds to elements a; € Bı and a2 € Bz such that 
ay = as = c € F*. Therefore K = F(yc) is acommon quadratic splitting field. o 
Remark 8.2.9. Albert’s book [Alb39] on algebras still reads well today. The proof of 
Proposition 8.2.3 is due to him [Alb72]. (“I discovered this theorem some time ago. 
There appears to be some continuing interest in it, and I am therefore publishing it 
now.’) Albert used Proposition 8.2.8 to show that 


-1,-1 x,y 
Bı = |——] and By =|— 
F F 


over F = R(x, y) have tensor product Bı ®¢ B2 a division algebra by verifying that 
the Albert form Q(B, B2) is anisotropic over F. See Lam [Lam2005, Example VI.1] 
for more details. For the fields of interest in this book (local fields and global fields), 
a biquaternion algebra will never be a division algebra—the proof of this fact rests on 
classification results for quaternion algebras over these fields, which we will take up 
in earnest in Part II. 


8.3 Brauer group 


Motivated to study the situation where Bı & B2 ~ M2(B3) among quaternion algebras 
Bı, B2, B3 more generally, we now turn to the Brauer group. 

Let CSA(F) be the set of isomorphism classes of central simple F-algebras. The 
operation of tensor product on CSA(F) defines a commutative binary operation with 
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identity F, but inverses are lacking (for dimension reasons). So we define an equivalence 
relation ~ on CSA(F) by 


A ~ A’ if My(A) = M,,(A’) for some n,n’ > 1 (8.3.1) 


and we say then that A, A’ are Brauer equivalent. In particular, A ~ M,,(A) for all 
A € CSA(F) as needed above. 


Lemma 8.3.2. The set of equivalence classes of central simple F-algebras under the 
equivalence relation ~ has the structure of an abelian group under tensor product, 
with identity [F] and inverse [A]! = [A°?]. 


Proof. By Exercise 8.5, the operation is well-defined: if A, A’ € CSA(F) and A’ ~ 
A” € CSA(F) then AQ A’ ~ A@A”. To conclude, we need to show that inverses exist. 
This is furnished by Lemma 7.5.4: if dimp A = n and A® is the opposite algebra of A 
(3.2.2) then the map 


Ar A”? > Endr (A) =~ M, (F) 
ape (u> aup) 


is an isomorphism of F-algebras, so [A]! = [A°?] provides an inverse to [A]. o 
So we make the following definition. 


Definition 8.3.3. The Brauer group of F is the set Br(F) of Brauer equivalence classes 
of central simple F-algebras (8.3.1) under the group operation of tensor product. 


8.3.4. Let B be a quaternion algebra over F. We have B ~ Mo(F) if and only if [B] = [F] 
is the identity. Otherwise, B is a division algebra. Then the standard involution gives 
an F-algebra isomorphism B — B?, and hence in Br(F) we have [B]! = [B] and 
so [B] is an element of order 2. Since Br(F) is abelian, it follows that biquaternion 
algebras, or more generally tensor products B1 ® --- ® B; of quaternion algebras B;, 
are also elements of order at most 2 in Br(F). 


Theorem 8.3.5. (Merkurjev). Let char F + 2. Then Br(F)[2] is generated by quater- 
nion algebras over F, i.e., every (finite-dimensional) central division F-algebra with 
involution is Brauer equivalent to a tensor product of quaternion algebras. 


Remark 8.3.6. More generally, Merkurjev [Mer82] proved in 1981 that a division 
algebra with an involution is Brauer equivalent to a tensor product of quaternion 
algebras; more precisely, if D is a division F-algebra with (not necessarily standard) 
involution, then there exists n € Zs, such that M,(D) is isomorphic to a tensor 
product of quaternion algebras. His theorem, more properly, says that the natural 
map K2(F) — Br(F)[2] is an isomorphism. (Some care is required in this area: for 
example, Amitsur-Rowen-Tignol [ART79] exhibit a division algebra D of degree 8 
with involution that is not a tensor product of quaternion algebras, but M>(D) is a tensor 
product of quaternion algebras.) For an elementary proof of Merkurjev’s theorem, see 
Wadsworth [Wad86]. 
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Remark 8.3.7. Just as quaternion algebras are in correspondence with conics (Corollary 
5.5.2), with a quaternion algebra split if and only if the corresponding conic has a 
rational point (Theorem 5.5.3), similarly the Brauer group of a field has a geometric 
interpretation (see e.g. Serre [Ser79, §X.6]): central simple algebras correspond to 
Brauer—Severi varieties—for each degree n > 1, both are parametrized by the Galois 
cohomology set H!(Gal(F**? | F), PGL,,). 


8.4 Positive involutions 


We now turn to study algebras with involution more general than a standard involution. 
Throughout this section, let F c R be a subfield of R and B a finite-dimensional 
F-algebra. We define the trace map Tr : B — R by the trace of left multiplication. 


Definition 8.4.1. An involution * : B — B is positive if Tr(a*a) > 0 forall a € B\ {0}. 


Since the map (a, 8) + Tr(a”) is bilinear, an involution * on B is positive if and 
only if Tr(a*@) > 0 for « in a basis for B and so is positive if and only if its extension 
to B @f R is positive. 


Example 8.4.2. The standard involutions on R, C, and H, defined by a + trd(a) - a, 
are positive involutions. The standard involution on R x R is not positive since for 
æ = (x1, X2) E€ R x R we have Tr(@q@) = 2x1x2. The standard involution on M2(R) is 
also not positive, since for a € M2(R) we have Tr(aw) = 4 det(a). 


8.4.3. Let D be one of R, C, or H. Let B = M,,(D). The standard involution ~ on D 
extends to an involution on B, acting on coordinates. The conjugate transpose (or, 
perhaps better the standard involution transpose) map 


*: B>B 
ara=a 
also defines an involution on B, where + is the transpose map. If a = (4j;)i,j=1,..,.n then 
n 
Tr(a" æ) = n(dimpg D) a Tai; > 0; (8.4.4) 
ij=l 
thus * is positive, and the norm @ +» Tr(a*q) is (an integer multiple of) the Frobenius 


norm on B. 


We will soon see that every positive involution can be derived from the conjugate 
transpose as in 8.4.3. First, we reduce to the case where B is a semisimple algebra. 


Lemma 8.4.5. Suppose that B admits a positive involution *. Then B is semisimple. 


Proof. We give two proofs. First, we appeal to Theorem 7.9.4: since the trace pairing 
is positive definite, it is nondegenerate and immediately B is semisimple. 
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For a second (more general) proof, let J = rad B be the Jacobson radical of B. 
By Lemma 7.4.2, B is semisimple if and only if rad B = {0}, and by Lemma 7.4.8, 
J = rad B is nilpotent. Suppose for purposes of contradiction that J + {0}. Then there 
exists n > O such that J” # {0} but J”*! = {0}. Let € € J be such that e” + 0 but 
e”+! = 0. The involution gives an isomorphism B > B® taking maximal left ideals 
to maximal right ideals and therefore by Corollary 7.4.6 we conclude J* = J. Thus 
e"e* = 0 so Tr(e"(e*)”) = Tr(e"(e”)*) = 0, contradicting that * is positive. o 


8.4.6. Suppose B is semisimple with a positive involution *, and let B; be a simple 
factor of B. Then * preserves B;: for if B; = B; + B;, then B; is a simple factor and 
B;B; = 0 so Tr(B;B;) = Tr(B;B;) = {0}, a contradiction. 


Putting Lemma 8.4.5 with 8.4.6, we see it is enough to classify positive involutions 
on simple R-algebras. By the theorem of Frobenius (Corollary 3.5.8), a simple algebra 
over R is isomorphic to M, (D) with D = R, C, H, so 8.4.3 applies. 


Proposition 8.4.7. Let B ~ M,,(D) be a simple R-algebra and let * be the conjugate 
transpose involution on B. Let: B > B be another positive involution on B. Then 
there exists an element u € B™ with y* = u such that 


at = platy 
forall a € B. 


Proof. First suppose B is central over R. Then the involutions * and * give two 
R-algebra maps B — B®. By the Skolem—Noether theorem (Main Theorem 7.7.1), 
there exists u € B* such that at = w-'a* u. Since 


a = (at) = (W'a Wt =e Ge pea T oe wy (8.4.8) 
for all a € B, we have w'y* € Z(B) = R, so u* = cpu for some c € R. But 
(u*)* = u = (cp*) = c°, thus c = +1. But if c = —1, then u is skew-symmetric so 
its top-left entry is 411 = 0; but then for the matrix unit e}; we have 


Tr(ere!,) = Tren ey p) = Tew ter pe) = Tru! m1) = 0, (8.4.9) 


a contradiction. 

A similar argument holds if B has center Z(B) = C. The restriction of an involution 
to Z(B) is either the identity or complex conjugation; the latter holds for the conjugate 
transpose involution, as well as for Ìt: if z € Z(B) then Tr(zz') = n?(zz‘) > 0, and 
we must have z’ = Z. So the map œ > (a*)' is a C-linear automorphism, and 
again there exists u € B* such that a’ = yw-'a*y. By the same argument, we have 
pu = zp with z € C, but now u = (u*)* = Zzu so |z| = 1. Let w? = w/w = z; then 
(wp)* = wu = wz = wu. Replacing u by wu, we may take z = 1. Oo 


Corollary 8.4.10. The only positive involution on a real division algebra is the standard 
involution. 


Proof. Apply Proposition 8.4.7 with n = 1, noting that u* = g = u implies u € R. 
o 
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8.4.11. Let u € B* with u* = u. Then u is self-adjoint with respect to the pairing 
(a, B) > Tr(a” B): 


(ua, P) = Tr((ua)* p) = Tr(a" u“ p) = Tr(a" up) = (æ, up). 


It follows from the spectral theorem that the R-linear endomorphism of B given by 
left-multiplication by u on B as an R-algebra is diagonalizable (with real eigenvalues) 
via a symmetric matrix. We say u is positive definite (for *) if all eigenvalues of u 
are positive. The map a + Tr(a* uœ) defines a quadratic form on B, and u is positive 
definite if and only this quadratic form is positive definite. 


Lemma 8.4.12. Let u* = u. Then the involution at = p~'a* u is positive if and only if 
either u or — is positive definite. 


Proof. Diagonalize the quadratic form œ + Tr(a*uq@) to get (aj,...,@m) in a nor- 
malized basis e1, .. ., €m, and suppose without loss of generality that a; = +1. If all 
a; = —1, then we can replace u with —y without changing the involution to suppose 
they are all +1. 

Suppose j is not positive, and without loss of generality a; < 0 and az > 0, then 
Tr((e, + e2)" u(ei + e2)) = -1 + 1 = 0, a contradiction. Conversely, if u is positive 
definite, then all eigenvalues are +1. Let v = y% be such that y* = v, and then 


Tr(a“ W'ap) = Tr(a*v av’) = Tr((va* v7! (vay) 


8.4.13 
= Tr((vlav)*(vtayv)) > 0 : 


for all a € B, so Ý is positive. oO 


Example 8.4.14. If n = 1, and B = D, then the condition u* = u implies u € R, and 
the condition u positive implies u > 0; rescaling does not affect the involution, so we 
can take u = | and there is a unique positive involution on D given by *. 


Example 8.4.15. Let B = M2(R). Then u = E 2 is positive definite if and only if 


a > Oand b? —4ac < 0. Combining Proposition 8.4.7 with Lemma 8.4.12, we see that 
all positive involutions Ý on B are given by a’ = py! a*p where u is positive definite. 

We can instead relate positive involutions to the standard involution œ instead of 
the transpose; to this end, it is enough to find jJ € B* = GL2(R) such that @ = y"!a*y, 


and the element 7 = k o) does the trick, because 


BENTL as 


O 1\(2a b b 2c = 
From the product E , | b a = es 2 a: we conclude that all positive 


involutions are given by at = p` t@u where pi? € Reo. 
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Remark 8.4.17. Beyond the application to endomorphism algebras, Weil [Weil60] 
has given a more general point of view on positive involutions, connecting them to 
the classical groups. For more on involutions on finite-dimensional algebras over real 
closed fields, see work of Munn [Mun2004]. 


8.5 *«Endomorphism algebras of abelian varieties 


We conclude this chapter with an advanced (optional) application: we characterize 
endomorphism algebras of (simple) abelian varieties in terms of algebras with invo- 
lutions. We borrow from the future the notions from section 43.4. Briefly, a complex 
torus of dimension g is a complex manifold of the form A = V/A for g > 0, where 
A C V ~ C8 isa lattice (discrete subgroup) and A = Z?8. A complex abelian variety 
is a certain kind of complex torus. A complex abelian variety A is simple if A has no 
abelian subvariety other than {0} and A. 

An endomorphism of A is a C-linear map a: V — V such that a(A) € A. Let 
End(A) be the ring (Z-algebra) of endomorphisms of A. 


Proposition 8.5.1. B = End(A)® Q is a finite-dimensional algebra over Q that admits 
a positive involution: B > B. 


Proof. The algebra B acts faithfully on A &Q = Q’S, so is isomorphic to a subalgebra 
of M2(Q) hence is finite-dimensional over Q. For positivity, see Proposition 43.4.24 
(for the case when A is principally polarized). o 


Remark 8.5.2. The involution Ý : B —> B is called the Rosati involution (and depends 
on a choice of polarization 4: A — A“, where AY is the dual abelian variety). 


Now Lemma 8.4.5 and Proposition 8.5.1 imply that B is semisimple as a Q-algebra, 
with 


B = | | MDD 


Fe 
i=l 
where each D; C B is a division algebra. It follows that A is isogenous to a product 
n ny 
Ay KoA, 


where 7),...,”, > Oand Aj,..., A, are simple pairwise nonisogenous abelian subva- 
rieties of A such that D; = End(A;) 87 Q. 

We therefore reduce to the case where A is simple, and D := End(A) ® Q is a 
division algebra. Let K := Z(D) be the center of D and let 


Ko = KM ={ae€K:a' =a} 
be the subfield of K where Ë acts by the identity. 


Lemma 8.5.3. Ko is a totally real number field, i.e., every embedding Ky —> C factors 
through R, and if + acts nontrivially on K, then K is a CM field, i.e., K is a totally 
imaginary extension of Ko. 


132 CHAPTER 8. SIMPLE ALGEBRAS AND INVOLUTIONS 


Proof. The positive involution * restricts to complex conjugation on Z(D) by Propo- 
sition 8.4.7, so for all embeddings Kp — C, the image lies in R. For the same reason, 
we cannot have Ý acting nontrivially on K and have an embedding K © R. o 


The following theorem of Albert classifies the possibilities for D. 


Theorem 8.5.4. (Albert). Let D be a (finite-dimensional) division algebra over Q with 
positive involution t and center K = Z(D), let Ky := K“ and n := [Ko : Q]. Then Ko 
is a totally real number field, and one of the four following possibilities holds: 


Œ) D =K = Ko and‘ is the identity; 
AD K = Ko and D is a quaternion algebra over Ko such that 


D @g R = Mx(R)”, 


and there exists u € D* such that p? = d € KX is totally negative and at = 
way for all a € D; 
D K = Ko and D is a quaternion algebra over Ko such that 


D & R = H”, 


and‘ is the standard involution; or 
(IV) K 2 Ko and 
D ®@g R = Ma(C)” 


for some d > 1, and’ extends to the conjugate transpose * on each factor Mq(C). 


Proof. We have assembled many of the tools needed to prove this theorem, and hope- 
fully motivated its statement sufficiently well—but unfortunately, a proof remains just 
out of reach: we require some results about quaternion algebras over number fields 
not yet in our grasp. For a proof, see Mumford [Mum70, Application I, §21] or 
Birkenhake—Lange [BL2004, §§5.3-5.5]. 

To connect a few dots as well as we can right now, we give a sketch in the case 
where K = Ko for the reader who is willing to flip ahead to Chapter 14. In this case, 
D is a central division algebra over K = Ko and has a Ko-linear involution giving 
an isomorphism D æ D°? of Ko-algebras. Looking in the Brauer group Br(Ko), we 
conclude that [D] = [D°?] = [D]~!, so [D] € Br(Ko) has order at most 2. By class 
field theory (see Remark 14.6.10), we conclude that either D = Ko or D is a (division) 
quaternion algebra over Ko. If D = Ko, we are in case (I), so suppose D is a quaternion 
algebra over Ky. We have D@g R = [],,).0 Dy a direct product of n quaternion algebras 
D, over R indexed by the real places v of Ko. We have Dy, ~ M2(R) or D, = H, and our 
positive involution induces a corresponding positive involution on each D,,. If there 
exists v such that D, = H, then by Corollary 8.4.10, the positive involution on D,, is the 
standard involution, so it is so on D, and then all components must have D, = H as the 
standard involution is not positive on Mj(R)—and we are in case (II). Otherwise, we 
are in case (III), with Proposition 8.4.7 and Example 8.4.15 characterizing the positive 
involution. oO 
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Exercises 


Let F be a field. 


1. Let Bı, B2 be quaternion algebras over F, with standard involution written ~ in 
both cases. Let A := Bı @ B2. 


(a) Show that the mapa: A — A defined by a; a2 + w180 for a; € Bı and 
a2 € B, extends to an involution on A, but it is not a standard involution. 
[Hint: consider sums. ] 

(b) Suppose that char F + 2. Diagonalize A = A* ® A™ into +1 and -1 
eigenspaces for o`. Show that 


A* =F @(B, @By) and A = (BI @F)O(F 8 B3). 


b 
> 2. Suppose char F + 2 and let By := (“2 


b 
) and Bz := (=>) be quaternion 


algebras over F. 


(a) Let B3 be the F-span of 1, i3 := i181, j3 := J18 j2, and k3 := i3j3 = ij, @j2 


a, bıbz 


inside Bı ® B2. Show that B3 = ) as F-algebras. 


(b) Similarly, let By be the F-span of 1, ig := 1 8 j2, ja := (i1 & k2)/a, and 
bo, -b 
ka := i4ją4. Show that By ~ (==) = M)(F). 
(c) Show that 
Bı & B2 = B3 ® B4 =~ Mo(B3). 


[Hint: Show that B3 and B4 are commuting subalgebras, or consider the 
map B3 & B4 —> Bı ® By given by multiplication. ] 
(d) Restore symmetry and repeat (a)-(c) to find algebras B} = B3 and B, Š 
bı, —b1 
F 


) with B1 ® By = B} ® BY ~ M2(B3). 


3. Suppose char F + 2. Show that Bı ® B2 ~ M4(F) if and only if the Albert form 
Q(B, B2) is totally hyperbolic. 


4. Let G be a finite group. Show that the map induced by g œ> g`! for g € G 
defines an positive involution on R[G]. Then show that this map composed with 
coordinatewise complex conjugation defines a positive involution on C[G] (as 
an R-algebra). 

> 5. Show that if ~ is the equivalence relation (8.3.1) on CSA(F), then ~ is compatible 
with tensor product, i.e., if A,A’ € CSA(F) and A’ ~ A” e CSA(F) then 
A®A’~A®@A". 

6. Show that every class in the Brauer group Br(F) contains a unique division 
F-algebra, up to isomorphism. 

7. Show that BrF = {1} if F is separably closed, and that Br(R) = Z/2Z and 
Br(F,) = {1}. 
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Let B € CSA(F) and suppose that B has an involution (not necessarily standard). 
Show that [B] has order at most 2 in Br F. 

Let K 2 F be a field extension. Show that the map A + A 8r K induces a group 
homomorphism Br F — Br K. Conclude that the set of isomorphism classes of 
central division F-algebras D such that D 8r K ~ M,,(K) for some n > 1 forms 
a subgroup of Br F, called the relative Brauer group Br(K | F). 

In this exercise, we give an example of a central simple algebra of infinite 
dimension, called the Weyl algebra. 

Suppose char F = 0, let F[x] be the polynomial ring over F in the variable x. 
Inside the enormous algebra End F[x] is the operator f(x) > x f(x), denoted 
also x, and the differentiation operator ô : F[x] — F[x]. These two operators 
are related by the product rule: 


O(x f(x) — xef (x) = fx). 


Accordingly, the subalgebra of Endr F[x] generated by ô, x is isomorphic to an 
algebra given in terms of generators and relations: 


W := F(6, x)/(6x — xô — 1), 


the quotient of the “noncommutative polynomial ring” in two variables F'(6, x) 
by the two-sided ideal generated by ôx — xô — 1. 


(a) Show that every element of W can be written in the form Xio filo" 
where f(x) € F[x] for all i, i.e., W has F-basis elements xt ô’ for i, j > 0. 

(b) Show that Z(W) = F. 

(c) Let I be a two-sided of W. Show that if there exists nonzero f(x) € F[x]AZ, 
then J = W. Similarly, show that if 6” € I for some n > 0, then J = W. 

(d) Show that W is simple. [Hint: argue by induction. ] 


Let B be a finite-dimensional R-algebra with positive involution *: B — B. Let 
P(B,*) := {u € B : yp” = wand pis positive definite for *}. 


(a) Show that B* acts on P(B,*) by B- u := 6* up. 

(b) Show that P(B,*) is a convex open subset of {æ € B : a* = a}, an R-vector 
subspace of B. 

(c) Let y: B > B be an R-algebra automorphism or anti-automorphism. 
Show that a7 := Yla) defines a positive involution for a € B, and 
that y maps P(B, *) bijectively to P(B, *). 
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Chapter 9 


Lattices and integral quadratic forms 


In many ways, quaternion algebras are like “noncommutative quadratic field exten- 
sions”: this is apparent from their very definition, but also from their description as 
wannabe 2 x 2-matrices. Just as the quadratic fields Q vd) are wonderously rich, so 
too are their noncommutative analogues. In this part of the text, we explore these 
beginnings of noncommutative algebraic number theory. 

In this chapter, we begin with some prerequisites from commutative algebra, 
embarking on a study of integral structures and linear algebra over domains. 


9.1 > Integral structures 


Just as we find the integers Z inside the rational numbers Q, more generally we want 
a robust notion of integrality for possibly noncommutative algebras: this is the theory 
of orders over a domain. 

We first have to understand the linear algebra aspects of this question. Let R be a 
domain with field of fractions F := Frac R, and let V be a finite-dimensional F-vector 
space. An R-lattice in V is a finitely generated R-submodule M c V with MF =V.IfR 
is a PID (for example, R = Z), then M is an R-lattice if and only if M = Rx, ®---®Rxy, 
where x1, ..., Xn is a basis for V as an F-vector space. 

Between M and V lies intermediate structures, where instead of allowing all denom- 
inators (in the field of fractions), we only allow certain denominators; these are the 
localizations of M. To fix ideas, suppose R = Z, so M = Z”; we call a Z-lattice simply 
a lattice. For a prime p, we define the localization of Z away from p to be 


Zip) = {a/bEQ: pt db} cQ. 


In the localization, we can focus on those aspects of the lattice concentrated at the 
prime p. Extending scalars, Mp) := M Zp) C V is a Z(p)-lattice in V, again called the 
localization of M at p. These localizations determine the lattice M in the following 
strong sense (Theorem 9.4.9). 


Theorem 9.1.1. (Local-global dictionary for lattices). Let V be a finite-dimensional 
Q-vector space, and let M C V be a lattice. Then the map N +> (Nip))p establishes a 


© The Author(s) 2021 139 
J. Voight, Quaternion Algebras, Graduate Texts in Mathematics 288, 
https://doi.org/10.1007/978-3-030-56694-4_9 


140 CHAPTER 9. LATTICES AND INTEGRAL QUADRATIC FORMS 


bijection between lattices N and collections of lattices (N(p))p (indexed by primes p) 
where Mp) = Nip) for all but finitely many primes p. 


By this theorem, the choice of “reference” lattice M is arbitrary. Because of the 
importance of this theorem, a property of a lattice that holds if and only if it holds over 
every localization is called a local property. 

Finally, often vector spaces come equipped with a measure of length, or more 
generally a quadratic form; we can restrict these to the lattice M c V with V a Q- 
vector space. More intrinsically, we define a quadratic form Q: M — Z to be a map 
satisfying: 


(i) O(ax) = a’ Q(x) for alla € Z and x € M, and 


(ii) The associated map T: M x M > Z by T(x, y) = Q(x + y) - Q(x) - Q(y) is 
(Z-)bilinear. 


Condition (i) explains (partly) the ‘quadratic’ nature of the map, and part (ii) is the usual 
way relating norms (quadratic forms) to bilinear forms. Choosing a basis e1, . . . , €n for 
M = Z", we may then write 


2 2 
O(xje] +++ + Xen) = AX} + a12X1X2 + +++ + AnnxXp E€ Aixi,- Xn] (9.1.2) 


as a homogeneous polynomial of degree 2. 


9.2 Bits of commutative algebra 


We begin with a brief review of some bits of commutative algebra relevant to our 
context: we need just enough to do linear algebra over (commutative) domains with 
good properties. Good general references for the basic facts from algebra we use 
(Dedekind domains, localization, etc.) are Atiyah-Macdonald [AM69], Matsumura 
[Mat89, Chapter 8], Curtis-Reiner [CR81, §1, §4], Reiner [Rei2003, Chapter 1], and 
Bourbaki [Bou98]. 

Throughout this chapter, let R be a (commutative) noetherian domain with field of 
fractions F := Frac R. 


9.2.1. An R-module P is projective if it is a direct summand of a free module; 
equivalently, P is projective if and only if every R-module surjection M — P of 
R-modules has a section, i.e., an R-module homomorphism g: P — M such that 
fog =idp. 

Accordingly, a free R-module is projective. A projective R-module M is necessarily 
torsion free over R, which is to say, if rx = 0 with r € R and x € M, then r = 0 or 
x=0. 


9.2.2. A fractional ideal of R is a nonzero finitely generated R-submodule b € F, or 
equivalently, a subset of the form b = da where a C R is a nonzero ideal and d € F™. 
Two fractional ideals a, b of R are isomorphic (as R-modules) if and only if there exists 
c € F* such that b = ca: indeed, given an isomorphism a = b, we may extend scalars 
to F to obtain an F-linear map F = F, which must be given by c € F™, and conversely. 
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A Dedekind domain is a noetherian, integrally closed domain such that every 
nonzero prime ideal is maximal. 


Example 9.2.3. A field or a PID is a Dedekind domain; in particular, the rings Z and 
Fp[t] are Dedekind domains. If F is a finite extension of Q or F p(t), then the integral 
closure of Z or Fp[t] in F respectively is a Dedekind domain. 


9.2.4. Suppose R is a Dedekind domain. Then a finitely generated R-module is pro- 
jective if and only if it is torsion free. Moreover, every nonzero ideal a of R can be 
written uniquely as the product of prime ideals (up to reordering). For every fractional 
ideal a of R, the set a7! := {a € F : aa C R} is a fractional ideal with aa”! = R. 
Therefore the set of fractional ideals of R forms a group under multiplication. The 
set of principal fractional ideals comprises a subgroup, and we define Cl R to be the 
quotient, or equivalently the group of isomorphism classes of fractional ideals of R. 


9.3 Lattices 


Let V be a finite-dimensional F-vector space. 


Definition 9.3.1. An R-lattice in V is a finitely generated R-submodule M C V with 
MF = V. We refer to a Z-lattice as a lattice. 


The condition that MF = V is equivalent to the requirement that M contains a 
basis for V as an F-vector space. 


Example 9.3.2. An R-lattice in V = F is the same thing as a fractional ideal of R. 


We will be primarily concerned with projective R-lattices; if R is a Dedekind 
domain, then a finitely generated R-submodule M C V is torsion free and hence 
automatically projective (9.2.4). 


9.3.3. If there is no ambient vector space around, we will also call a finitely generated 
torsion free R-module M an R-lattice: in this case, M is a lattice in the F-vector space 
M Qpr F because the map M —> M 8p F is injective (as M is torsion free). 


Remark 9.3.4. Some authors omit the second condition in the definition of an R-lattice 
and say that M is full if MF = V. We will not encounter R-lattices that are not full 
(and when we do, we call them finitely generated R-submodules), so we avoid this 
added nomenclature. 


By definition, an R-lattice can be thought of an R-submodule that “allows bounded 
denominators’, as follows. 


Lemma 9.3.5. Let M C V be an R-lattice and let J C V be a finitely generated 
R-submodule. Then the following statements hold. 


(a) For all x € V, there exists nonzero r € R such that rx € M. 
(b) There exists nonzero r € R such thatrJ C M. 
(c) J is an R-lattice if and only if there exists nonzero r € R such thatrM CJ C 
=1 
rM. 
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Proof. First (a). Since FM = V, the R-lattice M contains an F-basis x1,..., Xn for V, 
so in particular M 2 Rx; ©--:@® Rx». For all x € V, writing x in the basis x1, .. ., Xn 
and clearing (finitely many) denominators, we conclude that there exists nonzeror € R 
such that rx € M. 

For (b), let y1, . . ., Ym generate J as an R-module; then for each i, there exist r; € R 
nonzero such that r;y; € M hence r := [];r; + 0 satisfies rJ C M, and therefore 
J c r`!M. For (c), we repeat (b) with M interchanged with J to find nonzero s € R 
such that sM C J, so then 


rsMCsMCJCr!MC (rs) 'M. o 


For the rest of this section, we suppose that R is a Dedekind domain and treat lattices 
over R; for further references, see Curtis—Reiner [CR62, §22], O’ Meara [O’Me73, §81], 
or Frohlich—Taylor [FT91, $11.4]. It turns out that although not every R-lattice has a 
basis, it can be decomposed as a direct sum, as follows. 


Theorem 9.3.6. Let R be a Dedekind domain, let M C V be an R-lattice and let 
Yi,- --, Yn be an F-basis for V. Then there exist xı, ..., Xn E€ M and fractional ideals 
1, . - -, An such that 

M = mx D: O AnXn (9.3.7) 


and x; € Fy, +++-+ Fy; for j =1,...,n. 


Accordingly, we say that every R-lattice M is completely decomposable (as a 
direct sum of fractional ideals), and we call the elements x), ..., x, a pseudobasis for 
the lattice M with respect to the coefficient ideals aj, . . ., an. The matrix with rows x; 
in the basis y; is lower triangular by construction; without loss of generality (rescaling), 
we may suppose that the diagonal entries are equal to 1, in which case we say that the 
pseudobasis for M is given in Hermite normal form. 

More generally, if M = axı ++: + Xm, the sum not necessarily direct, then we 
say that the elements x; are a pseudogenerating set for M with coefficient ideals a;. 


Proof of Theorem 9.3.6. We argue by induction on n, the case n = 1 corresponding to 
the case of a single fractional ideal. 
Let W := Fy, +--+ + Fyn-1, and let N = M N W. Then there is a commutative 
diagram 
0 >N >M > M/N —~0 


Tt oo 


0 W V V/W ——>0 


Since N = W N M, we have M/N — V/W, and V/W = F projecting onto Fyn. 
Since M/N is nonzero and finitely generated, by 9.2.4 we conclude M/N ~ aC F 
is a fractional ideal, hence projective. Therefore the top exact sequence of R-modules 
splits (the surjection has a section), so there exists x € M such that M = N ẹ@ ax as 
R-modules. The result then follows by applying the inductive hypothesis to N. m 
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An argument generalizing that of Theorem 9.3.6 yields the following [O’Me73, 
81:11]. 


Theorem 9.3.9. (Invariant factors). Let R be a Dedekind domain and let M,N C V 
be R-lattices. Then there exists a common pseudobasis xı, . . ., xn for M, N; i.e., there 
exists a basis x1, . . ., Xn for V and fractional ideals a,,..., an and b,,..., 6, such that 
M = x1 ®-+:®anXn 
N = bixi ®-:-@ by xy, 


Moreover, letting 0; := ba," we may further take dı | ++: | dn, and then such 0; are 
unique. 
The unique fractional ideals 0),...,0, given by Theorem 9.3.9 are called the 


invariant factors of N relative to M. 


9.3.10. Let M C V be an R-lattice with pseudobasis as in (9.3.7). The class [aj --- an] € 
CI R is well-defined (Exercise 9.7) and called the Steinitz class. 

In fact, if we do not require that x; € Fy; + +--+ Fy; for j = 1,...,n in Theorem 
9.3.6, then we can find a pseudobasis for M with a; =: = aņn-1 = R, i.e., 


M = Rx, ®--- ® RXn-1 ® AXn 


with [a] the Steinitz class of M. 


9.4 Localizations 


Properties of a domain are governed in an important way by its localizations, and 
consequently the structure of lattices, orders, and algebras can often be understood by 
looking at their localizations (and later, completions). 

For a prime ideal p € R, we denote by 


Ro = {r/s€F:s¢p}CF (9.4.1) 


the localization of R at p. (We reserve the simpler subscript notation for the completion, 
defined in section 9.5.) 


Example 9.4.2. If R = Z and p = (2), then Ri) = {r/s € Q : s is odd} consists of the 
subring of rational numbers with odd denominator. 


Since R is a domain, the map R — Rọ) is an embedding and we can recover R as 


an intersection 
R =( Rw =( Rm E F (9.4.3) 
p m 


where the intersections are over all prime ideals of R and all maximal ideals of R, 
respectively. 
Let V be a finite-dimensional F-vector space and let M C V be an R-lattice. For a 
prime p of R, let 
Mp) := MR) SV 
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be the extension of scalars of M over Rip); identifying V = MF =~ M 8r F under 
multiplication, we could similarly define 


Mw) ‘= M 8r Rop). 


In either lens, Mp) is an Rip)-lattice in V. In this way, M determines a collection 
(M(p))p indexed over the primes p of R. 


9.4.4. Returning to 9.2.1, a finitely generated R-module M is projective if and only if 
it is locally free, i.e., Mp) is free for all prime ideals of R. 

The ability to argue locally and then with free objects is very useful, and so very 
often we will restrict our attention to projective (equivalently, locally free) R-modules. 


9.4.5. The localization of a Dedekind domain R is a discrete valuation ring (DVR). A 
DVR is equivalently a local PID that is not a field. In particular, a DVR is integrally 
closed and every finitely generated module over a DVR is free. 

Consequently, if R is a Dedekind domain, then every fractional ideal of R is locally 
principal, i.e., if a C F is a fractional ideal, then for all primes p of R we have 
Ap) = ap Rip) for some ap € F*. 


We now prove a version of the equality (9.4.3) for R-lattices (recall Definition 
93T): 


Lemma 9.4.6. Let M be an R-lattice in V. Then 
M =( Mo =f Mm EV 
p m 
where the intersection is over all prime (maximal) ideals p. 


Proof. It suffices to prove the statement for maximal ideals since Mm) E Mep) when- 
ever m D p. The inclusion M C (m Mum) is clear. Conversely, let x € V satisfy 
x € Mm) for all maximal ideals m. Let 


(M:x):={reR:irxe M}. 


Then (M : x) is an ideal of R. For a maximal ideal m of R, since x € Mm) there exists 
0 £ rm E R\m such that rmx € M. Thus rm € (M : x) and (M : x) is not contained in 
any maximal ideal of R. Therefore (M : x) = R and hence x € M. o 


Corollary 9.4.7. Let M,N be R-lattices in V. Then the following are equivalent: 


(i) MCN; 
(ii) Mop) © Np) for all prime ideals p of R; and 
(iii) Mim) © Nim) for all maximal ideals m of R. 


Proof. The implications (i) > (ii) > (iii) are direct; for the implication (iii) > (i), we 
have M = (p Mím) E Np Mm) = N by Lemma 9.4.6. m 


In particular, it follows from Corollary 9.4.7 that M = N for R-lattices M, N if and 
only if M) = Nop) for all primes p of R. 
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9.4.8. A property that holds if and only if it holds locally (as in Corollary 9.4.7, for the 
property that one lattice is contained in another) is called a local property. 


To conclude this section, we characterize in a simple way the conditions under 
which a collection (M(p))p of Rip)-lattices arise from a global R-lattice. We will see 
that just as a nonzero ideal of R can be factored uniquely into a product of prime 
ideals, and hence by the data of these primes and their exponents, so too can a lattice 
be understood by a finite number of localized lattices, once a “reference” lattice has 
been chosen (to specify the local behavior of the lattice at other primes). 


Theorem 9.4.9. (Local-global dictionary for lattices). Let R be a Dedekind domain, 
and let M C V be an R-lattice. Then the map N > (Nq))p establishes a bijection 
between R-lattices N C V and collections of lattices (Np))p indexed by the primes p 
of R satisfying M) = Nip) for all but finitely many primes p. 


In Theorem 9.4.9, the choice of the “reference” lattice M is arbitrary: if M’ is 
another lattice, then by Theorem 9.4.9 Mp) = Ms) for all but finitely many primes p, 
so we get the same set of lattices replacing M by M”. In particular, any lattice N C V 
agrees with any other one at all but finitely many localizations. 


Remark 9.4.10. In Theorem 9.4.9, there is a bit of notational abuse: when we write a 
collection (Nep))p, we do not mean to imply that there is (yet) an R-lattice N such that 
the localization of N at p is equal to Nip). This conclusion is what is provided by the 
theorem (the statement of surjectivity), so the notational conflict is only temporary. 


Proof of Theorem 9.4.9. Let N C V be an R-lattice. Then there exists 0 + r € R such 
that rM C N Cr7!M. Butr is contained in only finitely many prime (maximal) ideals 
of R, so for all but finitely many primes p, the element r is a unit in Rp) and thus 
Mop) = Nip): 

So consider the set of collections (Np))p of lattices where Nip) is an Rip)-lattice for 
each prime p with the property that Mp) = N(p) for all but finitely many primes p of 
R. Given such a collection, we define N = (Np Mp) E V. Then N is an R-submodule 
of V. We show it is an R-lattice in V. For each p such that Mp) # Np), there exists 
rp E€ R such that rp Mp) E Np) E rp Mo). Therefore, if r = []p rp is the product of 
these elements, then rMo) E N S r! Mop) for all primes p with Mep) # Nip). On the 
other hand, if Mp) = Np) then already rM) © Mp) = N) E r !No) = r! Mop). 
Therefore by Corollary 9.4.7, we have rM C N Cr7!M, and so N is an R-lattice. 

By Lemma 9.4.6, the association (Np))p > Mp Nop) is an inverse to N > (Nip))p- 
Conversely, given a collection (Np))», for a nonzero prime p, we have (Na Na) 


(p) > 
Nip) Since (Rg) = F so (Nq)(p) = V whenever q # p. Oo 


9.5 Completions 


Next, we briefly define the completion and show that the local-global dictionary holds 
in this context as well. (We will consider completions in the context of local fields 
more generally starting in chapter 12, so the reader may wish to return to this section 
later.) For a general reference on completions (and the induced topology), see e.g. 
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Atiyah—-Macdonald [AM69, Chapter 10], Matsumura [Mat89, Chapter 8], Bourbaki 
[Bou98, Chapter III, §3]. To avoid diving too deeply into commutative algebra we 
suppose that R is a DVR, with maximal ideal p: for example, we might take the 
localization of a Dedekind domain R at a prime ideal by 9.4.5. There is a natural 
system of compatible projection maps R/p”"*! > R/p” indexed by integers n > 1, and 
we define the completion of R at p to be the inverse (or projective) limit under this 
system: 


Ry := lim R/p 


n 


5 (9.5.1) 
:= fa = (an)n € I] R/p” : an+1 = Gn (mod p”) for all n > 1} : 


n=1 


The completion Rp is again a commutative ring, and we have a natural map R — Rp 
defined by a > (a)n. Since R has a discrete valuation we have ọọ p” = {0}, so this 
map is injective. Moreover, since p is maximal, then in fact this inclusion factors via 
R => Rp) > Ry inducing isomorphisms (Exercise 9.8) 


R/p® => Rp)/p° Rp) > Ry/p° Rp (9.5.2) 


for all e > 1; in particular, the operation of completion is in a sense ‘stronger’ than 
the operation of localization, and the valuation on R extends naturally to Ry, so 
R = F N Ry C Fy. However, once local the completion looks rather similar in the 
context of lattices, as follows. Let Fy := F 8r Rp and Vp := V 8r Fy. 


Lemma 9.5.3. Let R be a DVR with maximal ideal p € R. Then the maps 
Mp) => Mp = Mg) SRo) Rp 


(9.5.4) 
My NV = My 


are mutually inverse bijections between the set of R-lattices in V and the set of Ry- 
lattices in Vp. 


Proof. Let M C V bean R-lattice. By 9.4.5 we have M ~ R” free over R; choose a basis 
M = Rx, ®---ORxp. Then My = M@RRy ~ Ry X1B---ORy Xn. Let M’ := MyNV C Vp. 
Then x’ € M’ if and only if x’ = axı +--+ + anXn with a; € Ry N F = R, so indeed 
M’ = M. Conversely, let My © Vp and let M’ := My N V. Then (M’), E My, and 
we prove the opposite inclusion. First, a bit of setup. Let y),..., Yn be an F-basis for 
V, and let N = Ry, @---@ Ryn. By Lemma 9.3.5, there exists nonzero r € Ry such 
that rN, C My © rN»; Choosing an element s € R with the same valuation as r, we 
have r/s € R% so in fact may suppose that r € R. Rescaling the basis vectors y; and 
replacing r? by r we may suppose that (rN )p = rN E My C Ny. From the previous 
paragraph, we have rNy = rN r Ry C M}. Letting (r) = p° and taking (9.5.2) on each 
coordinate, we have an isomorphism y: N/rN = Ny/rNy induced from the natural 
inclusion N —> Ny. Now to show the inclusion, let y € My. Let x € N C V be such that 
g(x +rN) = y + rNp; lifting to Np, we find that there exists z € rNp S (M’)y C My 
such that x = y + z € Mp N V, so y = x — z E€ (M”)p. o 
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In particular, Lemma 9.5.3 implies that in the local-global dictionary for lattices 
over a Dedekind domain R (Theorem 9.4.9), we may also work with collections of 
Ry-lattices (Np), over the completions at primes. 


9.6 Index 


Continuing with R a noetherian domain, let M, N C V be R-lattices. 


Definition 9.6.1. The R-index of N in M, written [M : N]p, is the R-submodule of F 
generated by the set 


{det(6) : 6 € Endz (V) and 6(M) € N}. (9.6.2) 


The style of Definition 9.6.1, given by a large generating set (9.6.2), is the replace- 
ment for being able to work with given bases; this style will be typical for us in what 
follows. The determinants det(6) are meant in the intrinsic sense, but can be computed 
as the determinant of a matrix upon choosing a basis for V. 


Lemma 9.6.3. The index [M : N]r is a nonzero R-module, and if a € Aut (V) then 
[aM : N] = det(a)![M : N]. 


Proof. Exercise 9.10. o 


Lemma 9.6.4. If M, N are free (as R-submodules), then [M : N]pr is a free R-module 
generated by the determinant of any ô € Endp(V) giving a change of basis from 
MtoN. 


Proof. Let x,..., x, bean R-basis for M, thereby an F-basis for V. Let y1, . . ., Yn be an 
R-basis for N; then the map x; +> y; first extends to an R-linear isomorphism M æ N 
and thereby to an F-linear map 6 € Endr (V), and of course 6(M) C N by construction, 
so det(o) € [M : N]r. Conversely, let 6’ € Endr(V) be such that 6’(M) € N. The 
map 6’6-': N —> N is an R-linear map, so det(5’6~!) = det(6’)det(6)"! € R, so 
det(6”) € det(O)R. o 


Example 9.6.5. If N = rM withr € R, then [M : N]r = r"R where n = dimp V. 
Example 9.6.6. If R = Zand N C M, then [M : N]zis the ideal generated by #(M / N), 
the usual index taken as abelian groups. In this case, for convenience we will often 
identify [M : N]z with its unique positive generator. 


Forming the R-index commutes with localization, as follows. 


Lemma 9.6.7. Let p be a prime of R. Then 


[Mp : Np)lRo) = (IM : NIR). 
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Proof. If 6(M) S N, then 6(Mip)) © Np) by Rip)-linearity, giving the inclusion (2). 
For (C), let 6 € Endz (V) be such that 6(M(y)) E Mp). For any x € M, we have 6(x) € 
0(M) © 6(M(p)) E Np), so there exists y € N and s € R\p such that sô(x) = y E€ N. 
Let x1, .. ., Xm generate M as an R-module, and for each i, let s; € R\p be such that 
5;0(x;) € N. Let s := [], 5;. Then sô(M) C N, so det(sd) = s” det(6) € [M _: N]p, if 


n := dimp V. Finally, s € Roy we conclude that det 6 € ([M : N]r)ip), as desired. O 


Proposition 9.6.8. Suppose that M, N are projective R-modules. Then [M : N]p is a 
projective R-module. Moreover, if N C M then [M : N]r = Rif and only if M = N. 


Proof. Let p be a prime of R and consider the localization ([M : N]r)p) at p. Since 
M,N are projective R-modules, they are locally free (9.2.1). By Lemma 9.6.4, the 
local index [M(p) : Mp)]R,,) iS a principal R(p)-ideal. By Lemma 9.6.7, we conclude 
that [M : N]p is locally principal, therefore projective. 

The second statement follows in a similar way: we may suppose that R is local and 
thus N C M are free, in which case M = N if and only if a change of basis matrix 
from N to M has determinant in R*. o 


For Dedekind domains, the R-index can be described as follows. 


Lemma 9.6.9. If R is a Dedekind domain and N C M, then [M : Np is the product 
of the invariant factors (or elementary divisors) of the torsion R-module M/N. 


Proof. Exercise 9.12. m 


9.7 Quadratic forms 


In setting up an integral theory, we will also have need of an extension of the theory 
of quadratic forms integrally, generalizing those over fields (Section 4.2). For further 
reading on quadratic forms over rings, we suggest the books by O’Meara [O’Me73], 
Knus [Knu88], and Scharlau [Scha85]. 


Definition 9.7.1. A quadratic map is a map Q: M — N between R-modules, satis- 
fying: 
(i) O(rx) = r?Q(x) for allr € R and x € M; and 
(ii) The map T: M x M — N defined by 
T(x, y) = Q(x + y) - Q(x) - Q) 
is R-bilinear. 
The map T in (ii) is called the associated bilinear map. 


Remark 9.7.2. The bilinearity condition (ii) can be given purely in terms of Q: we 
require 


Oxt+y+z)=Oxt+y)+ Q(x +z) + Oy + 2) - Ax) — AVY) - A(z) 
for all x, y,z € M. 
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Definition 9.7.3. A quadratic module over R is a quadratic map Q: M — L where 
M is a projective R-module of finite rank and L is a projective R-module of rank 1. A 
quadratic form over R is a quadratic module with codomain L = R. 

A quadratic module Q: M — L is free if M and L are free as R-modules, and a 
quadratic form Q: M —> R is free if M is free as an R-module. 


Example 9.7.4. Let Q: V — F be a quadratic form. Let M C V be an R-lattice such 
that Q(M) C L where L is an invertible R-module. (When R is a Dedekind domain, 
we may take L = Q(M), see Exercise 9.13.) Then the restriction Q|y: M —> Lisa 
quadratic module over R. 

Conversely, ifQ: M — Lisaquadratic module over R, then the extension Q: MOR 
F — L @p F ~ F is a quadratic form over F. Moreover, at the slight cost of some 
generality (replacing an object by an isomorphic one), by choosing an isomorphism 
L®@prF ~ F we may suppose that Q takes values in an invertible fractional ideal [ c F. 


Example 9.7.5. If Q: M — L is a quadratic module and a € R is a projective R-ideal, 
then Q extends naturally by property (i) to a quadratic module aM — a?L. 


Definition 9.7.6. A similarity between two quadratic modules Q: M — L and 
Q’: M’ > L’ isa pair of R-module isomorphisms f: M æ M’ and h : L — L’ such 
that Q’(f(x)) = h(Q(x)) for all x € M, i.e., 
such that the diagram 


M-L OTT 
ls is 
M’ e L’ 

commutes. An isometry between quadratic modules is a similarity with L = L’ and h 


the identity map. 


Definition 9.7.8. Let Q: M — L be a quadratic module over R. Then Q is nondegen- 
erate if the R-linear map 
T: M > Homg(M, L) 


(9.7.9) 
x O T(x, y)) 


is injective; and Q is nonsingular (or regular) if the map (9.7.9) is an isomorphism. 


Example 9.7.10. If R = F is a field, then (by linear algebra) Q is nondegenerate if and 

only if Q is nonsingular. 

Example 9.7.11. A quadratic module is nondegenerate if and only if its base extension 
QOr:MƏRF>LQRF=>F 


is nondegenerate, since the kernel can be detected over F. Recalling the definition 
of discriminant (Definition 4.3.3 for char F + 2 and Definition 6.3.1 in general), we 
conclude that Q is nondegenerate if and only if disc QF # 0. 
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The apparent notion of discriminant of a quadratic module needs some care in 
its definition in this generality; it is delayed until section 15.3, where discriminantal 
notions are explored in some detail. 


Example 9.7.12. Borrowing from the future (see Lemma 15.3.8): if M = R” is free, 
then choosing a basis for M and computing (half-)discriminant disc Q, we will see that 
M is nonsingular if and only if disc Q € R*. 


We now define the notions of genus and classes. 


Definition 9.7.13. Let Q: M — L be a quadratic module. The genus Gen Q is the set 
of quadratic modules that are locally isometric to Q, i.e., Qn) ~ Qp) for all primes 
p C R. The class set Cl Q is the set of isometry classes in the genus. 


We conclude with some comments on the codomain of a quadratic map. 


Definition 9.7.14. A quadratic module Q: M —> L is primitive if Q(M) generates L 
as an R-module. 


9.7.15. If Q: R” — R is a quadratic form, written 


O(%1,..-,Xn) = yy ayjzxiX; E€ R[X,..., Xn], 


l<i<j<n 


then Q is primitive if and only if the coefficients a;; generate the unit ideal R. 

If R is a Dedekind domain, then Q(M) C L is again projective (locally at a prime 
generated by an element of minimal valuation), so one can always replace Q: M —> L 
by Q: M — Q(M) to get a primitive quadratic module; when R is a PID, up to 
similarity we may divide through by greatest common divisor of the coefficients a;; in 
the previous paragraph. 


9.7.16. In our admittedly abstract treatment of quadratic modules so far, we have 
specifically allowed the codomain of the quadratic map to vary at the same time as the 
domain—in particular, we do not ask that they necessarily take values in R. 


Remark 9.7.17. In certain lattice contexts with R a Dedekind domain, a quadratic form 
with values in a fractional ideal a is called an a-modular quadratic form. Given the 
overloaded meanings of the word modular, we do not employ this terminology. In the 
geometric context, a quadratic module is called a line-bundle valued quadratic form. 
Whatever the terminology, we will see in Chapter 22 that it is important to keep track 
of the codomain of the quadratic map just as much as the domain, and in particular we 
cannot assume that either is free when R is not a PID. 


9.8 Normalized form 


To conclude this chapter, we discuss an explicit normalized form for quadratic forms. 
Let R be a local PID; then R is either a field or a DVR. In either case, R has valuation 
v: R > Zso U {oo} and uniformizer 2; when R is a field, we take a trivial valuation 
and z = 1. 
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Let Q: M —> R bea quadratic form over R. Then since R is a PID, M = R” is free. 
We compute a basis for M in which Q has a particularly nice form, diagonalizing Q 
as far as possible. In cases where 2 € R*, we can accomplish a full diagonalization; 
otherwise, we can at least break up the form as much as possible, as follows. For 
a, b,c € R, the quadratic form Q(x, y) = ax? + bxy + cy? on R? is denoted [a, b, c]. 


Definition 9.8.1. A quadratic form Q over R is atomic if either: 
(i) Q = (a) for some a € R*, or 
(ii) 2 ¢ R* and Q = [a, b, c] with a, b,c € R satisfying 
v(b) < v(2a) < v(2c) and v(a)v(b) = 0. 


In case (ii), we necessarily have v(2) > 0 and v(b? — 4ac) = 2v(b). 


Example 9.8.2. Suppose R = Zz is the ring of 2-adic integers, so that v(x) = ord2(x) 
is the largest power of 2 dividing x € Z2. Recall that Z% JZ% is represented by the 
elements +1, +5, therefore a quadratic form Q over Zz is atomic of type (i) above 
if and only if Q(x) = +x? or Q(x) = +5x?. For forms of type (ii), the conditions 
v(b) < v(2a) = v(a) + 1 and v(a)v(b) = 0 imply v(b) = 0, and so a quadratic 
form Q over Zz is atomic of type (ii) if and only if Q(x, y) =~ ax? + xy + cy? with 
ordz(a) < ord2(c). Replacing x by ux and y by u`!y for u € Z% we may suppose 
a = +2' or a = +5 -2' with ¢ > 0, and then the atomic representative [a, 1, c] of the 
isomorphism class of Q is unique. 


A quadratic form Q is decomposable if Q can be written as the orthogonal sum 
of two quadratic forms (Q ~ Qı = Q2) and is indecomposable otherwise. It follows by 
induction on the rank of M that Q is the orthogonal sum of indecomposable forms. We 
will soon give an algorithmic proof of this fact and write each indecomposable form 
as a scalar multiple of an atomic form. We begin with the following lemma. 


Lemma 9.8.3. An atomic form Q is indecomposable. 


Proof. If Q is atomic of type (i) then the space underlying Q has rank 1 and is therefore 
indecomposable. Suppose Q = [a, b,c] is atomic of type (ii) and assume for purposes 
of contradiction that Q is decomposable. It follows that if x, y € M then T(x, y) € 2R. 
Thus we cannot have v(b) = 0, so v(a) = 0, and further v(b) > v(2) = v(2a); this 
contradicts the fact that Q is atomic. oO 


Proposition 9.8.4. Let R be a local PID and let Q: M —> R be a quadratic form. Then 
there exists a basis of M such that the form Q can be written 


Q =n’ Qim... an” 


where the forms Q; are atomic and Q < e] < ++ < en < œ. 


In the above proposition, we interpret 7° = 0. A form as presented in Proposition 
9.8.4 is called normalized; this normalized form need not be unique. 
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Proof. When R = F is a field with char F # 2, we are applying the standard method 
of Gram-Schmidt orthogonalization to diagonalize the quadratic form. This argument 
can be adapted to the case where R = F is a field with char F = 2, see e.g. Scharlau 
[Scha85, §9.4]. For the general case, we make further adaptations to this procedure: 
see Voight [Voi2013, Algorithm 3.12] for a constructive (algorithmic) approach. o 


Exercises 


Let R be a noetherian domain with field of fractions F := Frac R. 


1. 


> 2. 


> 9, 


Let V be a finite-dimensional F-vector space and let M,N C V be R-lattices. 
Show that M + N and M N N are R-lattices. 

Let B be an F-algebra and let J C B be an R-lattice. Show that there exists a 
nonzeror E€ RAT. 


. Give an example of a non-noetherian ring R and modules N C M such that M 


is finitely generated but N is not finitely generated. 


. Let k be a field and R = k[x, y]. Show that the R-module (x, y) is not projective. 
. Let R be a Dedekind domain. Show that every ideal of R is projective, as 


follows. Let a C R be a nonzero ideal. (The zero ideal is trivially projective.) 


Since aa! = R, we may write 1 = )}?_, a,b; with a; € a and b; € al. 


(a) Define the map ¢: R” > a by @(x1,...,%n) = Dj_, aixi. Observe that ø 
is an R-module homomorphism, and construct a right inverse y to ¢, i.e., 


oy = idg. 


(b) Using (a), show that a is a direct summand of R”, so a is projective. 


. Let m C R be a maximal ideal and let M be a finitely generated R-module. Let 


anng M := {r € R : rx = 0 forall x € M} 


be the annihilator of M. Show that Mim) = {0} if and only if m + anng M = R. 


. Suppose R is a Dedekind domain. Let V be a finite-dimensional F-vector space 


and let M C V be an R-lattice. Given a pseudobasis M = a,x; ®--: ® AnXn as 
in (9.3.7), let [a] ---a,] € Cl R. Show that this class (the Steinitz class, 9.3.10) 
is well-defined for M independent of the choice of pseudobasis. 


. Let R be a DVR with maximal ideal m. Show that if s ¢ m then 1/s € Rm, so 


there are natural inclusions 
R => Rm) => Rm 
from the domain into its localization into the completion, inducing isomorphisms 
R/p? © Rp)/pP Rep) > Rp/P Rp 


forall e > 1. 

Let V be a finite-dimensional vector space over F and M C V an R-lattice. Let 
p be a prime of R. Show that if Mp) E V is an Ry)-lattice then My N V = Mop). 
Conclude that Lemma 9.5.3 holds. 
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10. 


11. 
12. 


13. 


14. 


15. 


16. 


Let V be a finite-dimensional F'-vector space and let M, N C V be R-lattices. 


(a) Show that the index [M : N]r is a nonzero R-module. [Hint: use Lemma 
9.3.5. ] 


(b) For a € Autr (V), show [aM : N] = det(@) t [M : N]. 


Find R-lattices M, N C V such that [M : N]r = R but M + N. 

Prove Lemma 9.6.9, as follows. Suppose R is a Dedekind domain, and let 
N CM C V be R-lattices in a finite-dimensional vector space V over F. Prove 
that [M _: N]p is the product of the invariant factors (or elementary divisors) of 
the torsion R-module M/N. 

Suppose R is a Dedekind domain. Let Q: V > F be a quadratic form over F, 
let M C V be an R-lattice, and let L := Q(M) C F be the R-submodule of F 
generated by the values of Q. Show that L is a fractional R-ideal. 

Consider the ternary quadratic form Q(x, y,z) = xy + xz over Z2. Compute a 
normalized form for Q. 

Consider the following ‘counterexamples’ to Theorem 9.4.9 for more general 
integral domains as follows. Let R = Q[x, y] be the polynomial ring in two 
variables over Q, so that F = Q(x, y). Let V = F and 7 = R. 


(a) Show that yR has the property that yRp # Rp for infinitely many prime 
ideals p of R. 


(b) Consider the collection of lattices given by Jp = f(x)Rp if p = (y, f(x) 
where f(x) € Q[x] is irreducible and Jp = Ry otherwise. Show that 


Np Jp = (0). 


[Instead, to conclude that a collection (Jy), of Ry-lattices arises from a global 
R-lattice J, one needs that the collection forms a sheaf.] 

In this advanced exercise, we consider generalizations of the notion of lattices to 
a geometric context; we assume background in algebraic geometry at the level 
of Hartshorne [Har77, Chapter IT]. 


Let X be a separated, integral scheme—so for each open U, the ring @x(U) is 
a(n integral) domain—and let Gx be its structure sheaf. Let F be the function 
field of X (so F = @x({n}) where 7 is the generic point of X). Let V be a 
finite-dimensional F-vector space. 


Define a sheaf of @y-lattices in V (also called an Ox-lattice in V), to be a sheaf 
M of Cx-modules such that for each affine open set U C X, the set (U) is an 
Ox(U)-lattice in V. As usual, for P € X a point, we denote by -%p) the stalk of 
M at P. 


(a) Show that a sheaf of Ox-lattices in V is naturally a subsheaf of the constant 
sheaf V over X. 

(b) Let X = U; U; be an affine open cover of X, with U; = Spec R;. Since X 
is separated, each intersection U; N U; = Spec R;; is affine, so there are 
natural inclusions R;, R; <> Rij © F of rings for each i, j. Show that a 
sheaf of @y-lattices is specified uniquely by R;-lattices M; © V for each i, 
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subject to the condition that M;R;; = M;R,; for each i, j. [Hint: this is an 
easy case of gluing, where isomorphism is replaced by equality in V.] 
Now suppose further that X is noetherian, normal, and of dimension < 1 
(also called a Dedekind scheme). Then the local rings of X at closed 
points are DVRs with fraction field F, and nonempty affine open subsets 
of X are the complements of finite subsets of closed points and of the 
form U = Spec R with R an Dedekind domain. (For example, we may take 
X = Spec R for R a Dedekind domain or X a smooth projective integral 
curve over a field.) 


(c 


wa 


Extend the local-global dictionary for lattices to X, in the following way. 
Let U = Spec R C X be a nonempty affine open subset, and let M C V 
be an R-lattice. Show that the map “~ — (.%p))p establishes a bijection 
between @x-lattices VY in V and collections of lattices (Np))p indexed 
by the points P € X, such that for all but finitely many P € U given by the 
prime p C R, we have Mp) = Np) C V. 
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Chapter 10 


Orders 


In this chapter, continuing with a second background installment, we study when 
lattices over a domain are closed under a multiplication law: these will be orders, an 
integral analogue of algebras over fields. 


10.1 > Lattices with multiplication 


We begin with a brief indication of the theory of orders over the integers. Let B be a 
finite-dimensional Q-algebra. An order O C B is a lattice that is also a subring of B 
(in particular, 1 € O). The property of being an order is a local property for a lattice, 
i.e., one may check that it is closed under multiplication in every localization Oy»), for 
p prime. 

An order is maximal if it is not properly contained in another order. For example, 


3 


b 
if we start with the quaternion algebra B := 22) with a,b € Z nonzero, then the 


lattice 
O:=Z+Zi+Zj+Zij CB 


is closed under multiplication, and so defines an order—but it is never a maximal order. 


An important construction of lattices comes about as follows: if J C B is a lattice, 
then 
OL :={@eEB:al ch} 


is an order, called the left order of 7; we similarly define the right order. 

If O C B is an order and a € O, then a is integral (over Z), satisfying a monic 
polynomial with integer coefficients. If B is a quaternion algebra, then œ € B satisfies 
its reduced characteristic polynomial of degree 2, and a is integral if and only if 
trd(a),nrd(@) € Z (Corollary 10.3.6). When B = F is a number field, the most 
important order in F is the ring of integers, the set of all integral elements: it is the 
unique maximal order. 

Unfortunately, this construction does not work in the noncommutative setting: the 
set of all integral elements does not form an order. For one thing, if O C Bisa 
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maximal order and œ € B*, then a@Oa~! C B is a maximal order and when B is 
noncommutative, we may have «Oa! + O. But there are more serious problems, as 
the following example indicates. 


0 0 1/2 0 
a’ = B* = 0, so a, B are integral over R = Z, but a + £ and af are not integral since 
nrd(@ + 6) = —1/4 and trd(@B) = 1/4. (Such a counterexample does not require the 
existence of zerodivisors: see Exercise 10.10.) 


Example 10.1.1. Let B = M2(Q) and let a = b o and 8 = | k | Then 


Understanding orders in quaternion algebras is a major task of this second part of 
the text. In the simplest case B = M>(Q), every maximal order is conjugate (and thus 
isomorphic) in B to M2(Z). The reader may wish to skip ahead to Chapter 11 to get to 
know the Hurwitz order before returning to study orders more generally. 


10.2 Orders 


Throughout, let R be a domain with field of fractions F := Frac(R), and let B be 
a finite-dimensional F-algebra. For further reference about orders (as lattices), see 
Reiner [Rei2003, Chapter 2] and Curtis—Reiner [CR81, $§23, 26]. 


Definition 10.2.1. An R-order O C B is an R-lattice that is also a subring of B. 


In particular, if O is an R-order, then since O is a subring we have 1 € O, and since 
O is an R-module we have R c O. We will primarily be concerned with R-orders that 
are projective as R-modules, and call them projective R-orders. 


10.2.2. An R-algebra is a ring O equipped with an embedding R ~ O whose image 
lies in the center of O. An R-order O has the structure of an R-algebra, and if O is 
an R-algebra that is finitely generated as an R-module, then O is an R-order of the 
F-algebra B = O @p F. 


Example 10.2.3. The matrix algebra M„(F) has the R-order M,,(R). The subring 
R[G] = Doce Rg is an R-order in the group ring F[G]. 


Example 10.2.4. Let a, b € R\{0} and consider the quaternion algebra B = (a, b | F). 
Then O = R@ Ri @ Rj © Rk is an R-order, because it is closed under multiplication 
(e.g., ik =i(ij) =aj € O). 

Let J C B be an R-lattice in the F-algebra B. 


10.2.5. An important construction of orders comes as follows. Let 


OD :={@e€ Bi al CN. (10.2.6) 


Lemma 10.2.7. OL(I) © B is an R-order. 
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Proof. Then O.(/) is an R-submodule of B which is a ring. We show it is also an 
R-lattice. For all a € B, by Lemma 9.3.5(b), there exists nonzero r € R such that 
r(al) C I, hence OL(I)F = B. Also by this lemma, there exists nonzero s € R such 
that s = s - 1 € I; thus OL(Ds C I so O,(D) C s7!J. Since R is noetherian and s~!J is 
an R-lattice so finitely generated, we conclude that O_(/) is finitely generated and is 
thus an R-lattice. Oo 


Definition 10.2.8. The order O_(J) = {a € B : aI C J} in (10.2.6) is called the left 
order of J. We similarly define the right order of J by 


Or() := {a E€ B: la CI}. 


Example 10.2.9. It follows from Lemma 10.2.7 that B has an R-order: the R-span of 
an F-basis for B defines an R-lattice, so O(I) is an R-order. (This is a nice way of 
“clearing denominators” from a multiplication table to obtain an order.) 


We can read other properties about lattices from their localizations, such as in the 
following lemma. 


Lemma 10.2.10. Let B be a finite-dimensional F-algebra and let I C B be an R-lattice. 
Then the following are equivalent: 


(i) I is an R-order; 
(ii) Ip) is an Rip)-order for all primes p of R; and 
(iii) Km) is an R(m)-order for all maximal ideals m of R. 


Proof. For (i) > (ii) > (ii), if J is an R-order then Ip) is an Rip)-order for all primes 
p, hence a fortiori for all maximal ideals m. 

To conclude, we prove (iii) => (i), and suppose that Im) is an R(m)-order for all 
maximal ideals m. Then (),_ Zm) = Z by Lemma 9.4.6. Thus 1 € (m Zm) = J, and for 
all a, B € I we have aß € (m Im) = I, so J is a subring of B and hence an order. o 


Remark 10.2.11. The hypothesis that R is noetherian is used in Lemma 10.2.7, but it is 
not actually needed; the fact that O, (7) is an order follows by a process often referred 
to as noetherian reduction. A basis of B yields a multiplication table, consisting of 
finitely many elements of F; moreover, we know that Z is finitely generated as an 
R-module. Writing these generators in terms of a basis we can express these generators 
over the basis using finitely many elements of F. Let Ro be the subring of R generated 
by these finitely elements, with field of fractions Fo, let Bo be the Fo-algebra with the 
same multiplication table as B; let Jọ be the Ro-submodule generated by the generators 
for J written over Ro. Then B = Bo 8p, F and I = Ip Sp, R. But now Rp is a finitely 
generated commutative algebra over its prime ring (the subring generated by 1), so by 
the Hilbert basis theorem, Rọ is noetherian. The argument given then shows that Jp is 
finitely generated as an Ro-module, whence Z is finitely generated as an R-module. 

Noetherian reduction applies to many results in this text, but non-noetherian rings 
are not our primary concern; we retain the noetherian hypothesis for simplicity of 
argument and encourage the interested reader to seek generalizations (when they are 
possible). 
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10.3 Integrality 


Orders are composed of integral elements, defined as follows. If a € B, we denote by 
R[a] = iq Raf the (commutative) R-subalgebra of B generated by a. 


Definition 10.3.1. An element a € B is integral over R if æ satisfies a monic polyno- 
mial with coefficients in R. 


Lemma 10.3.2. For a € B, the following are equivalent: 


(i) a is integral over R; 
(ii) Ra] is a finitely generated R-module; 
(iii) æ is contained in a subring A that is finitely generated as an R-module. 


Proof. This lemma is standard; the only extra detail here is to note that in (iii) we do 
not need to assume that the subring A is commutative: (ii) > (iii) is immediate taking 
A = R{a], and for the converse, if A C B is a subring that is finitely generated as an 
R-module, then R[@] E A and since R is noetherian and A is finitely generated as an 
R-module, it follows that R[a] is also finitely generated as an R-module. o 


Corollary 10.3.3. If O is an R-order, then every a € O is integral over R. 


10.3.4. We say R is integrally closed (in F) if whenever a e€ F is integral over R, 
then in fact a € R. Inside the field F, the set of elements integral over R (the integral 
closure of R in F) forms a ring: if «œ, 8 are integral over R then a + £ and af are 
integral since they lie in R[a@, 8] which is a finitely generated submodule of F. The 
integral closure of R is itself integrally closed. 


Lemma 10.3.5. Suppose that R is integrally closed. Then a € B is integral over R if 
and only if the minimal polynomial of a over F has coefficients in R. 


Proof. Let f(x) € R[x] be a monic polynomial that «œ satisfies, and let g(x) € F[x] be 
the minimal polynomial of a. Let K be a splitting field for g(x), and let a1,...,@ be 
the roots of g(x) in K. Since g(x) | f(x), each such a; is integral over R, and the set of 
elements in K integral over R forms a ring, so each coefficient of g is integral over R 
and belongs to F; but since R is integrally closed, these coefficients must belong to R 
and g(x) € R[x]. o 


Corollary 10.3.6. [f B is an F-algebra with a standard involution, and R is integrally 
closed, then a € B is integral over R if and only if trd(œ), nrd(@) € R. 


We may characterize orders in separable algebras as follows. 


Lemma 10.3.7. Let O C B be a subring ofa separable F-algebra B such that OF = B. 
Then O is an R-order if and only if every a € O is integral. 


Proof. Let O C B be a subring of an F-algebra B such that OF = B. Recall from 
Theorem 7.9.4 that a separable F-algebra is a semisimple F-algebra such that the 
symmetric bilinear pairing (a, 8) + trd(@f) is nondegenerate. 
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We need to show that O is finitely generated. Let a1,...,a@, be an F-basis for B 
contained in O. If 8 € O then £ = 9); aia; with a; € F. We have Ba; € O since O 
is a ring, so trd(Ba;) = Dj a; trd(a;a;) with trd(a;a;) € R. Now since B is separable, 
the matrix (trd(a;q;));,;=1,..,.n is invertible, say r = det(trd(a;a;)), so we can solve 
these equations for a; using Cramer’s rule and we find that a; € r~!R. Consequently 
O Cr !(Ra; ©---@ Ra,) is a submodule of a finitely generated module so (since R 
is noetherian) O is finitely generated. o 


10.4 Maximal orders 


The integral closure of R in F is the largest ring containing integral elements. Accord- 
ingly, we make the following more general definition. 


Definition 10.4.1. An R-order O C B is maximal if it is not properly contained in 
another R-order. 


If B is a commutative F-algebra and R is integrally closed in F, then the integral 
closure S of R in K is integrally closed and therefore S is a maximal R-order in K. How- 
ever, if B is noncommutative, then the set of elements in B integral over R is no longer 
necessarily itself a ring, and so the theory of maximal orders is more complicated. (This 
may seem counterintuitive at first, but certain aspects of the noncommutative situation 
are quite different!) The problem in the noncommutative setting is that although R[a] 
and R[f] may be finitely generated as R-modules for a, 6 € B, this need not be the 
case for the R-algebra generated by a and £. 


10.4.2. It follows from Lemma 10.3.7 that a separable F-algebra B has a maximal R- 
order, as follows. By Lemma 10.2.7, B has an R-order O (since it has a lattice, taking 
the R-span of an F-basis), so the collection of R-orders containing O is nonempty. 
Given a chain of R-orders containing O, by Lemma 10.3.7 the union of these orders is 
again an R-order. Since R is noetherian, there exists a maximal element in a chain. 


For the rest of this section, we restrict attention and suppose that R is a Dedekind 
domain. We begin by showing that the property of being a maximal order is a local 


property. 


Lemma 10.4.3. An R-order O C B is maximal if and only if Oc») is a maximal Rip)- 
order for all primes p of R. 


Proof. If Oq) is maximal for each prime p then by Corollary 9.4.7 we see that O is 
maximal. Conversely, suppose O is maximal and suppose that Op) © Ow ) is a proper 
containment of orders for some nonzero prime p. Then the set O’ = (one Oa) N Ow) 
is an R-order properly containing O by Lemma 10.2.10 and Theorem 9.4.9. oO 


Lemma 10.4.4. Let O C B be an R-order. Then for all but finitely many primes p of 
R, we have that Op) = O Br Rip) is maximal. 


Proof. By 10.4.2, there exists a maximal order O’ 2 O. By the local-global principle 
for lattices (Theorem 9.4.9), we have O, = Oy for all but finitely many primes p. o 
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The structure of (maximal) orders in quaternion algebras over domains of arithmetic 
interest is the subject of the second part of this text. 


10.5 Orders in a matrix ring 


In this section, we study orders in a matrix ring; we restore generality, and let R be a 
noetherian domain with F = Frac R. 

The matrix ring over F is just the endomorphism ring of a finite-dimension vector 
space over F, and we seek a similar description for orders as endomorphism rings of 
lattices (cf. 10.2.5). 

Let V be an F-vector space with dimp V = n and let B = Endf(V). Choosing a 
basis of V gives an identification B = Endr (V) ~ M,(F). Given an R-lattice M € V, 
we define 

Endr(M) := {f € Endr(V): f(M) € M} CB. (10.5.1) 


The left order (10.2.5) is the special case of (10.5.1) where M = I € V =B. 


Example 10.5.2. If V = Fx; ®---® Fx, and M = Rx, ®---® Rxn, then Endr(M) = 
M,,(R). 

More generally, if M is completely decomposable, i.e. M = a,x; 8-:-@®ay,x, with 
each a; C F invertible fractional ideals, then we have Endr(M) € M,,(F) the subring 
of matrices whose ij-entry belongs to the R-module 


aja; = Home(q;, aj) C Home (F, F) = F 
where the isomorphisms come from multiplication. For example, if n = 2 then 


R ara, 


Endr(M) = Er R 
2 


] C M)(F). 


(Note how the cross terms are aligned correctly in the multiplication!) For example, if 


-1 
M = Rx, + ax, then Endg(M) = k R ). 


Lemma 10.5.3. Let M be an R-lattice of V. Then Endg(M) is an R-order in B = 
Endz (V). 


Proof. As in the proof of Lemma 10.2.7, we conclude that Endr(M)F = B. Let 
@1, @2,...,@, be an F-basis for V and let N = Ra, ®---@Ra,. Thus Endr(N) ~ M, (R) 
is finitely generated as an R-module. 

By Lemma 9.3.5 there exists nonzero r € R such that rN C M C r—!N. Therefore, 
if @ € Endr(M), so that ¢(M) € M, then 


(ONN) =r rN) C r6(M) CrM CN 


and thus Endr(M) €C r`? Enda(N); since R is noetherian, this implies that Endr(M) 
is finitely generated as an R-module and Endr(M) is an R-order in B. o 
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Lemma 10.5.4. Let O C B = Endr (V) be an R-order. Then O C Endr(M) for some 
R-lattice M C V. In particular, if O C B is a maximal R-order, then O = Endr(M) for 
some R-lattice M. 


Proof. Quite generally, if N is any R-lattice in V, then M = {x € N : Ox C N} is an 
R-submodule of N with FM = V (as in Lemma 10.2.7), thus M is an R-lattice in V 
and O C Endr(M). If further O is maximal, then the other containment so equality 
holds. o 


Corollary 10.5.5. If R is a PID, then every maximal R-order O C B = M,(F) is 
conjugate in B to M, (R). 


Proof. The isomorphism B ~ M,,(F) arises from a choice of basis x1, .. ., Xn for V; 
letting N = Di Rx; we have Endr(N) ~ M, (R). The R-order M, (R) is maximal by 
Exercise 10.6, since a PID is integrally closed. 

By Lemma 10.5.4, we have O C Endr(M) for some R-lattice M C V, so if O is 
maximal then O = Endg(M). If R is a PID then M is free as an R-module, and we 
can write M = Ry; ©---@ Ryn; the change of basis matrix from x; to y; then realizes 
Endg(M) as a conjugate of Endr(N) ~ M, (R). o 


Exercises 
Let R be a noetherian domain with field of fractions F. 


1. Let c C R be an ideal. Show that 
R R a b 
i i = ((? l € M2(R): c€ :] C M2(R) 


is an R-order in M2(F). Note further that if c is projective (equivalently, locally 
free) as an R-module, then this R-order is projective as an R-module. 


2. Let B bea finite-dimensional F-algebra with a standard involution and let O € B 
be an R-order. Verify that nrd: O —> R is a quadratic form over R. 


3. Let O,O’ C B be R-orders in an F-algebra B. 


(a) Show that O N O’ is an R-order. 
(b) If O c O’, show that O’* N O = O%. 


4. Let O C B be an R-order in an F-algebra B and suppose that R is integrally 
closed. Show that F NO = R. 


5. Let Aj,..., A, be F-algebras and let B = A, x--- x A,. Show that O C B is an 
R-order if and only if O is an R-lattice in B and O N A; is an R-order for each i. 


6. Let R be integrally closed. Show that M,,(R) is a maximal R-order in M,,(F). 


7. Let B = (K,b| F) be a quaternion algebra with b € R and let S be an R-order in 
K. Let O = S + Sj. Show that O is an R-order in B. 
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. Suppose that R is a PID, and let O C B be an R-order in the quaternion algebra 


B. Let a € O be such that $ = R[a] is (commutative) domain that is a maximal 
R-order in its field of fractions. 


(a) Show that 1, a extends to an R-basis for O. 
(b) If moreover S is a PID, show that there exists 8 € O such that 1, a, $, af is 
an R-basis for O. 


. Let B be an F-algebra with a standard involution and let a € B. Show that if œ 


is integral over R then trd(a”) € R for all n € Zso. Is the converse true? 


Generalize Example 10.1.1: Exhibit a division quaternion algebra B over Q and 
elements a, 6 € B such that a, 8 are integral over Z but both a + £ and af are 
not. 


Let a € M,,(F) have characteristic polynomial with coefficients in R. Show that 
a is conjugate by an element 6 € GL, (F) to an element of M, (R). Explicitly, 
how do you find such a matrix 8? 


Let B = M,,(F) and let J C B be an R-lattice. Let Jt‘ = {at : œ € I} be the 
transpose lattice. Show that O_(/*) = Op(/). 


Let J, J C B be R-lattices. Let JJ be the R-submodule of B generated by products 
aBwherea € I, 8 € J; i.e., 


IJ := {Xk aibi: a € 1B € J}. 
(a) Show that JJ is an R-lattice. 


(b) Let p be a prime of R. Show that products commute with localization in 
the sense that 


UJ) @r Rp) = (I Br Rwy) (J 8r Rwy) E Bo) = B. 
Let O C B be an R-order in an F-algebra B. 


(a) Show that O,(O) = OR(O) = O. 


(b) Let a € B*, and let «O = {aß : B € O}. Show that œO is an R-lattice and 
that O.(@O) = @Oa™!. 


Let O C B be an R-order in an F-algebra B. Let y € O and let N : BX — F* 
be a multiplicative map. Show that y € O* if and only if N(y) € R*, and in 
particular, if B has a standard involution, then y € O* if and only if nrd(y) € R*. 
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Chapter 11 


The Hurwitz order 


With the preceding chapters on lattices and orders in hand, we are now prepared 
to embark on a general treatment of quaternion algebras over number fields and the 
arithmetic of their orders. Before we do so, for motivation and pure enjoyment, in this 
chapter we consider the special case of the Hurwitz order. Not only is this appropriate 
in ahistorical spirit, it is also instructive for what follows; moreover, the Hurwitz order 
has certain exceptional symmetries that make it worthy of specific investigation. 


11.1 > The Hurwitz order 


Hurwitz developed the theory of integral quaternions in a treatise [Hur19] in 1919. A 
more modern treasure trove of detail about quaternion groups and the Hurwitz order 
(as well as many other things) can be found in the book by Conway—Smith [CSm2003]; 
the review by Baez [Bae2005] also provides an accessible overview. 

We consider in this chapter the restriction of the Hamiltonians from R to Q, namely, 


Q 
Zi, j) = Z + Zi + Zj + Zk, (11.1.1) 


1 
the quaternion algebra B = | } We further restrict to those elements with integer 


coordinates 


where k := ij. By Example 10.2.4, this is an order in B, called the Lipschitz order. 
In the rest of this chapter, we will work over Z and so we will simply refer to lattices 
and orders. 

The Lipschitz order is not a maximal order, and maximal orders have better prop- 
erties. This is analogous to the fact that the ring Z[Vv-3] is an order in Q(v-3) but is 
not maximal (not integrally closed), properly contained in the better-behaved maximal 
order Z[(-1 + v-3)/2] of Eisenstein integers. The comparison with the Eisenstein 
integers is more than incidental: the element «œ = i + j + k satisfies a* + 3 = 0, so it is 


natural to consider a 
-l+itjtk 
v= — 
2 


which satisfies w*+w+1 = 0. We can enlarge the Lipschitz order to include w—indeed, 
this is the only possibility. 
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Lemma 11.1.2. The lattice 
O = Z + Zi + Zj + Zw = Hii, j} + Zi, jw (11.1.3) 


in B is the unique order that properly contains Zii, j}, and O is maximal. 


The order O in (11.1.3) is called the Hurwitz order, and it contains Z</i, j} with 
index 2. Note that if a € O, then a € Zi, j) if and only if trd(@) € 2Z. 


Proof. By Exercise 11.1, the lattice O is an order. 

Suppose that O’ 2 Zi, j) and let œ = t + xi + yj + zk € O’ with t, x, y,z € Q. 
Then trd(av) = 2t € Z, so by Corollary 10.3.6 we have t € 5Z. Similarly, ai € O’ 
therefore trd(a@i) = -2x € Zand x € 5Z, and in the same way y,z € 5Z. Finally, 
nrd(q) = t? + x? + y? + z? € Z, and considerations modulo 4 imply that t, x, y, z either 
all belong to Z or to 5 + Z; thus a € O and so O’ = O. o 


11.1.4. We can recast this calculation in terms of the local-global dictionary for lattices 
(Theorem 9.1.1). Since O[5] = Zi, iF], for every odd prime p we have O) = 
ZKi, jp), and Oy 2 Zli, jjo- 


11.2 > Hurwitz units 


We now consider unit groups; in this section, we take k := ij. An element y = 
t+xi+yj+zk € Zli, jy is a unit if and only if nrd(y) = t? + x? + y? +22 € ZX, 
i.e. nrd(y) = 1, and since t, x, y, z E€ Z we immediately have 


Zli, jy = {+1, ti, +j, tk} ~ Qg 


is the quaternion group of order 8. In a similar way, taking y € O in the Hurwitz 
order and allowing t, x, y, z € iZ so that 2t, 2x, 2y, 2z all have the same parity, we find 
that 

O* = Qg U (+1 +i + j + k)/2 


is a group of order 24. 

We have O* # S4 (the symmetric group on 4 letters) because there is no embedding 
Qg => S4. (The permutation representation Qg — S4 obtained by the action on the 
cosets of the unique subgroup (—1) of index 4 factors through the quotient Qg —> 
Qs/{+1} =~ V4 — S4, where V4 is the Klein 4-group.) There are 15 groups of order 24 
up to isomorphism! We identify the right one as follows. 


Lemma 11.2.1. We have O*% ~ SL2(F3). 


Proof. We reduce modulo 3. There is a ring homomorphism 


-1,-1 
O > 0/30 = F3t, j) = (==) 
F3 
Any quaternion algebra over a finite field is isomorphic to the matrix ring by Wed- 
derburn’s little theorem (Exercises 3.16, 6.16, and 7.30). Specifically, the element 
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e€ =i+ j+ k has e? = 0 € O/30. The left ideal 7 generated by € is an F3-vector space, 
and we compute that it has basis e and ie = —1 — j + k. As in (7.6.3) (Proposition 
7.6.2) this yields an isomorphism 


0/30 > M2(F3) 
EN 0 -l\ fl 1 
ON, ophi -i 
(a statement that can be explicitly and independently verified in Exercise 11.3). We 
obtain a group homomorphism O* — SL2(F3), since the reduced norm corresponds 
to the determinant and nrd(O*) = {1}, and this homomorphism is injective because if 


y € O* has y — 1 € 30 then y = 1, by inspection. Since #O* = #SL2(F3) = 24, the 
map O* <> SL2(F3) is an isomorphism. Oo 


11.2.2. The group SL2(F3) acts on the left on the set of nonzero column vectors 
F? up to sign, a set of cardinality (9 — 1)/2 = 4. (More generally, SL2(F,,) acts on 
p! (@p):=( "2 ~ {(0, 0)})/F5,. a set of cardinality p+ 1.) This action yields a permutation 
representation SL2(F3) — S4; the kernel of this map is the subgroup generated by the 
scalar matrix —1 and so the representation gives an injective group homomorphism 
from PSL2(F3) := SL2(F3)/{+1} into S4. Since A4 < S4 is the unique subgroup of size 
24/2 = 12, we must have PSL2(F3) = A4, giving an exact sequence 


1 > {+1} > O* > Ay > 1. (11.2.3) 


11.2.4. We can also visualize the group O% and the exact sequence (11.2.3), thinking of 
the Hamiltonians as acting by rotations (section 2.4). Recall there is an exact sequence 
(Corollary 2.4.21) 


1 > {41} > H! > SOG) > 1 (11.2.5) 


obtained by the left action a + ava! for a € H! and v € H? = R°; specifically, 
by Proposition 2.4.18, a quaternion a = cos 8 + I(q@) sin 6 acts by rotation through the 
angle 26 about the axis (œ). 

We have been considering 


-1,-1 


o=s8-=| |> Ber- (F7) -B (11.2.6) 


and we now consider the corresponding embedding of groups O! = OX <> H!. We 
are led to think of the group O*/{+1} ~ A4 as the group of symmetries (rigid motions) 
of a tetrahedron (or rather, a tetrahedron and its dual), as in Figure 11.2.7. 
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Figure 11.2.7: Symmetries of a tetrahedron, viewed quaternionically 


Inside the cube in R? with vertices (+1,+1,+1) = +i + j +k, we can find four 
inscribed tetrahedra, for example, the tetrahedron T with vertices 


i+j+ki-j—-—k,-i+j—-—k,-i-jtk. 


Then the elements +i, +j, +k act by rotation about the x, y, z axes by an angle z (so 
interchanging points with the same x, y, z coordinate). The element +w = +(-1 +i + 
j +k)/2 rotates by the angle 27/3 fixing the point (1, 1, 1) and cyclically permuting the 
other three points, and by symmetry we understand the action of the other elements of 
O*. We therefore call O* the binary tetrahedral group. Following Conway—Smith 
[CSm2003, §3.3], we also write 2T = O™ for this group; the notation A4 is also used. 


The subgroup Qg < 2T is normal (as it is characteristic, consisting of all elements 
of O of order dividing 4), and so we can write 2T = Qg x (w) where (w) ~ Z/3Z acts 
on Qs by conjugation, cyclically rotating the elements i, j, k. Finally, the group 2T has 
a presentation (Exercise 11.6) 


2T = (r, s,t |r? = $? =f =rst =—-1) (11.2.8) 


viar =i, s =w =(—l +i + j+ k)/2, andt = (-1 +i + j -— k)/2. 


We conclude by noting that the difference between the Lipschitz and Hurwitz 
orders is “covered” by the extra units. 


Lemma 11.2.9. For every B € O, there exists y € O* such that By € Zii, j}. 
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Proof. If 8 € Zi, j) already, then we are done. Otherwise, 26 = t + xi + yj + zk with 
all t, x, y, z € Z odd. Choosing matching signs, there exists y € O% such that 28 = 2y 
(mod 40). Thus 

(26)! =2 (mod 40) 


so By"! € Z +20 = Zii, j}, so we may take y~! for the statement of the lemma. o 


11.3 > Euclidean algorithm 


The Eisenstein order Z[(—1 + V—3)/2] has several nice properties. Perhaps nicest of 
all is that it is a Euclidean domain, so in particular it is a PID and UFD. (Alas, the ring 
Z{V-3] just fails to be Euclidean.) 


11.3.1. The Hurwitz order also has a left (or right) Euclidean algorithm generalizing 
the commutative case, as follows. There is an embedding B — B ®g R = H, and 
inside H ~ R* the Hurwitz order sits as a (Z-)lattice equipped with the Euclidean 
inner product, so we can think of the reduced norm by instead thinking of distance. 
In the Lipschitz order, we see by rounding coordinates that for all y € B there exists 
H E€ Zii, jy such that nrd(y - y) <4. (1 /2)? = 1—a farthest point occurs at the center 
(1/2, 1/2, 1/2, 1/2) of a unit cube. But this is precisely the point where the Hurwitz 
quaternions occur, and it follows that for all y € B, there exists u € O such that 
nrd(y — u) < 1. (In fact, we can take nrd(y — u) < 1/2; see Exercise 11.7.) 


Paragraph 11.3.1 becomes a right Euclidean algorithm as in the commutative case. 


Lemma 11.3.2. (Hurwitz order is right norm Euclidean). For all a, B € O with B # 0, 
there exists u, p € O such that 


«a =ßu+p (11.3.3) 
and nrd(p) < nrd(8). 


Proof. If nrd(œ) < nrd(8), we may take u = 0 and p = a, so suppose nrd(a) > 
nrd(B) > 0. Let y = Bla € B. Then by 11.3.1, there exists u € O such that 
nrd(y — u) < 1. Let p = a — Bu. Then by multiplicativity of the norm, 


nrd(p) = nrd(a — Bu) < nrd(8). o 


A similar statement to Lemma 11.3.2 holds for division on the left, i.e., in (11.3.3) 
we may take a = uf + p (with possibly different elements u, p € O, of course). 


Proposition 11.3.4. Every right ideal I C O is right principal, i.e., there exists B € I 
such that I = BO. 


Proof. Let I C O be a right ideal. If Z = {0}, we are done. Otherwise, there exists an 
element 0 + 6 € J with minimal reduced norm nrd(8) € Z>o. We claim that J = BO. 
For all œ € J, by the left Euclidean algorithm in Lemma 11.3.2, there exists u € O 
such that a = uß + p with nrd(p) < nrd(); but p = a — Bu € I, so by minimality, 
nrd(p) = 0 and p = 0, hence a = Bu € BO as claimed. oO 
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Definition 11.3.5. Let a, 8 € O. We say £ right divides a (or a is a right multiple of 
£) and write £ |r a if there exists y € O such that a = By. 

A right common divisor of a, 8 € O is an element y € O such that y |r @, 8. A 
right greatest common divisor of «œ, 6 is a common divisor y such that 6 |r y for all 
common divisors 6 of a, p. 


It follows from Lemma 11.3.2 in the same way as in the commutative case that if 
a, p are not both zero, then there exists a greatest common divisor of œ, p, taking the 
last nonzero remainder in the right Euclidean algorithm. 


Corollary 11.3.6. (Bézout’s theorem). For all a, B € O not both zero, there exists 
4 v € O such that æu + By = y where y is a right greatest common divisor of a, p. 


Proof. By Proposition 11.3.4, we may write «O + BO = yO for some y € O, and then 
y € œO + BO implies there exists u, v € O such that au + By = y. o 


Proposition 11.3.7. Let O’ C B be a maximal order. Then there exists œ € B* such 
that O' = a~!Oa, and in particular O' ~= O as rings. 


Proof. By clearing denominators, there exists nonzero a € Z such that aO’ C O. Let 
I = aO'O be the right ideal of O generated by aO’. Then O’ € O_(J), and equality 
holds since O’ is maximal. By Proposition 11.3.4, we have Z = BO for some £ € B*. 
We have OL (1) = BOB! by Exercise 10.14, so O’ = BOB™! and we may take a = B"!. 

o 


Example 11.3.8. The Lipschitz order Z(i, j} does not enjoy the property that every 
right ideal is principal, as the following example shows. 

Let J = 20 = 2Z + 2iZ + 2jZ + (1 +i + j+ k)Z. Then I C Zi, j} and I has 
the structure of a right Z(i, j)-ideal, in fact 7 = 2Zi, 7) + (1 +i + j+ k)Z<i, j}. We 
claim that / is not principal as a right Z<i, j)-ideal. Indeed, suppose J = «Z<i, j) with 
a € I. Since a € 20, we have 4 | nrd(@). But 2 € I so 2 = af with 8 € Zii, j}, so 
4 = nrd(2) = nrd(q@) nrd(8), whence nrd(a@) = 4 and nrd(Z) = 1 so B € Zi, j)* and 
so 20 = I = aZii, j) = 2ZKi, j}. Cancelling the factor 2, we conclude O = Zii, j}, a 
contradiction. 

For more, see Exercise 11.10. 


11.4 > Unique factorization 


It does not follow that there is unique factorization in O in the traditional sense, as the 
order of multiplication matters. Nevertheless, there is a theory of prime factorization 
in O as follows. 


Lemma 11.4.1. Let p be prime. Then there exists n € O such that nī = nrd(x) = p. 


Proof. We have nrd(1 +i) = 17 + 1? = 2, so we may suppose p > 3 is odd. Then 
O/pO = (-1,-1 | Fp) ~ M2(Fp) by Wedderburn’s little theorem, and there exists a 
right ideal J mod p c O/pO with dimg,, (7 mod p) = 2. Let 


I={a¢€O:amodp € I mod p} 
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be the preimage of J mod p in the map O > O/pO. Then pO Ç I Ç O. Then I c O 
is aright ideal, and Z + O. But J = BO is right principal by Proposition 11.3.4. 

We claim that nrd(f) = p. Since p € I, we have p = Bu for some u € O, whence 
nrd(p) = p = nrd(£) nrd(u) so nrd(£) | p. We cannot have nrd(£) = 1 or nrd(£) = p°, 
as these would imply Z = O or I = pO, impossible. We conclude that nrd(6) = p. O 


Remark 11.4.2. Once we have developed a suitable theory of norms, the proof that 
nrd(£) = p above will be immediate: if we define NU) := #(O/J) then NJ) = p? by 
construction, and it turns out that N(7) = nrd(B)?. 


Theorem 11.4.3. (Lagrange). Every integer n > 0 is the sum of four squares, i.e., 
there exist t,x, y,z € Z such that n = 1? + x? + y? +2. 


Proof. We seek an element 6 € Zi, 7) such that nrd(8) = n. By multiplicativity of 
the reduced norm, it is sufficient to treat the case where n = p is prime. We obtain 
m € O such that nrd(z) = p by Lemma 11.4.1. But now the result follows from Lemma 
11.2.9, as there exists y € O* such that ry € Z(i, j}. o 


Remark 11.4.4. A counterpart to Lagrange’s theorem (Theorem 11.4.3) is the following 
theorem of Legendre and Gauss on sums of three squares: Every integer n that is not 
of the form n = 4°m with m = 7 (mod 8) can be written as the sum of three squares 
n = x? + y? + z?. We will revisit this classical theorem in Chapter 30 as motivation for 
the study of embedding numbers, and the number of such representations will be given 
in terms of class numbers, following Gauss. A direct proof of the three square theorem 
is given by Mordell [Mor69, §20, Theorem 1], but he notes that “no really elementary 
treatment [of this theorem] is known”. 


We finish this section with a discussion of ‘unique factorization’ in the Hurwitz 
order. 


Definition 11.4.5. An element z € O is irreducible if whenever z = af witha, B € O 
then either a € O* or B € O*. 


Lemma 11.4.6. Let n € O. Then n is irreducible if and only if nrd(z) = p € Z is 
prime. 


Proof. If nrd(z) = p is prime and a = «£p then nrd(z) = p = nrd(a@) nrd(f) so either 
nrd(@) = 1 ornrd(8) = 1, thus a € O* or B € O%. Conversely, suppose z is irreducible 
and let p | nrd(z). Let I = tO + pO = aO. Then nrd(q) | nrd(p) = p?. We cannot 
have nrd(q@) = 1, as every element of J has reduced norm divisible by p. We similarly 
cannot have nrd(@) = p?, since this would imply x € pO; but by Lemma 11.4.1, p is 
reducible, a contradiction. We conclude that nrd(@) = p. From z € I = œO we obtain 
z = aß with 8 € O; by irreducibility, 8 € O* and nrd(z) = nrd(a) = p. Oo 


Definition 11.4.7. An element a € O is primitive if a ¢ nO for all n € Z>2. 


Theorem 11.4.8. (Conway—Smith). Let œ € O be primitive and let a = nrd(q). Factor 
a = pıp2' +- pr into a product of primes. Then there exists 1,12, .. ., n, E€ O such that 


& =N: N, and nrd(r;) = p; for alli. (11.4.9) 
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Moreover, every other such factorization is of the form 


a = (my TV2) °° Vp Ar) (11.4.10) 
where y1, ..., Yr € O*. 


Proof. Let I = œO + p10; as in the proof of Lemma 11.4.6, we find J = mO with 
nrd(z) = pı, arguing that nrd(7,) + Pt since a € p10 is in contradiction to a being 
primitive. Then zr, is unique up to right multiplication by a unit and œ = a2. The 
result then follows by induction. o 


The factorization (11.4.10) is said to be obtained from œ = 7---7; by unit 
migration. 


Remark 11.4.11. To look at all possible prime factorizations of œ as in (11.4.9), it is 
necessary to consider the possible factorizations a = p4 - + + p,. Conway—Smith call this 
process metacommutation [CSm2003, Chapter 5]; metacommutation is analyzed by 
Cohn—Kumar [CK2015], Forsyth-Gurev—-Shrima FGS, and in a very general context 
by Chari [Cha2020]. 


11.5 Finite quaternionic unit groups 


We conclude this section by a discussion of quaternion unit groups extending the 
discussion 11.2: we classify finite subgroups of H* and realize the possible subgroups 
as coming from quaternionic unit groups. 


11.5.1. To begin with the classification, suppose that T c H* is a finite subgroup. 
Then nrd(I) is a finite subgroup of RX, hence identically 1, so T c HI!. 


>0’ 


Similarly, if T c H*/R* = H!/{+1} is a finite subgroup, then it lifts via the 
projection H! —> H!/{+1} to a finite subgroup of H!. 


So let T c H! be a finite subgroup. Then 


T/{+1} > H!/{+41} = SOG) 


the latter isomorphism by Hamilton’s original (!) motivation for quaternion algebras 
(Corollary 2.4.21). Therefore T/{+1} € SO(3) is a finite rotation group, and these 
groups have been known since antiquity. 


Proposition 11.5.2. A finite subgroup of SO(3) is one of the following: 
(i) a cyclic group; 


(ii) a dihedral group; 
(iii) the tetrahedral group A4 of order 12; 
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(iv) the octahedral group S4 of order 24; or 
(v) the icosahedral group As of order 60. 


Cases (iii)—-(v) are the symmetry groups of the corresponding Platonic solids and 
are called exceptional rotation groups. 


Proof. Let G < SO(3) be a finite subgroup with #G = n > 1; then G must consist of 
rotations about a common fixed point (its center of gravity), which we may take to be 
the origin. The group G then acts on the unit sphere, and every nonidentity element of 
G acts by rotation about an axis, fixing the poles of its axis on the sphere. Let V be 
the subset of these poles in the unit sphere; the set V will soon be the vertices of our 
(possibly degenerate) polyhedron. Let 


X = {(g,v): g € G\{1} and v is a pole of g}. 


Since each g € G {1} has exactly two poles, we have #X = 2(n — 1). On the other 
hand, we can also count organizing by orbits. Choose a representative set vj,...,v, of 
poles, one from each orbit of G on V, and let 


ni = #StabG(v;) = Hg € G : gv; = vi} 


be the order of the stabilizer: this group is a cyclic subgroup about a common axis. 
Then 


n-2=4X = Kowa- D=X Zau -D=1) (1- 2), 


i=1 i=1 i=1 


by the orbit—stabilizer theorem. Dividing both sides by n gives 


2 r 1 
ae = (i =): (11.5.3) 
Sincen > 1, wehave 1 < 2—2/n < 2;andsinceeachn; > 2,wehave 1/2 < 1-1/n; < 1. 
Putting these together, we must have r = 2, 3. 

If r = 2, then (11.5.3) becomes 2 = n/n, + n/m, with n/n; = #(Gv;) = 1, so 
nı = m = n, there is only one axis of rotation, and G is cyclic. 

If r = 3, then the only possibilities for (nı, n2,n3) with nj < n2 < m are 
(2, 2, c), (2, 3, 3), (2, 3, 4), (2, 3,5); the corresponding groups have sizes 2c, 12, 24, 60, 
respectively, and can be identified with D2,., A4, S4, A5 by a careful but classical analy- 
sis of orbits. See Armstrong [Arm88, Chapter 19], Grove—Benson [GB2008, $2.4], or 
Conway-Smith [CSm2003, §3.3]. oO 


In 11.2.4, we gave a quaternionic visualization of the binary tetrahedral group 
(lifting the tetrahedral group to H'); we repeat this with the two other exceptional 
rotation groups, taking again k := ij. 


174 CHAPTER 11. THE HURWITZ ORDER 


11.5.4. The octahedral group S4 pulls back to the binary octahedral group 20 c H! 
of order 24-2 = 48, whose elements act by rigid motions of the octahedron (or dually, 
the cube). We make identifications following 11.2.4, shown in Figure 11.5.5. 


x i+j-k 


Figure 11.5.5: Symmetries of an octahedron and a cube, viewed quaternionically 


The binary tetrahedral group 2T < 20 of order 24 acts as a subgroup of rigid 
motions; the group 20 is generated by an element which maps to a rotation of order 4 
around the 6 faces, i.e., one of the 12 elements 


+l+i +1+j +lk 


The group 20 has a Coxeter presentation 


20 = (r,s,t | r = =t =rst= -1) 
(with —1 central and (—1)* = 1). One also writes 20 = Sa. 
Let F = Q(V2) and R = Z[V2]. If we consider the Hamiltonians restricted to F 


(>= 
as B = 


? then the group 20 C H! generates an R-order: letting i, j be the 


standard generators and still k := ij, and letting œ = (1 + i)/ V2 and £ = (1 + j)/ V2, 
then 
O20 = R + Ra + RB + Rap; (11.5.6) 


this order contains the scalar extension of the Hurwitz order to R and is in fact a maximal 
R-order. (The extension of scalars is necessary: S4 contains an element of order 4 which 
lifts to an element of order 8 in 20; such an element has trace +(Zg + g’) = +vV2.) 
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11.5.7. Finally, we treat the binary icosahedral group 27 C H! of order 60-2 = 120, 
acting by rigid motions of the icosahedron (or dually, the dodecahedron). We choose 
the regular icosahedron to have vertices at 


+itjxk, trit |j, +Tj+ tk atket li 


where t = (1 + V5)/2 is the golden ratio. The elements of order 5 are given by 
conjugates and powers of the element ¢ = (t +t! + j)/2, which acts by rotation 
about a face. The group 2/ can be presented as 


2I = (r,s,t |r? = $? =P =rst = —1) 


and we have 2/ = As ~ SL2(Fs). Letting now F = Q(V5) and R = Z[r], the R-algebra 
generated by 2/ is the maximal order 


Oz, = R+ Ri + RE + Ril. (11.5.8) 


For further references, see Conway—Sloane [CS188, §8.2], who describe the binary 
icosahedral group in detail, calling it the icosian group. 


We now consider the related possibilities over Q. (We will return to a general 


„b 
classification in section 32.4.) To put ourselves in a situation like (11.2.6), let B = ) 


a,b 


be a quaternion algebra over Q such that B @g R = 


that B is definite. By Exercise 2.4, B is definite if and only if a,b < 0. Let O C B be 
an order in B; we would like to understand its unit group. 


= H: in this case, we say 


Lemma 11.5.9. The group O* = O! is finite. 


Proof. We may take B = (a,b | Q) with a,b < 0. Consider the reduced norm 
nrd: B > Q, given by nrd(t+xi+yj+zij) = t?+a|x?+|bly?+|ablz’, so nrd(B*) c OR 
At the same time, nrd(O*) c Z* = {+1}, so we conclude O* = O!. This group is finite 
because the restriction nrd |o of the reduced norm to O ~ Z4 defines a (still) positive 
definite quadratic form, so there are only finitely many elements of O of any fixed 
reduced norm. (For a geometric perspective, viewing the elements of O! as lattice 
points on an ellipsoid in R*), see Proposition 17.5.6.) o 


In view of Lemma 11.5.9, the classification of finite rotation groups (Proposition 
11.5.2) applies. We consider each case in turn. 


11.5.10. Among the (nontrivial) cyclic groups, only subgroups of order 2,4,6 are 
possible over Q. Indeed, a generator satisfies a quadratic equation with integer coeffi- 
cients and so belongs to the ring of integers of an imaginary quadratic field; and only 
two imaginary quadratic fields have units other than +1, namely, the Eisenstein order 
Z[(-1 + v-3)/2] of discriminant —3 and the Gaussian order Z[v-1 ] of discriminant 
—4 with groups of size 4, 6, respectively. (The more precise question of whether or not 
there is a unit of specified order is a question of embedding numbers, the subject of 
Chapter 30.) 


176 CHAPTER 11. THE HURWITZ ORDER 


11.5.11. Next, suppose that O*/{+1} is dihedral, and let j € O%\{+1} act by inversion 
(equivalently, conjugation) on a cyclic group (of order 2, 3, by 11.5.10), generated by 
an element i. Let K = Q(i). Since j acts by inversion, we have j? € Q, and since 


K,-1 
j € O* we have j? = —1. It follows that ja = @j for all a € K. Thus B = (55) 
and we have two possibilities: 


(i) If i has order 4, then B = (—1, —1 | Q) and O contains the order generated by i, j. 
This is the case treated in section 11.2: O is the Lipschitz order, and O* = Qg is 
the quaternion group of order 8. 

(ii) Otherwise, i = w has order 6, and B ~ (—3, -1 | Q). By Exercise 11.11(a), we 
have (—3,-1 | Q) # (-1,-1 | Q). By an argument similar to Lemma 11.1.2— 
and boy, there is more of this to come in Chapter 32—we see that 


O = Z + Zw + Zj + Zwj (11.5.12) 


is maximal. The group O*%/{+1} ~ Dg is a dihedral group of order 6, and the 
group O% is generated by w, j with relations w° = j? = —1 and jw = w`! j; in 
other words, O* = C3 x C4 is the semidirect product of the cyclic group C3 of 
order 3 by the action of the cyclic group C4 with a generator acting by inversion 


on C3. Because i? = —1 is central, we also have an exact sequence 


l> —> OX 9G > 1 


where C6 ~ (w) and C2 = (j)/{+1}. This group is also called the binary dihedral 
or dicyclic group of order 12, denoted 2D¢. 


11.5.13. To conclude, suppose that O*/{+1} is exceptional. Each of these groups 
contain a dihedral group, so the argument from 11.5.11 applies: the only new group we 
see is the (binary) tetrahedral group obtained from the Hurwitz units (section 11.2). 
Here is another proof: the group S4 contains an element of order 4 and A5 an element 
of order 5, and these lift to elements of order 8, 10 in OX, impossible. 


We have proven the following theorem. 


Theorem 11.5.14. Let B = (a,b | Q) be a quaternion algebra over Q with a,b < 0, 
and let O C B be an order. Then O% is either cyclic of order 2, 4, 6, quaternion of order 
8, binary dihedral of order 12, or binary tetrahedral of order 24. 

Moreover, O* is quaternion, binary dihedral, or binary tetrahedral if and only 
if O is isomorphic to the Lipschitz order, the order (11.5.12), or the Hurwitz order, 
respectively. 


Proof. Combine 11.5.10, 11.5.11, and 11.5.13. o 


Exercises 


> 1. Check directly that the Hurwitz order 
l+i+j+k 
O=Z+zi+zj+z( Htt) 


EXERCISES 177 


> 6. 


> 7, 


Let B =| 


-1,-1 
is indeed an order in B = (==) 


1 
} and let O C B be the Hurwitz order. For the normalizer 


Ti 


Npx(O) := {a € BX : a™!Oa = O} 


show the equality Ngx(Z<i, 7)) = Ngx(O). [Hint: consider units and their traces. ] 
(a) Show that the Lipschitz order Z(i, j} is the unique suborder of the Hurwitz 
order O with index 2 (as abelian groups). 


(b) Show that 
Zli, jy = {a € O: trd(a) is even}. 
Check that the map 
0/30 > Mo(F3) 
ae 0 -l\ fl 1 
BENE op -1 


from Lemma 11.2.1 is an F3-algebra isomorphism. 


. Generalizing the previous exercise, show that for an odd prime p that O/pO ~ 


M2(Fp). 


. Draw the subgroup lattice for SL2(F3), indicating normal subgroups (and their 


quotients). 
Show explicitly that 


2T = (r,s,t |r? = s? =P =rst =—-1) 


(cf. (11.2.8). 


Let 

A=Z*+Z(4,4,3,4) cR 
be the image of the Hurwitz order O under the natural embedding O > H = R4. 
Show that for every x € R4, there exists A € A such that |||]? < 1/2. [Hint: 
without loss of generality we may take 0 < xi < 1/2 for all i, then show we may 


take xı + x2 + x3 + x4 < 1; conclude that the maximum value of \|x||? with these 


conditions occurs at the point G, 5, 0, 0)./ 


. Let B be a definite quaternion algebra over Q and let O C B be an order. Show 


that O is left Euclidean if and only if O is right Euclidean (with respect to a 
norm N). 


. Let O C B := (-1,-1 | Q) be the Hurwitz order. 


(a) Consider the natural ring homomorphism O —> O/20 = O &z F? giving 
the reduction of the algebra O modulo 2. Show that O/20O is an F-algebra, 
that #(O/2O) = 16, and that (O/2O)* = A4 is isomorphic to the alternating 
group on 4 elements. Conclude that 0/20 + Mo(F2) and hence that O/20 
is not a quaternion algebra over F >. 
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(b) Show that the group of ring automorphisms of O/20 is 
Aut(O/20) = S4. 


(c) More generally, if F is a field of characteristic 2 show that there is an exact 
sequence 
1 > F? > Aut(O 8z F) > SL,(F) > 1 


where F? is an additive group. [Hint: let J = rad(O 8z F) be the Jacobson 
radical of the algebra, and show that the sequence is induced by F-linear 
automorphisms of J and the automorphisms w > w + e with e € J.] 


[This kind of construction, considered instead over the octonions, arises 
when constructing the exceptional group G2 in characteristic 2 [Wils2009, 
§4.4.1].] 


Although the Lipschitz order just misses being Euclidean with respect to the 
norm (see Example 11.3.8), bootstrapping from the Hurwitz order we still obtain 
a result on principality by restricting the set of ideals, as follows. 


Let J € Zii, j} be a right ideal. 


(a) Show that JO = BO for some $ € ION Zii, j}. 

(b) Prove that I0) = Z)(i, j} if and only if J is generated by elements of odd 
reduced norm. 

(c) If Io) = Zoli, j}, show that JO N Zi, j) = I and conclude that 7 is right 
principal. [Hint: Argue locally. ] 


Let B := (—1, —3 | Q), and let 
Jltj 
i——. 


O=Zi,(1+p/2)=2+ +75! 4 ; 


(a) Show that B # (-1, -1 | Q). 
(b) Show that O is a maximal order in B. 
(c) Show that O is Euclidean with respect to the reduced norm 


nrd(t + xi + y(1 + f)/2+zi1 + j)/2) =? +ty tx? txzty +2. 
(d) Show that every maximal order in B is conjugate to O. 


Let G < O(2) be a finite subgroup such that tr(g), det(g) € Q for all g € G. 
Show that G is conjugate in O(2) to one of the following: (i) a cyclic group of 
order 1, 2,3, 4,6 that is a subgroup of SO(2), or (ii) a dihedral group of order 
2, 4, 6, 8, 12, not contained in SO(2). 


Let p be an odd prime. 


(a) The group GL2(Zęp)) acts by right multiplication on the set of matrices 
m € Mo(Zip)) with p || det(z) (i.e., p exactly divides the numerator of 
det(zr), written in lowest terms). Show that there are precisely p + 1 orbits, 
represented by 
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and 
1 0 
T= , a=0,1,...,p—1. 
| j 


[Hint: use column operations. ] 

(b) Repeat (a) but with SL2(Zi,)) acting on the set of matrices 7 € M2(Zip)) 
with det(z) = p, with the same conclusion. 

(c) Show that the number of (left or) right ideals of O of reduced norm p is 
equal to p + 1. 

(d) Accounting for units, conclude that the number of ways of writing an odd 
prime p as the sum of four squares is equal to 8(p + 1). 


14. Inthe following exercise, we consider a computational problem, suitable for those 
with some background in number theory algorithms (see e.g. Cohen [Coh93]). 


(a) Show that one can find x, y, z € Z such that x? + y? + z? = pm with p {m 
in probabilistic polynomial time in log p. 

(b) Describe the right Euclidean algorithm as applied to œ = xi + yj + zk and 
p to obtain m € O with nrd(z) = p. Adjust as in Lemma 11.2.9 to find a 
solution to x? + y? + z? = p with x, y, z € Z. Estimate the running time of 
this algorithm. 


This chapter is licensed under the terms of the Creative Commons Attribution- 
NonCommercial 4.0 International License (http://creativecommons.org/licenses/by- 
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Chapter 12 


Ternary quadratic forms over local 
fields 


In this chapter, we classify quaternion algebras over local fields using quadratic forms; 
this generalizes the classification of quaternion algebras over R. 


12.1 > The p-adic numbers and local quaternion algebras 


Before beginning, we briefly remind the reader about the structure of the p-adic 
numbers. The p-adics were developed by Hensel, who wanted a uniform way to say 
that a Diophantine equation has a (consistent) solution modulo p” for all n. In the early 
1920s, Hasse used them in the study of quadratic forms and algebras over number 
fields. At the time, what is now called the local-global principle then was called the 
p-adic transfer from the “small” to the “large”. As references on p-adic numbers, see 
for example Gouvêa [Gou97], Katok [Kat2007], or Koblitz [Kob84]. 

Just as elements of R can be thought of infinite decimals, an element of Q, can be 
thought of in its p-adic expansion 


co 


a = (...43d2a|d9.d_|a_2*++d_k)p = > anp” (12.1.1) 
n=-k 
where each a; € {0,..., p — 1} are the digits of a. We continue “to the left’ because a 


decimal expansion is a series in the base 1/10 < 1 and instead we have a base p > 1. 
Put a bit more precisely, we define the p-adic absolute value on Q by defined by 
l0lp := 0 and 
Iclp =p? for c € Q*, (12.1.2) 


where v,(c) is the power of p occurring in c in its unique factorization (taken to be 
negative if p divides the denominator of c written in lowest terms). Then the field Q, 
is the completion of Q with respect to ||, that is to say, Qp is the set of equivalence 
classes of Cauchy sequences of rational numbers, and it obtains a topology induced by 
the metric d,(x, y) = |x — ylp. We have |a|, = p* for a as in (12.1.1) with a_, + 0. 
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Of course, all of the information in the p-adic absolute value is encoded in the p-adic 
valuation v,: Q > RU {ov}. 

Inside Q, is the ring Z, of p-adic integers, the completion of Z with respect to 
| |p: the ring Zp consists of those elements of Q, with a, = 0 for n < 0. (The ring Zp 
might be thought of intuitively as Z/p™Z, if this made sense.) 

Equipped with their topologies, the ring Z, is compact and the field Q, is 
locally compact. These statements can be understood quite easily by viewing Zp 
in a slightly different way, as a projective limit with respect to the natural projection 
maps Z/p"*'Z > Z/p"Z: 


Zp = lim Z/p"Z 
X (12.1.3) 
= |x = (Xen € | [z/"z > Xn41 = Xn (mod p”) for all n > i ; 


n=1 


In other words, each element of Z, is a compatible sequence of elements in Z/p"Z for 
each n. The equality (12.1.3) is just a reformulation of the notion of Cauchy sequence 
for Z, and so for the purposes of this introduction it can equally well be taken as a 
definition. 

As for the topology in (12.1.3), each factor Z/p”Z is given the discrete topology, 
the product [| _) Z/p”Z is given the product topology, and Z,, is given the subspace 
topology. Since each Z/p”Z is compact (it is a finite set!), by Tychonoff’s theorem the 
product ||?) Z/p”Z is compact; and Zp is closed inside this product (a convergent 
limit of Cauchy sequences is a Cauchy sequence), so Zp is compact and still Hausdorff. 
The topology on Z, is a bit strange though, as Zp is totally disconnected: every 
nonempty connected subset is a single point. In fact, Z, is homeomorphic to the 
Cantor set, which is itself homeomorphic to the product of countably many copies 
of {0, 1}. (More generally, every nonempty totally disconnected compact metric space 
with no isolated points is homeomorphic to the Cantor set.) 

The set Zp is a compact neighborhood of 0, as it is the closed ball of radius 1 
around 0: 

Zp = {x € Qp : lxlp < 1} = {x € Qp : Vp (x) 2 O}. (12.1.4) 


In a similar way, the disc of radius 1 around a € Qp is a compact neighborhood of 
a homeomorphic to Zp, so Qp is locally compact. Being able to make topological 
arguments like the one above is the whole point of looking at fields like Qp: our 
understanding of infinite algebraic objects is informed by topology. 

With this review, and topological arguments now at our disposal, we consider 
quaternion algebras over Qp. The ‘original’ quaternion algebra, of course, was the 
division ring H of Hamiltonians over the real numbers (the ‘original’ field with a 
topology), and indeed H is the unique division quaternion algebra over R (Corollary 
3.5.8). We find a similar result over Q, (a special case of Theorem 12.3.2), as follows. 


Theorem 12.1.5. There is a unique division quaternion algebra B over Qp, up to 
isomorphism; if p # 2, then 
mca 
Q 
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where e € Z is a quadratic nonresidue modulo p. 


For example, if p = 3 (mod 4) then —1 is quadratic nonresidue and (—1, p | Qp) is 
the unique division quaternion algebra over Qp. 

Because we have exactly two such possibilities, we define the Hilbert symbol: for 
a,b € Qs we have (a, b)g, = 1,-1 according as the quaternion algebra (a, b | Qp) = 
M2(Q,) is split or not. According to Theorem 12.1.5, the Hilbert symbol over Qp 
uniquely identifies the two possible isomorphism classes of quaternion algebras over 
Qp—just like it does over R. 

Our approach to Theorem 12.1.5 uses quadratic forms: we use the classification 
of isomorphism classes of quaternion algebras given in terms of similarity classes 
of ternary quadratic forms (Theorem 5.1.1). The following proposition then implies 
Theorem 12.1.5. 


Proposition 12.1.6. There is a unique ternary anisotropic quadratic form Q over Qp, 
up to similarity; if p + 2, then Q ~ (1,-e,—p) where e is a quadratic nonresidue 
modulo p. 


Happily, this proposition can be proved using some rather direct manipulations 
with quadratic forms and gives a very “hands on” feel; it is also suggests the arguments 
we use for a more general result. The main input we need is a quadratic Hensel’s 
lemma, or more precisely, the following consequence. 


Lemma 12.1.7. For p + 2, the classes in QÙ IG? are represented by 1, e, p,ep where 
e is a quadratic nonresidue modulo p. 

Proof. Let a € Q% and let m := vp(a). Then a = bp” with b := a/p™ € Z%, and by 
squaring a € Qs if and only if b € i ie and m is even. We claim that b € ie if and 
only if its reduction b modulo p is a square in (Z/pZ)*. With the forward implication 
immediate, suppose b = c° (mod p) with c € Z*, then b/c? € 1 + pZp. But squaring 
is a bijection on 1 + pZp, by expanding the square root as a convergent series (see 
Exercise 12.1) and using that p + 2. Thus b/c? € Vieng and the result follows. o 


We now proceed with the proof when p # 2. 


Proof of Proposition 12.1.6, p +2. We start by showing that Q(x, y, z) = x?-ey?-pz? 
is anisotropic. Suppose Q(x, y, z) = 0 with not all x, y,z € Q, zero. Rescaling by p, 
we may assume that x, y,z € Zp and not all x, y,z € pZ,. We then reduce modulo p 
to find that x? = ey? (mod p). If p 4 y, then (x/y)* =e (mod p); but e is a quadratic 
nonresidue modulo p, a contradiction. So p | y; thus p | (ey? + pz?) = x7, so p | x; 
thus p° | (x? — ey”) = pz’, so p | z, a contradiction. 

To show uniqueness, let Q be a ternary anisotropic form over Qp. Since p + 2, 
we may diagonalize. In such a diagonal form taken up to similarity, we may also 
rescale each coordinate up to squares as well as rescale the entire quadratic form. 
Putting this together with Lemma 12.1.7, without loss of generality we may suppose 
Q(x, y, z) = (l, =b, =c) = x? - by? — cz’ with b,c € {l,e, p,ep}, the signs chosen for 
convenience. If b = 1 or c = 1, the form is isotropic by inspection. So we are left to 
consider the cases (b, c) = (e, e), (e, p), (e, ep), (p, p), (p, ep), (ep, ep). 
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When (b, c) = (e, e), we have after rescaling x? + y? — ez’. We claim this form is 
always isotropic. Indeed, the form reduces to a nondegenerate ternary quadratic 
form over F,,. Such a form is always isotropic by a delightful counting argument 
(Exercise 5.5(b), or a second chance in Exercise 12.6!). Lifting, there exist 
x,y,z E€ Zp, not all zero modulo p, such that x? = —y* + ez? (mod p). Since 
e is a nonsquare, we have p { x (arguing similarly as in the first paragraph). 
Let d := -y* + ez? € QX. By the possibilities in Lemma 12.1.7, we must have 
d€ Q3; solving x? = d for x € Q, then shows that Q is isotropic. 

The case (e, p) is our desired form. 

In the third case (eep!), we substitute x < ex and divide by e to obtain the form 
=y? + ex? — pz’. We claim that there is an isometry (—1,e) ~ (1, —e): indeed, 
in the first bullet we showed that the quadratic form (—1,e,—1) is isotropic, 
so —x? + ey? represents 1; using this representation as the first basis vector, 
extending to a basis, and diagonalizing, we conclude that (—1,e) ~ (1, b}. By 
discriminants, we have —e = b up to squares. This brings us back to the first 
case. 

In cases (p, p) or (ep, ep), replacing x < px and dividing gives the quadratic 
forms x? + y? — pz? and x? + y? — epz’. If -1 € Z%?, then the form is isotropic; 
otherwise, we may take e = —1 and we are back to cases (e, p), (e, ep). 

In the final case (keeping pep!), we substitute x < px and divide by —p to get 
y? + ez? — px. If —1 € Ly then by substitution we change the middle sign to 
return to the first case. Otherwise, we may take e = —1, and the form is isotropic, 
a contradiction. 


This consideration of cases completes the proof. o 


Although direct, the proof we just gave has the defect that quadratic forms behave 
differently in characteristic 2, and so one may ask for a proof that works uniformly in 
all characteristics: we give such a proof in the next chapter by extending valuations. 

One of the nice applications of this classification is that it gives a necessary 
condition for two quaternion algebras to be isomorphic. Let B = (a,b | Q) be a 
quaternion algebra over Q and consider its scalar extension B, = B8&QQ, = (a, b | Qp). 
If B’ is another quaternion algebra over Q and B = B’, then this implies B, ~ B, for all 
primes p, and of course the same is true over R. Perhaps surprisingly, it turns out that 
the collection of all of these tests is also sufficient: if B, B’ become isomorphic over R 
and over Q, for all primes p, then in fact B ~ B’ are isomorphic over Q! This profound 
and powerful principle—detecting global isomorphism from local isomorphisms, a 
local-global principle—will be examined in chapter 14. 


12.2 Local fields 


In this section, we set up notation and basic results from the theory of local fields. 
The theory of local fields is described in many places, including Neukirch [Neu99, 
Chapters II, V], the classic texts by Cassels [Cas86] and Serre [Ser79]. Weil [Weil74] 
approaches number theory from the ground up in the language of local fields, building 
up the theory of local division rings. 
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Our motivation for local fields is as follows: we want a topology compatible with 
the field operations in which the field is Hausdorff and locally compact (every element 
has a compact neighborhood), analogous to what holds over the real and complex 
numbers. And to avoid trivialities, we will insist that this topology is not the discrete 
topology (where every subset of F is open). To carry this out, we begin with some 
basic definitions. 


Definition 12.2.1. A topological group is a group equipped with a topology such that 
the group operation and inversion are continuous. A homomorphism of topological 
groups is a group homomorphism that is continuous. 

A topological ring is a ring A equipped with a topology such that the ring oper- 
ations (addition, negation, and multiplication) are continuous. A homomorphism of 
topological rings is a ring homomorphism that is continuous. A topological field is a 
field that is also a topological ring in such a way that division by a nonzero element is 
continuous. 


One natural way to equip a ring with a topology is by way of an absolute value. To 
get started, we consider such notions first for fields. Throughout this section, let F be 
a field. 


Definition 12.2.2. An absolute value on F is a map 
|l: F > Rso 


such that: 


(i) |x| = 0 if and only if x = 0; 
(ii) |xy| = |x||y| for all x, y € F; and 
(iii) |x + y| < |x| + |y| for all x, y € F (triangle inequality). 


An absolute value || on F gives F the structure of a topological field by the metric 
d(x,y) = |x — y|. Two absolute values ||1,||2 on F are (strictly) equivalent if there 
exists c > 0 such that |x|; = |x|§ for all x € F; equivalent absolute values induce the 
same topology on F. 


Remark 12.2.3. If|| is an absolute value on F, then it need not be the case that x > |x|° 
for c > 0 is again absolute value, because it need not satisfy the triangle inequality. 
In particular, we will find it convenient to consider the square of the usual absolute 
value on F = C, which suffers from this deficiency. There are various ways around 
this problem; perhaps the simplest is just to ignore it. 


Definition 12.2.4. An absolute value is nonarchimedean if the ultrametric 
inequality 
|x + y| < sup{|xl, Iyl} 


is satisfied for all x, y € F, and archimedean otherwise. 


Example 12.2.5. The fields R and C are topological fields with respect to the usual 
archimedean absolute value. 
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Remark 12.2.6. A field with absolute value is archimedean if and only if it satisfies 
the archimedean property: for all x € F*, there exists n € Z such that |nx| > 1. In 
particular, a field F equipped with an archimedean absolute value has char F = 0. 


Example 12.2.7. Every field has the trivial (nonarchimedean) absolute value, defined 
by |0| = 0 and |x| = 1 for all x € F*%; the trivial absolute value induces the discrete 
topology on F. 


A nonarchimedean absolute value on a field F arises naturally by way of a valuation, 
as follows. 


Definition 12.2.8. A valuation of a field F isa map v: F —> RU {œ} such that: 


(i) v(x) = œ if and only if x = 0; 
(ii) v(xy) = v(x) + v(y) for all x, y € F; and 
(iii) v(x + y) = min(v(x), v(y)) for all x, y € F. 


A valuation is discrete if the value group v(F%) is discrete in R (has no accumulation 
points). 


Here, we set the convention that x + co = œ + x = œ for all x € RU {oo}. By 
(ii), the value group v(F*) is a subgroup of the additive group R, and so whereas an 
absolute value is multiplicative, a valuation is additive. 


Example 12.2.9, Each x € Q* can be written x = p’a/b with a, b € Z relatively prime 
and p { ab; the map v,,(x) = r defines the p-adic valuation on Q. 


Example 12.2.10. Let k bea field and F = k(t) the field of rational functions over k. For 


fO = gO/h® € kK {0} with g(t), h(t) € k[t], define v( f (t)) := deg h(t) — deg g(t). 
Then v is a discrete valuation on F. 


Given the parallels between them, it should come as no surprise that a valuation 
gives rise to an absolute value on F by defining 


jx] = co) (12.2.11) 


for a fixed c > 1; the induced topology on F is independent of the choice of c. By 
condition (iii), the absolute value associated to a valuation is nonarchimedean. 


Example 12.2.12. The trivial valuation is the valuation v satisfying v(0) = oo and 
v(x) = 0 for all x € F*. The trivial valuation gives the trivial absolute value on F. 


Two valuations v, w are equivalent if there exists a € Rs» such that v(x) = aw(x) 
for all x € F; equivalent valuations give the same topology on a field. A nontrivial 
discrete valuation is equivalent after rescaling (by the minimal positive element in the 
value group) to one with value group Z, since a nontrivial discrete subgroup of R is 
cyclic; we call such a discrete valuation normalized. 
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12.2.13. Given a field F with a nontrivial discrete valuation v, the valuation ring is 
R := {x € F : v(x) 2 0}. We have R* = {x € F : v(x) = 0} since 


v(x) + v(x7!) = v(xx7!) = v1) = 0 
for all x € F*. The valuation ring is a local domain with unique maximal ideal 
p:= {x € F : v(x) > 0} = R\R™. 


An element a € p with smallest valuation is called a uniformizer, and comparing 
valuations we see that 7R = (xr) = p. Since p Ç R is maximal, the quotient k := R/p 
is a field, called the residue field of R (or of F). 


Recall that a topological space is locally compact if each point has a compact 
neighborhood (every point is contained in a compact set containing an open set). 


Definition 12.2.14. A local field is a Hausdorff, locally compact topological field with 
a nondiscrete topology. 


In a local field, we can hope to understand its structure by local considerations in a 
compact neighborhood, hence the name. Local fields have a very simple classification 
as follows. 


Theorem 12.2.15. A field F with absolute value is a local field if and only if F is one 
of the following: 


(i) F is archimedean, and F = R or F = C; 
(ii) F is nonarchimedean with char F = 0, and F is a finite extension of Qp for some 
prime p; or 
(iii) F is nonarchimedean with char F = p, and F is a finite extension of the Laurent 
series field F,,((t)) for some prime p; in this case, there is a (non-canonical) 
isomorphism F = Fg((t)) where q is a power of p. 


A field F with absolute value | | is a nonarchimedean local field if and only if F is 
complete with respect to | |, and | | is equivalent to the absolute value associated to a 
nontrivial discrete valuation v: F — R U {œ} with finite residue field. 


Proof. See Neukirch [Neu99, Chapter II, $5], Cassels [Cas86, Chapter 4, $1], or Serre 
[Ser79, Chapter II, $1]. o 


Although a local field is only locally compact, the valuation ring is itself compact, 
as follows. 


Lemma 12.2.16. Suppose F is nonarchimedean. Then F is totally disconnected and 
the valuation ring R C F is a compact, totally disconnected topological ring. 


Proof. To see that F is totally disconnected (whence R too is totally disconnected), 
by translation it suffices to show that the only connected set containing 0 is {0}. Let 
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x € F* with |x| = 6 > 0. The image |F*| C Ry is discrete, so there exists 0 < € < ô 
such that |y| < ô implies |y| < 6 — e for all y € F. Thus an open ball is a closed ball 


D(O, 6) = {y E F : |y] < ô} = {y E F : |y] < ô — e} = D[0, 6 — e]; 


since x € F* and 6 > 0 were arbitrary, the only connected subset containing 0 is {0}. 
Next, we show R is compact. There is a natural continuous ring homomorphism 


$: R> | [R/p" 
n=1 
where each factor R/p” is equipped with the discrete topology and the product is given 
the product topology. The map ¢ is injective, since NX; p” = {0} (every nonzero 
element has finite valuation). The image of ¢ is obviously closed. Therefore R is home- 
omorphic onto its closed image. But by Tychonoff’s theorem, the product []7_, R/p” 
of compact sets is compact, and a closed subset of a compact set is compact, thus R is 
compact. Oo 


One key property of local fields we will use is Hensel’s lemma. 


Lemma 12.2.17. (Hensel’s lemma, univariate). Let F be a nonarchimedean local 
field with valuation v and valuation ring R, and let f(x) € R[x]. Leta € R satisfy 
m := v(f(a)) > 2v(f’(a)). Then there exists a € R such that f(a) = 0 anda =a 
(mod p”). 


Proof. The result is straightforward to prove using Taylor expansion or the same 
formulas as in Newton’s method. o 


Perhaps less well-known is the multivariate version. 


Lemma 12.2.18. (Hensel’s lemma). Let F be a nonarchimedean local field with 
valuation v and valuation ring R, and let f (x1, . . ., Xn) € R[x1, . . -, Xn] with n = 1. 
Let a € R” have m := v(f(a)) and suppose that 


Of 
ae o) > 0. 


Then there exists a € R” such that f(a) = 0 and 


m> 2min | 
L 


a=a (mod p”). 


Proof. One can reduce from several variables to the one variable version of Hensel’s 
lemma (Lemma 12.2.17): see Exercise 12.11. o 


Remark 12.2.19. With essentially the same proof, Hensel’s lemma holds more gener- 
ally for R a complete DVR (without the condition on the residue field) and becomes 
axiomatically the property of Henselian rings. 
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12.3 Classification via quadratic forms 


We now seek to classify quaternion algebras over local fields. 


12.3.1. First, suppose F is archimedean. When F = C, the only quaternion algebra 
over C up to isomorphism is B ~ M2(C). When F = R, by the theorem of Frobenius 
(Corollary 3.5.8), there is a unique quaternion division algebra over R. 


The classification of quaternion algebras over nonarchimedean local fields is quite 
analogous to the classification over R, as follows. 


Main Theorem 12.3.2. Let F + C be a local field. Then there is a unique division 
quaternion algebra B over F up to F-algebra isomorphism. 


We approach the proof of Main Theorem 12.3.2 from two vantage points. In this 
section, we give a proof using quadratic forms; in the next section, we give another 
proof by extending the valuation (valid in all characteristics). 

To prove this theorem, having dispatched the cases F = R, C in 12.3.1 above, from 
the previous section we may suppose F is a nonarchimedean local field with discrete 
valuation v, valuation ring R, maximal ideal p = mR with uniformizer 7, and residue 
field R/p = k. 


12.3.3. Since R is a DVR, all R-lattices M are free (and we only consider those of 
finite rank): i.e., M ~ R” for some n € Zo. Given such an R-lattice M, we can reduce 
modulo p to get M/pM ~ M 8r k ~ k”; conversely, any lift to M of any k-basis of 
M/pM is an R-basis for M, by Nakayama’s lemma. 


We recall Main Theorem 5.2.5, Corollary 5.2.6, and Main Theorem 5.4.4: iso- 
morphism classes of quaternion algebras over a field F are in natural bijection with 
nondegenerate ternary quadratic forms up to similarity, and the matrix algebra corre- 
sponds to any isotropic form. So to prove Main Theorem 12.3.2, it is equivalent to 
prove the following statement. 


Theorem 12.3.4. Let F + C be a local field. Then there is a unique anisotropic ternary 
quadratic form over F up to similarity. 


Rescaling shows equivalently that there is a unique anisotropic ternary quadratic 
form over a local field F + C of discriminant 1 up to isometry. So our task becomes 
a hands-on investigation of ternary quadratic forms over F. The theory of quadratic 
forms over F is linked to that over its residue field k, so we first need to examine 
isotropy of quadratic forms over a finite field. 


Lemma 12.3.5. A quadratic form Q: V > k over a finite field k with dim; V > 3 is 

isotropic. 

Proof. The proof is a delightful elementary exercise (Exercise 12.6). o 
Recall definitions and notation for quadratic forms over R provided in section 9.7, 


we embark on a proof in the case where chark + 2, beginning with the following 
lemma. 
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Lemma 12.3.6. Suppose chark + 2. Let Q: M —> R be a nonsingular quadratic 
form over R. Then the reduction Qg: M r k — k of Q modulo p is nonsingular 
(equivalently, nondegenerate) over k; moreover, Q is isotropic over R if and only if 
Q mod p is isotropic. 


Proof. For the first statement, by definition we have disc Q € R*, so disc Qg € k* by 
reduction. 

For the second, we first prove (=), let x € Mx{0} have Q(x) = 0. Since Q is 
homogeneous, we may suppose that x ¢ pM (divide by powers of m as necessary), 
so its image in M &, k is nonzero and thereby shows that Q% is isotropic. For (<=), 
let a € M be such that Q;(a) = 0 € k and a has nonzero reduction. Choose a basis 
M = R” and write Q(x1, . . ., Xn) = Q(x1e1 ++ ++ +Xnen) € R[x, ..., Xn] in the standard 
basis as a homogeneous polynomial of degree 2, let T be the associated symmetric 
bilinear form and [T] = (T(e;, e;));i,; the Gram matrix. We are almost ready to apply 
Hensel’s lemma (Lemma 12.2.18), but need to ensure convergence. We observe that 


0 os, P T(xi, Xj) Xj (12.3.7) 


OX; 


j=l 

so the vector of partial derivatives ((0Q/0x;)(a)); = [T]a is just the matrix product 
of the Gram matrix with the vector a = (a;);. Working modulo p, we have disc Qg = 
2" det[T] € k*, using that 2 € k*, so the kernel of [T] mod p is zero. Since a has 
nonzero reduction, we conclude that [T]a also has nonzero reduction, which means that 
min; v((0Q/0x;)(a)) = 0. Therefore the hypotheses of Hensel’s lemma are satisfied 


with m = 1, and we conclude there exists a nonzero a € M such that O(a) = 0 and so 
Q is isotropic. oO 


From Lemma 12.3.6, we obtain the following. 


Proposition 12.3.8. Suppose char k + 2. Let Q: M —> R be a nonsingular quadratic 
form over R with M free of rank at least 3. Then Q is isotropic. 


Proof. Combine Lemmas 12.3.5 and 12.3.6. o 
Considering valuations, we also deduce the following from Lemma 12.3.6. 


Lemma 12.3.9. Suppose char k + 2. Then F*/F”? ~ (Z/2Z)* and is represented by 
the classes of 1,e,m,em where e € R* is an element which reduces modulo p to a 
nonsquare in k. 


We now turn to the proof of our theorem in the case char k # 2. 


Proof of Theorem 12.3.4 (char k + 2). Let Q ~ (a, —b,—c) be an anisotropic ternary 
quadratic form over F. Then Q is nondegenerate. After rescaling and a change of basis 
(Exercise 12.8), we may suppose that a = 1 and 0 = v(b) < v(c). If v(b) = v(c) = 0 
then the quadratic form modulo p is nonsingular, so by Lemma 12.3.5 it is isotropic 
and by Lemma 12.3.6 we conclude Q is isotropic, a contradiction. 
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We are left with the case v(b) = 0 and v(c) = 1. By Lemma 12.3.9, we may 
suppose b = 1 or b = e where e is a nonsquare in k. If b = 1, then the form is obviously 
isotropic, so b = e. Similarly, c = m or c = en. In fact, the latter case is similar to the 
former: dividing by e, we have 


(1, -e, =en) ~ (e7',-1,-2) ~ (-1,e,-2) 


and since (—1, e) = (1, -e) (Exercise 12.7), we conclude Q ~ (1, —e, -7). 

To finish, we show that the form (1, —e, —7) is anisotropic. Suppose that x? — ey? = 
nz? with x,y,z € F° not all zero. By homogeneity, rescaling by a power of 7 if 
necessary, we may suppose x,y,z € R and at least one of x,y,z € R*. Reducing 
modulo p we have x? = ey? (mod p); since e is a nonsquare in k, we must have 
v(x), v(y) = 1. But this implies that v(z) = 0 and so v(mz?) = 1 = v(x? — ey?) > 2,a 
contradiction. Oo 


Predictably, the proof when char k # 2 involving quadratic forms does not general- 
ize in a simple way. However, armed with the above outline and acknowledging these 
complications, we now pursue the case char k = 2. A key ingredient will understanding 
certain binary quadratic forms, as follows. 


Lemma 12.3.10. There is a unique anisotropic binary quadratic form over k, up to 
isometry. Moreover, there is a unique (anisotropic) binary quadratic form over R whose 
reduction modulo is anisotropic, up to isometry (over R). 


When char k + 2, this unique class of binary forms over R is (1, e} in the notation 
above. We will want a similar bit of notation in the case char k = 2. 


12.3.11. Recall the issues (6.1.4) with inseparability in characteristic 2. Let p(k) = 
{z+ 27: z € k} be the Artin-Schreier group of k. The polynomial x? + x + a € k[x] 
is reducible if and only if a € g(k), and since k is finite, k/g(k) ~ Z/2Z (Exercise 
12.9). 

Let ¢ € R be such that its reduction to k represents the nontrivial class in k/ø(k). 


Proof of Lemma 12.3.10. We begin with the first part of the statement. Let Q be an 
anisotropic binary quadratic form over k, with T the associated bilinear form. The 
quadratic form Q (1) is isotropic by Lemma 12.3.5, and Q is anisotropic, so Q 
represents 1, say Q(x) = 1. Extending to a basis, we may rescale the second basis 
element y so that T(x, y) = 0, 1. 


+ Suppose T(x, y) = 0, so Q = (1, —b} for b € k*. Since Q is isotropic, b ¢ k? 
so char k + 2, the class b € kX [ke is unique, and indeed Q is anisotropic. 
Suppose T(x, y) = 1. If char k + 2, we may complete the square and reduce to 
the previous case, so we suppose char k = 2 and Q(x, y) = x? + xy + cy? with 
c € k. Working now with the Artin—Schreier group, since Q is anisotropic we 
must have c + p(k) = t + p(k) € k, giving uniqueness. 


For the second statement, if Q is a binary quadratic form over R whose reduction 
modulo p is anisotropic, then we can find a change of basis over k to put Qx in the 
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unique isometry class found above; lifting this basis, we may suppose that Q; is equal 
to this fixed form. The statement then follows by using Hensel’s lemma to lift the 
identity between any two such lifts, and it is a nice application of Hensel’s lemma in 
two (several) variables: see exercise (Exercise 12.12). im 


We now return to the proof of our theorem. 


Proof of Theorem 12.3.4 (char k = 2). We first claim that the form [1, 1,t]&(z) is 
anisotropic; this follows from a straightforward modification of the argument as in the 
proof when char k + 2 above. 

We now show this form is the unique one up to similarity. Suppose that Q is a 
ternary anisotropic form over R. Let x € V be nonzero; since Q is anisotropic, we 
may scale x so that a := Q(x) € R. Since dimV > 3, there exists nonzero y’ € V 
such that T(x, y’) = 0; rescale y’ so that Q(y’) € R. Let y := x + y’. Then T(x, y) = 
T(x,x + y’) = a and b := Q(y) € R, so Q on this basis is ax? + axy + b? ~ [a, a, D]. 
We compute that disc [a,a,b] = a(a — 4b) = a? (mod p), so disc[a,a,b] € R* 
and [a,a,b] is nonsingular; completing to a basis with a nonzero element in the 
orthogonal complement and rescaling, we may suppose without loss of generality 
that Q = [a,a,b]®(c) with a,b,c € R and both v(a) = 0,1 and v(c) = 0,1, with 
disc Q = a(a — 4b)c. This leaves four cases. 


- If v(a) = v(c) = 0, then reducing modulo p we have disc Qg = a’c + 0 so Qk 
is nondegenerate. By Lemma 12.3.5, Q; is isotropic. Hensel’s lemma (Lemma 
12.2.18) applies to Q, showing that Q is isotropic and giving a contradiction—we 
omit the details. 

Suppose v(a) = 0 and v(c) = 1. Rescaling by a unit we may suppose c = z. 
The reduction [a, a, b]; modulo p is nondegenerate, so if it is isotropic then Q is 
isotropic, a contradiction. Therefore by Lemma 12.3.10, we may suppose a = 1 
and b = t; this is the desired form. 

Next, consider the case v(a) = 1 and v(c) = 0; we may suppose c = 1. If 
v(b) > 2, then 27![a, a,b] ~ [1, 1, b/r] so the argument in the previous case 
applies to show that Q is isotropic. If v(b) = 1, then we scale z by z and divide 
Q by to reduce to the previous case. Finally, if v(b) = 0, we scale y by m and 
n'[a, a, bn] ~ [1, 1, b(x7/a)] is again isotropic, a contradiction as in the case 
v(b) 2 2. 

In the case v(a) = v(c) = 1, if v(b) > 0 then dividing through by z we reduce to 
the first case, so we may suppose v(b) = 0. Then we multiply by a and replace 
x — ax, and y — y/b and z + 2/z to work with [1, a/b, a/b] (ac/n?), which 
after interchanging x, y reduces to the previous case v(a) = 1, v(c) = 0. 


Having exhausted the cases and the reader, the result now follows. oO 


Corollary 12.3.12. Let F be a nonarchimedean local field with valuation ring R and 
uniformizer n € R. Let B be a quaternion algebra over F. 
If char k + 2, then B is a division algebra if and only if 


B= ($3). where e € R™ is nontrivial in ae ia 
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and if char F = char k = 2, then B is a division algebra if and only if 


t 
B= | =) where t € R is nontrivial in k/ 9k). 


In Theorem 13.3.11, we rephrase this corollary in terms of the unramified quadratic 
extension of F. 


Remark 12.3.13. In mixed characteristic where char F = O and chark = 2, in the 
extension K = F[x]/(x? + x + t) for t nontrivial in k/ (k) we can complete the square 
to obtain K = F(Ve) with e € FX\F*. 


Definition 12.3.14. Let B be a quaternion algebra over F. The Hasse invariant of B 
is defined to be —1 if B is a division algebra and +1 if B ~ M2(F). 


12.4 Hilbert symbol 


Let F be a local field with char F + 2. We record the splitting behavior of quaternion 
algebras as follows. 


Definition 12.4.1. We define the Hilbert symbol 
(, )p: FX x F* > {+1} 


,b 
by the condition that (a, b)r = 1 if and only if the quaternion algebra (=) ~ M2(F) 


is split. 


The Hilbert symbol is well-defined as a map 
PF! x F* |F => {+1} 


(Exercise 2.4). By Main Theorem 5.4.4(v), we have (a,b)p = 1 if and only if the 
Hilbert equation ax? + by? = 1 has a solution with x, y € F: this is called Hilbert’s 
criterion for the splitting of a quaternion algebra. 


„b 
— ) and (a, b) is intentional; 


Remark 12.4.2. The similarity between the symbols 


but they are not the same, as the former represents an algebra and the latter takes the 
value +1. 
In some contexts, the Hilbert symbol (a, b)F is defined to be the isomorphism class 


,b 
of the quaternion algebra (=) in the Brauer group Br(F), rather than +1 according to 


whether or not the algebra is split. Conflating these two symbols is not uncommon and 
in certain contexts it can be quite convenient, but we warn that it can lead to confusion 
and caution against referring to a quaternion algebra or its isomorphism class as a 
Hilbert symbol. 


Lemma 12.4.3. Let a, b € F*. Then the following statements hold: 
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(a) (ac?, bd”) = (a, b)p for all c,d € F”. 

(b) (b,a)F = (a, b)r. 

(c) (a, b)r = (a, —ab)r = (b, -ab)r. 

(d) (d, a)r = (4,-a)r = 1. 

(e) Ifa # 1, then (a, 1 —- a)r = 1. 

(£) Ifo € Aut(F), then (a, b)r = (o(a), o (b))F. 


Proof. Statements (a)-(c) follow from Exercise 2.4. For (d), the Hilbert equation 
xX +a y? = | has the obvious solution (x, y) = (1,0). And (a, —a) is isotropic (taking 
(x,y) = (1, 1)) so is a hyperbolic plane and represents 1 as in the proof of Main 
Theorem 5.4.4, or we argue 


(a, -a)r = (a,a*)r = (a, 1)F = (l,a) = 1 


by Exercise 2.4. For part (e), by Hilbert’s criterion (a, 1 — a)r = 1 since the quadratic 
equation ax? +(1- a)y? = | has the solution (x, y) = (1, 1). Finally, part (f): the Hilbert 
equation ax? + by” = 1 has a solution with x, y € F if and only if o(a)x? + a(b)y” =1 
has such a solution. oO 


Remark 12.4.4. Staring at the properties in Lemma 12.4.3 and seeking to axiomatize 
them, the study of symbols like the Hilbert symbol leads naturally to the definition of 
K2(F). In its various formulations, algebraic K-theory (K for the German “Klasse”, 
following Grothendieck) seeks to understand certain kinds of functors from rings to 
abelian groups in a universal sense, encoded in groups K,,(R) for n € Zso and R a 
commutative ring: see e.g. Karoubi [Kar2010]. For a field F, we have Kọ(F) = Z and 
K\(F) = F*. By a theorem of Matsumoto [Mat69] (see also Milnor [Milno71]), the 
group K2(F) is the universal domain for symbols over F: 


K2(F) := (F* &z F*)/(a@®@(1—a):a#0,1). 


(The tensor product over Z views F™ as an abelian group and therefore a Z-module.) 
The map a 8 b } (a,b)r extends to a map K2(F) — {+1}, a Steinberg symbol, a 
homomorphism from K(F) to a multiplicative abelian group. The higher K-groups 
are related to deeper arithmetic of commutative rings. For an introduction, see Weibel 
[Weib2013] and Curtis—Reiner [CR87, Chapter 5]. 


We now turn to be quite explicit about the values of the Hilbert symbol. We begin 
with the case where F is archimedean. If F = C, then the Hilbert symbol is identically 
1. If F = R, then 


(12.4.5) 


1, ifa>Oorb>0; 
(a, b)R = . 
-1, ifa<Oandb<0O. 


Lemma 12.4.6. The Hilbert symbol defines a nondegenerate symmetric bimultiplica- 
tive pairing 


(, )p: ad) ieee F* |F > {+1}. 
By bimultiplicativity, we mean that 


(a,bc)r = (a,b) (a,c)r and (ab,c)r = (a,c)p(b,c)F (12.4.7) 
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for all a, b,c € F* (equivalent, by symmetry). 
Keeping in the vibe of this section, we give a proof under the hypothesis that 
char k # 2; for a general proof, see Corollary 13.4.6. 


Proof (char k # 2). This lemma can be read off of the direct computation below 
(12.4.9), recording what was computed along the way in the proof of Theorem 12.3.4. 
o 


12.4.8. Since the Hilbert symbol is well-defined up to squares, the symbol (a, b)p is 
determined by the values with a,b € {1, e, 7, er} where e is a nonsquare in k™. Let 
s = (—1)#%-D/2 so that s = 1, —1 according as —1 is a square in k. Then: 


(a,b)r | 1 œe nm en 
1 1 1 1 1 
e 1 1 -1 -1 (12.4.9) 
m 1 -1 s -s 
en 1 -1 -s s 


The computation of this table is requested in Exercise 12.15. For example, if a = e, we 
showed in the proof of Theorem 12.3.4 that (e, 1)r = (e,e)r = 1, because ex? +yz 
and ex? + ey? — z? are isotropic. 

In general, writing a = aga” and b = box”) we have 


v(b) v(a) 
(a, b)p = (-1ytemnone-ne{ 2) (+) (12.4.10) 
where q = #k and 
(<) = 0,41 =c4/2 (mod p) (12.4.11) 


is the Legendre symbol: see Exercise 12.16. (Multiplicativity can also be read off of 
the formula 12.4.10.) 


12.4.12. The following easy criteria follow from 12.4.9 (or (12.4.10)): 


(a) If v(ab) = 0, then (a, b)p = 1. 
(b) If v(a) = 0 and v(b) = v(x), then 


(a,b) a 1 ifa € k*?; 
a, ae a = 
Po \p -1 ifae Vk. 


12.4.13. To compute the Hilbert symbol for a local field F with char F = 0 and 
char k = 2 is significantly more involved. But we can at least compute the Hilbert 
symbol by hand for F = Q2. 

To begin, the group Qe? is generated by —1,—3,2, so representatives are 
{+1, +3, +2, +6}. We recall Hilbert’s criterion: (a, b)r = 1 if and only if ax? + by? =1 
has a solution with x, y € F. 
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If a,b € Z are odd, then 


ax? + by? = z? has a nontrivial solution in Q2 
© az=1(mod4)orb= 1 (mod 4); 


by homogeneity and Hensel’s lemma, it is enough to check for a solution modulo 4. 
This deals with all of the symbols with a, b odd: summarizing, we have in this case 


(a,b), = (Ne VOD, (12.4.14) 


By the determination above, we see that (—3, b) = —1 for b = +2, +6 and (2, 2)2 = 
(—1,2)2 = 1 the latter by Hilbert’s criterion, as —1 + 2 = 1; knowing multiplicativity 
(Lemma 12.4.6), we have uniquely determined all Hilbert symbols, in particular, for 


a € Z odd we have 
(a, D) = (1) D8, (12.4.15) 


It is still useful to compute several of these symbols individually, in the same manner 
as (12.4.13) (working modulo 8): see Exercise 12.17. We summarize the results here: 


(abha |1 -3 -1 3 2 -6 -2 6 
I I I 1 of I I I I 
-3 |1 1 1 1 -1 -1 -1 -l 
-1 |1 1 -1 -1 1 1 -l -l 
3 |1 1 -1 -1 -1 -1 1 1 (12.4.16) 
2 |i =. 1 -1 1 sb 1 -l 
-6 |1 -1 1 Sb af 1 -1 1 
-2 |1 -1 -1 1 1 -1 -1 1 
6 We -1 -1 1 -1 1 1 -l 


Remark 12.4.17. Analogously, one can define a symbol [a, b)r for the splitting of 
quaternion algebras for F a local field with char F = 2. This symbol is no longer called 
the Hilbert symbol, but many properties remain: in particular, there is still an analogue 


b 
of the Hilbert equation, and |<] is split if and only if bx? + bxy + aby? = 1 has a 


solution with x, y € F. 


Exercises 
1. Let p be an odd prime. Show 
(a) Show the equality 


(= 4x)!? =1- )°C,x" € Zilx] 


nat 
of formal series in x with coefficients in Z, where 
1 2n 
Cra- — EZ 
2n- 1 | n ) a 


are the Catalan numbers. [Hint: use binomial expansion. ] 
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2. 


> 4. 


> 6. 


> 7. 


> 8. 


> 9, 


(b) Let p be an odd prime. Show that the squaring map is bijective on 1 + pZp. 
[Hint: show that the series expansion in (a) converges in Zp.] 


Recall that a topological space is T; if for every pair of distinct points, each point 
has an open neighborhood not containing the other. 


(a) Show that a topological space X is T; if and only if {x} is closed for all 
xeXx. 

(b) Let G be a topological group. Show that G is Hausdorff if and only if G is 
Tı. 


. In this exercise we prove some basic facts about topological groups. Let G be a 


topological group. 


(a) Let H < G be a subgroup. Show that H is open if and only if there exists 
h € H and an open neighborhood of h contained in H. 

(b) Show that if H < G is an open subgroup, then H is closed. 

(c) Show that a closed subgroup H < G of finite index is open. 

(d) Suppose that G is compact. Show that an open subgroup H < G is of finite 
index, and that every open subgroup contains an open normal subgroup. 


Let G be a topological group. Let U > 1 be an open neighborhood of 1. 


(a) Show that there exists an open neighborhood V C U of 1 € V such that 
V? =V. VCU. [Hint: Multiplication is continuous. ] 

(b) Similarly, show that there exists an open neighborhood V C U of 1 € V 
such that V~'V c U. 


. Let G be a topological group and let H < G be a closed subgroup. Equip G/H 


with the quotient topology. Show that G/H is Hausdorff. [Hint: Use Exercise 
12.4(b).] 


Let k be a finite field and let Q: V — k be a ternary quadratic form. Show that 
Q is isotropic. [Hint: Reduce to the case of finding a solution to y? = f(x) where 
f is a polynomial of degree 2. If #k is odd, count squares and the number of 
distinct values taken by f(x) ink. Second approach: reduce to the case where #k 
is odd, and show that x? + y? represents a nonsquare, since the squares cannot 
be closed under addition! ] [This repeats Exercise 5.5!] 


Let k be a finite field with char k + 2 and let e € k*. Show directly that there is 
an isometry (—1, e) = (1,-e). 


Let R be a DVR with field of fractions F, let a, b,c € F be nonzero and let Q = 
(a, —b, —c). Show that Q is similar over F to (1, —b’, —c’) with 0 = v(b’) < vic’). 
[Hint: first get v(a), v(b), v(c) € {0, 1}.] 


Let k be a finite field with even cardinality. Show that #k/g(k) = 2, where p(k) 
is the Artin-Schreier group. 
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10. By Theorem 12.2.15, a complete archimedean local field is isomorphic to R or 


> 11. 


> 12. 


13. 


C. Extend this classification to division algebras as follows. 
The notion of absolute value (Definition 12.2.2) extends to a division algebra 
without modification, as does the notion of archimedean and nonarchimedean. 


(a) Show that H has an absolute value |a| = nrd(q) for œ € H. 

(b) Let D be a division algebra equipped with an absolute value | |. Show that 
if | | is archimedean, then char D = 0 and if the restriction of | | to its 
center Z(D) is archimedean. 

(c) Show that every division algebra complete with respect to an archimedean 
absolute value is isomorphic to R, C, or H and with the absolute value 
equivalent to the absolute value |a| = Vnrda in each case. [Hint: recall 
Theorem 3.5.1.] 


Prove Lemma 12.2.18 using Lemma 12.2.17. [Hint: let j be the index that 
achieves the minimal valuation among partial derivatives, and consider the 
restriction f(a, ...,@j-1,X, Aj+1,-++,4n) E€ R[x] to one variable. ] 


In this exercise, we consider an extension of Hensel’s lemma to several polyno- 
mials (in several variables). Let F be a nonarchimedean local field with valuation 
v and valuation ring R. 


(a) Let fi,..., fa E€ RE, . . ., Xn] be polynomials with n > 1. Let a € R” have 
m := min; v(f;(a)). Let 


afi 
Dy := | fi o) € M, (R). 
Ox; ma 
ij 

Suppose m > 2v(detD;(a)). Show that there exists a € R” such that 
f(a) = 0 for all i = 1,...,n and a = a (mod p”). [Hint: by Taylor 
expansion, write f(a+n'x) = f(a)+Df(a)p'x + p*r(x) in vector notation 
with t > m, and iteratively solve the system in a manner analogous to 
Newton’s method. See Conrad [Con] for a complete development. ] 


(b 


ma 


Show that (a) also holds more generally if the number of polynomials r has 
r < n and there exists an r x r matrix minor of D¢(a) whose determinant 
has valuation < m/2. [Hint: see Exercise 12.11.] 


(c 


wa 


Finish the proof of Lemma 12.3.10 as follows. Let Q, Q’ be two binary 
quadratic forms over R such that their reductions Qg, Q, are anisotropic. 
Show that Q ~ Q’. [Hint: reduce to the case where Qk = Qi. If Qk ~ 
(1, —e}, rescale the basis using a plain vanilla flavor of Hensel’s lemma. 
If Qk = [1, 1,t], reduce to showing that Q = [1,1,t]. Consider a general 
change of variables in GL2(R) from [1,1,t] that reduces to the identity 
modulo p and apply the deluxe version of Hensel’s lemma in part (b) to 
the resulting system of three equations in four unknowns: there is a minor 
with determinant 1 — 4t € R*.] 


Let F#C be a local field and let Q be a nondegenerate ternary quadratic form 
over F. Let KDF be a quadratic field extension. Show that Q is isotropic over K. 
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14. 


> 15. 
> 16. 


> 17. 


18. 


Give another proof of Lemma 12.4.6 that the local Hilbert symbol is bimulti- 
plicative using Example 8.2.2 and the Brauer group (section 8.3). 


Show that the table of Hilbert symbols (12.4.9) is correct. 


One can package 12.4.8 together with multiplying by squares to prove the fol- 
lowing more general criterion. Let F be a nonarchimedean local field with 
uniformizer x, valuation v with v(x) = 1, and residue field k. Let q = #k and 
suppose q is odd. 

Show that for a, b € F*, if we write a = aga” and b = box”), then 


v(b) v(a) 
(a byr = (-ayreomoner a 2) (2) 


T T 


Show that the table of Hilbert symbols (12.4.16) is correct by considering the 
equation ax? + by? = 1 (mod 8). 


Prove a descent for the Hilbert symbol, as follows. Let K be a finite extension 
of the local field F with char F + 2 and let a,b € F*. Show that (a, b)g = 


(a, Nmgir (b))r = (Nmxgir (a), b)F. 
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Chapter 13 


Quaternion algebras over local fields 


In this chapter, we approach the classification of quaternion algebras over local fields 
in a second way, using valuations. 


13.1 Extending the valuation 


Recall (section 12.1) the valuation v = v, on Qp, measuring divisibility by p. We have 


Zp = {x € Qp : v(x) = 0}, and 
PZp = {x € Qp : v(x) > 0}; 


Indeed, these can profitably be taken as their definition. 


(13.1.1) 


For any finite extension K 2 Qp of fields, there is a unique valuation w on K such 
that wla, = v (so w extends v), defined by 
v(Nmxig, œ) 
[K : Qp] 


The integral closure of Zp in K is the valuation ring {x € K : w(x) = 0} 2 Zp, and its 
unique maximal ideal is {x € K : w(x) > 0}, as in (13.1.1). 


(13.1.2) 


w(x) := 


For example, there is a unique unramified quadratic extension K of Qp: we have 
K = Q,(ve), where e = —3 for p = 2 and otherwise e € Z is a quadratic nonresidue 
modulo p for p odd. It is common to write K = Q,„2 for this field and Z,2 for its 
valuation ring, since the residue field of K is F „2. 

In a completely parallel fashion, let B be a division quaternion algebra over Qp. 
Then there is again a unique valuation w extending v, defined by 


w: B—> RU {oo} 


v(nrd(@)) (13.1.3) 
eT 
2 
The valuation ring 
O := {a € B : w(a) = 0} (13.1.4) 
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is the unique (!) maximal R-order in B, consisting of all elements of B that are integral 
over Zp. The set 

P := {œ € B : w(@) > 0} (13.1.5) 
is the unique maximal two-sided (bilateral) ideal of O. 


Using the unique extension of the valuation, we obtain the following main result 
of this chapter (a special case of Theorem 13.3.11). 


Theorem 13.1.6. Let q := p*. Then the following statements hold. 


(a) There is a unique division quaternion algebra B over Q,, up to isomorphism 
Zer) 

Qp ` 

(b) The valuation ring of B is O ~ Z4 ® Zgj. 

(c) The maximal ideal P = Oj has P? = pO and O/P = Zg|/plig ~ Fg. 


given by B = 


The method of proof used in this classification can also be used to classify central 
division algebras over local fields in much the same manner. 


13.2 Valuations 


To begin, we briefly review extensions of valuations; for further reading, see the 
references given in section 12.2. 

Let R be a complete DVR with valuation v: R > Zso U {ov}, field of fractions 
F, maximal ideal p generated by a uniformizer m (with v(x) = 1), and residue field 
k := R/p. Then R is an integrally closed PID (every ideal is a power of the maximal 
ideal p), and R = {x € F : v(x) = 0}. Let ||, be an absolute value attached to v, as in 
(12:2-11). 

Let K 2 F be a finite separable extension of degree n := [K : F]. Then in fact K 
is also a nonarchimedean local field; more precisely, we have the following lemma. 


Lemma 13.2.1. There exists a unique valuation w on K such that w\|r = v, defined by 


v(Nmgır(x)) 
[K: F] 


The integral closure of R in K is the valuation ring 


w(x) := (13.2.2) 


S:={x € K : w(x) > 0}. 
When w|r = v, we say that w extends v. 


Proof. See e.g. Neukrich [Neu99, Chapter II, Theorem (4.8)], Cassels [Cas86, Chapter 
7, Theorem 1.1], or Serre [Ser79, Chapter II, §2, Proposition 3]. oO 


By the same token using (13.2.2), there exists a unique absolute value | |, on K 
which restricts to | |, on F; we pass freely between these two formulations. 
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13.2.3. We say K 2 F is unramified if a uniformizer 7 for F is also a uniformizer for 
K. We say K 2 F is totally ramified if a uniformizer zx has the property that 7% is a 
uniformizer for F. 

In general, there is a (unique) maximal unramified subextension Kun € K, and the 
extension K 2 Kın is totally ramified. 


We say that e = [K : Kun] is the ramification degree and f = [Kun : F] the inertial 
degree, and the fundamental equality 


n=[K:Fl=ef (13.2.4) 
holds. 


13.2.5. Suppose that F is a local field (equivalently, the residue field k is a finite field). 
Then for all f € Zs), there is a unique unramified extension of F of degree f and such 
a field corresponds to the unique extension of the residue field k of degree f. In an 
unramified extension K 2 F of degree [K : F] = f, we have Nmgjr(K*) = RX x2, 
so b € Nm r(K~%) if and only if f | v(b). 

If char k + 2, then by Hensel’s lemma, the unramified extension of degree 2 is given 
by adjoining a square root of an element of R which reduces to the unique nontrivial 
class in k*/k*?; if char k = 2, then the unramified extension of degree 2 is given by 
adjoining a root of the polynomial x? + x + t where t € R reduces to an element which 
is nontrivial in the Artin-Schreier group k/g(k) (recalling 12.3.11). 


Before proceeding further, we describe local fields by their defining polynomials— 
we will need this later in the study of norms and strong approximation. 


Lemma 13.2.6. (Krasner’s lemma). Let K 2 F be a finite, Galois extension with 
absolute value | |w. Leta, p € K, and suppose that for allo € Gal(K | F) witho (a) # a, 
we have 

læ — Blw < la-c(@)|w- (13.2.7) 
Then F(a) © F(B). 


Intuitively, we can think of Krasner’s lemma as telling us when £ is closer to a 
than any of its conjugates, then F (£) contains aq. It is for this reason that we state the 
lemma in terms of absolute values (instead of valuations). 


Proof. Let o € Gal(K | F(8)) have o(@) + a. Then by the ultrametric inequality, 


|I7(a) — aly = |ala) —B + B— aly < max(lo (a) — Llw, |B — al) 


(13.2.8) 
= max(|o(a — P)lw, |B — alw) = la — Llw, 
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the final equality a consequence of (13.2.2) and the fact that Galois conjugates have the 
same norm. This contradicts the existence of o, so o (œ) = a for all o € Gal(K | F()). 
By Galois theory, we conclude that F(a) € F(£). o 


Corollary 13.2.9. Let f(x) = x” + an-1 x"! + -- -+ ao € F[x] be a separable, monic 
polynomial. Then there exists 6 > 0 such that whenever g(x) = x" + bp-1 x"! +--+ + 
bo € F[x] has |a; — bily < 6, then 


Flx]/(f Œ) = Fix] /e&)). 
In particular, if f(x) is irreducible then g(x) is irreducible. 


Proof. Since f(x) is separable, its discriminant disc( f) is nonzero. The discriminant 
is a polynomial function in the coefficients, so by continuity (multivariate Taylor 
expansion), there exists 6; > 0 such thatif g(x) = x"+---+bo € F[x] has |a;—bily < 61 
for alli, then |disc(g)—disc(f)|,, < |disc(f)|,; by the ultrametric inequality, we conclude 
that for such g(x) we have |disc(g)|,, = |disc(f)|,, so in particular disc(g) + 0. 

Let g(x) be as in the previous paragraph; then g(x) is separable. We first consider 
the case where f(x) is irreducible. Let K 2 F be a splitting field for the polynomials 
f(x) = [Œ — a) € K[x] and g(x) = [TjL,(« — £i). Let || on K extend | |y. Let 


e€ := mina; — ajly. (13.2.10) 
i$j 
Finally, let 
p(g) = pbo,- - bn-1) =| [8 = | [œ - 6). (13.2.11) 
i=l ij=l 


The map g > p(g) is again a polynomial in the coefficients bo, ..., bn-1 (indeed, it 
is a polynomial resultant). Therefore there exists 6 > 0, with 6 < ôi, such that if 
la; — bily < ô, then |p(g)lw < e”. Therefore in (13.2.11), there exists i, 7 such that 
la; — Bjlw < €. Together with (13.2.10), we have 


la; — Bjlw < € < lai -— axlw 


for all k + i. By Krasner’s lemma (Lemma 13.2.6), we conclude that F(a;) © F(f;). 
Since f(a@;) = g(6;) = 0 and f is irreducible, we have 


[F(a;): F] =n <[F(6;): Fl] <n 
so in fact [F(6;) : F] = n and F(a;) = F(6;). Finally, 
F[x]/(f(x)) = Flai) = F(B;) = Fix) /(g(%)) 


as desired. 

The case when f(x) = fi(x): +- f-(x) is reducible follows by repeating the above 
argument on each factor, and finishing using the continuity of multiplication among 
the coefficients: the details are requested in Exercise 13.16. oO 
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13.3 Classification via extensions of valuations 


We now seek to generalize this setup to the noncommutative case; we retain the notation 
from the previous section. Let D be a central (simple) division algebra over F with 
dime D = [D : F] = n?. We extend the valuation v to a map 


w:D—>RU {oo} 
v(Nmpjr(@)) _ v(nrd(a)) (13.3.1) 
[D:F] — n ’ 


where the equality follows from the fact that Nmp\r(@) = nrd(a)” (see section 7.8). 


Lemma 13.3.2. The map w is the unique valuation on D extending v, i.e., the following 
hold: 


(i) w(@) = œ if and only if a = 0. 

(ii) w(@B) = w(a@) + w(B) = w(Ba) for all a, B € D. 
(iii) w(@ + B) = min(w(a), w(B)) for all a, B € D. 
(iv) w(D%) is discrete in R. 


Proof. Since D is a division ring, statement (i) is immediate. Statement (ii) follows 
from the multiplicativity of nrd and v. To prove (iii), we may suppose 6 + 0 and so 
B € D*. We have 


wla + B) = w((aB"! + 1)B) = w(aB! + 1) + w(B). 


But the restriction of w to F (ap!) is a discrete valuation, thus w(ap! +1) œ> 
min(w(@B7!), w(1)) and by (ii) w(@w + B) > min(w(a), w(B)), as desired. Finally, (iv) 
holds since w(D*) C v(F*)/n and the latter is discrete. The valuation is unique because 
it is unique whenever it is restricted to a subfield. o 


13.3.3. From Lemma 13.3.2, we say that w is a discrete valuation on D since it 
satisfies the same axioms as for a field. It follows from Lemma 13.3.2 that the set 
O := {æ E€ D : w(a) = 0} 


is a ring, called the valuation ring of D. 


Proposition 13.3.4. The ring O is the unique maximal R-order in D, consisting of all 
elements of D that are integral over R. 


Proof. First, we prove that 
O = {a € D : ais integral over R}. (13.3.5) 


We first show the inclusion (2) of (13.3.5), and suppose œ € D is integral over 
R. Since R is integrally closed, by Lemma 10.3.5 the coefficients of the minimal 
polynomial f(x) € F[x] of œ belong to R. Since D is a division ring, f(x) is irreducible 
and hence the reduced characteristic polynomial g(x) is a power of f(x) and thus has 
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coefficients in R. Up to sign, the constant coefficient of g(x) is nrd(@), so w(@) = 
v(nrd(@)) > 0, hence a € O. 

Next we prove (C) in (13.3.5). Suppose œ € O, so that w(a@) > 0, and let K = F(a). 
Let f(x) € F[x] be the minimal polynomial of a. We want to conclude that f(x) € R[x] 
knowing that w(a@) > 0. But the restriction of w to K is the unique extension of v to K, 
and this is a statement about the extension K 2 F of fields and therefore follows from 
the commutative case, Lemma 13.2.1. 

We can now prove that O is an R-order. Scaling an element of D* by an appropriate 
power of 7 gives it positive valuation, so OF = D. To conclude, we must show that O 
is finitely generated as an R-module. Recall that D is a central division algebra over F, 
hence a separable F-algebra, so we may apply Lemma 10.3.7: every œ € O is integral 
over R and O is aring, and the lemma implies that O is an R-order. 

Finally, it follows immediately that O is a maximal R-order: by Corollary 10.3.3, 
every element of an R-order is integral over R, and O contains all such elements. O 


Remark 13.3.6. For a quaternion division algebra D, we can argue more directly in 
the proof of Proposition 13.3.4 using the reduced norm: see Exercise 13.4. 


13.3.7. It follows from Proposition 13.3.4 that O is a finitely generated R-submodule 
of D. But R is a PID so in fact O is free of rank [D : F] as an R-module. We have 


O* = {aE D: w(a) = 0} (13.3.8) 
since w(a~!) = —w(q), and in particular œ € O% if and only if nrd(a) € R*. Conse- 
quently, 

P :={a € D : w(a) > 0} = O\O* (13.3.9) 


is the unique maximal two-sided (bilateral) ideal of O, as well as the unique left or 
right ideal of O. Therefore O is a noncommutative local ring, a noncommutative ring 
with a unique maximal left (equivalently, right) ideal. 


13.3.10. Let 8 € P have minimal (positive) valuation w(8) > 0. Then for all 0 + œ € P 
we have w(aB™!) = w(a) — w(B) = 0 so «87! € O and a € OB. Arguing on the other 
side, we have also a € BO. Thus P = OB = BO = OBO. 

Arguing in the same way, we see that every one-sided ideal of O is in fact two-sided, 
and every two-sided ideal of O is principally generated by any element with minimal 
valuation hence of the form P” for some r € Zso. 


We are now prepared to give the second proof of the main result in this chapter 
(Main Theorem 12.3.2). We now add the hypothesis that F is a local field, so that k is 
a finite field. 


Theorem 13.3.11. Let F be a nonarchimedean local field. Then the following state- 
ments hold. 


(a) There is a unique division quaternion algebra B over F, up to F-algebra iso- 
morphism given by 
F) 
B = | — ], 
F 


where K is the unique quadratic unramified (separable) extension of F. 
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(b) Let B be as in (a). Then the valuation ring of B is O = S ® Sj, where S is the 
integral closure of R in K. Moreover, the ideal P = Oj is the unique maximal 
ideal; we have P? = nO, and O/P 2 R/p is a quadratic extension of finite fields. 


Proof. We begin with existence in part (a), and existence: we prove that B = (K, m | F) 
is a division algebra. We recall that B is a division ring if and only if z ¢ Nmx\r(K*) 
by Main Theorem 5.4.4 and Theorem 6.4.11. Since K 2 F is unramified, we have 
Nmxyr(K*) = R*x? by 13.2.5. Putting these together gives the result. 

Continuing with (a), we now show uniqueness. Let B be a division quaternion 
algebra over F. We refer to 13.3.10, and let P = Of. Then w(f) € 5Z>0, so 


w(B) < w(t) = v(t) = 1 < 2w(B) = w(B); (13.3.12) 


we conclude that 8O = P 2 tO 2 P? = B*O. The mapa + a yields an isomorphism 
O/P > P/P? of k-vector spaces, so 


4 = dimg(O/mO) < dim,(O/P?) = dimy(O/P) + dim,(P/P?) = 2 dimg(O/P) 
(13.3.13) 
and thus dim;(O/P) > 2, with equality if and only if rO = P?. 

As in (13.3.9), we have O\P = O%, so the ring O/P is a division algebra over 
k and hence a finite division ring. By Wedderburn’s little theorem (Exercise 7.30), 
we conclude that O/P is a finite field! So there exists i € O such that its reduction 
generates O/P as a finite extension of k. But i satisfies its reduced characteristic 
polynomial, a monic polynomial of degree 2 with coefficients in R, so its reduction 
satisfies a polynomial of degree 2 with coefficients in k. Since i is a generator, we 
conclude [O/P : k] < 2. Together with the conclusion of the previous paragraph, 
we conclude that [O/P : k] = dim,(O/P) = 2, in other words O/P is a (separable) 
quadratic field extension of k. It then follows from 13.2.5 that K := F(i) is the unique, 
unramified (separable) quadratic extension of F. Therefore equality holds in (13.3.13) 
and P? = nO. Since B?O = P? = nO, we have w(B) = 1/2. 

By Exercise 6.2 or 7.27, there exists b € F* such that B = (K, b | F). Recalling the 
first paragraph above, since B is a division algebra, we have b ¢ Nmx\r(K*) = Ra. 
Applying Exercise 6.4, we may multiply b by a norm from K~, so we may suppose 
b =z, and therefore B = (K, m | F). This concludes the proof of (a). 

We turn now to (b), with B = (K,a|F)=K+Kj with j? =a.Leta=ut+vjeB 
with u,v € K. Then nrd(av) = Nmx)r(u) — a Nmx r(v) = x — zy with x,y € F and 
v(x) even and v(zry) odd (as norms from K). By the ultrametric inequality, we have 
w(q@) = v(nrd(a)) = 0 if and only if v(x), v(y) = 0 if and only if u,v € S (as S is the 
valuation ring of K). Therefore O = S + Sj. Since j? = z, we have w(j) = 1/2, so 
jO = P. The remaining statements were proven in the course of proving (a). o 


Corollary 13.3.14. Let F + C be a local field. Let K be the unramified quadratic 
extension of F, with (o) = Gal(K | F). Then the F-subalgebra 


B= E a zu vE x} c M2(K) 


is the unique division quaternion algebra over F (up to isomorphism). 
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Proof. Using Theorem 13.3.11(a), we split B over K as in 2.3.4. (We may also put 7 
below the diagonal as in 2.3.12.) o 


13.4 Consequences 
We now observe a few consequences of Theorem 13.3.11. 


Corollary 13.4.1. Let F be a nonarchimedean local field with valuation v, let K be 
a separable, unramified quadratic F-algebra, and let B = (K,b | F) with b € F*. If 
v(b) = 0, then B ~ Mo(F). 


Proof. Either K = F x F or K 3 F is the unique unramified quadratic field extension. 
In the first case, K has a zerodivisor so B ~ Mo(F). In the second case, we conclude 
as in the first paragraph of the proof of Theorem 13.3.11, since b € R* < Nmxr(K*). 

o 


13.4.2. Let B be a division quaternion algebra over F. In analogy with the case of field 
extensions (13.2.4), we define the ramification index of B over F as e(B|F) = 2 since 
P? = nO, and the inertial degree of B over F as f(B|F) = 2 since B contains the 
unramified quadratic extension K of F, and note the equality 


e(B|F)f (BIF) = 4 = [B : F], 


as in the commutative case. (Viewed in this way, B is obtained from first an unramified 
extension and then a “noncommutative” ramified extension.) 


Remark 13.4.3. Theorem 13.3.11, the fundamental result describing division quater- 
nion algebras over a local field, is a special case of a more general result as follows. 
Let R be a complete DVR with maximal ideal p = 7R and F := Frac(R). 

Let D be a (finite-dimensional) division algebra over F, and let O € D be the 
valuation ring and P C O the maximal ideal. Then P? = pO for some e > 1, called the 
ramification index; the quotient O/P is a division algebra over the field k = R/p, and 
we let the inertial degree be f = dim,(O/P). Then ef = dimp D = n?; moreover, 
if k is finite (F is a local field), then e = f = n. For a proof, see Exercise 13.10; or 
consult Reiner [Rei2003, Theorems 12.8, 13.3, 14.3]. However, the uniqueness of D 
up to F-algebra isomorphism no longer holds. If F is a local field, then the possibilities 
for D are classified up to isomorphism by a local invariant inv D € ( +Z)/Z = Z/nZ. 
These patch together to give a global result: see Remark 14.6.10. 

This classification can be further extended to an arbitrary central simple algebra 
B ~M,,(D) over F: see Reiner [Rei2003, §17-18]. 


Splitting of local division quaternion algebras over extension fields is given by the 
following simple criterion. 


Proposition 13.4.4. Let B be a division quaternion algebra over a local field F, and 
let L be a separable field extension of F of finite degree. Then L is a splitting field for 
B if and only if [L : F] is even. 
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Proof. If F is archimedean, then either F = C and there is no such L, or F = R and 
B = Hand L = C, and the result holds. So suppose F is nonarchimedean. We have 
B ~ (K,z | F) where K is the unramified quadratic extension of F. Let e, f be the 
ramification index and inertial degree of L, respectively. Then [L : F] =n = ef, and 
n is even if and only if e is even or f is even. But f is even if and only if L contains 
an unramified quadratic subextension, necessarily isomorphic to K; but then K splits 
B so L splits B. 

Having established the claim when f is even, suppose that f is odd. Then L is 
linearly disjoint from K and K 8r L = KL is the unramified quadratic extension of L. 
Therefore B &p L ~ (KL,z | L). Let Rz be the valuation ring of L and let my, be a 
uniformizer for L. Then Nmxz/_(KL*) = Rn. We have m = um; for some u € RX. 
Putting these together, we see that B @F L is split if and only if x is a norm from KL 
if and only if e is even. o 


As a consequence, B contains every separable quadratic extension of F. 


Corollary 13.4.5. If B is a division quaternion algebra over a local field F and K 2 F 
is a separable quadratic field extension, then K => B. 


Proof. Combine Proposition 13.4.4 with Lemmas 5.4.7 and 6.4.12. o 


We repeat now Lemma 12.4.6, giving a proof that works without restriction on 
characteristic. 


Corollary 13.4.6. [f char F + 2, then the Hilbert symbol defines a symmetric, nonde- 
generate bilinear form on F*/F*?. 


Proof. Let K := F[x]/ (x? — a). The Hilbert symbol gives a well-defined map of sets 
FX /F*? = {+1} 
be (a, b)r 


and we may conclude as in Lemma 12.4.6 if we show that this is a nontrivial group 
homomorphism. 

First we show it is nontrivial. By Corollary 13.4.5, the field K embeds in the division 
quaternion algebra B, so by Exercise 2.5, there exists b such that B = (a, b | F), whence 
(a, b)p = —1. 

Next, we show it is a homomorphism. We appeal to Main Theorem 5.4.4. We 
have (a,b)r = 1 if and only if b € Nmgjr(K*). So we reduce to showing that if 
(a, b)r = (a, b')r = —1 for b, b’ € F*, then (a, bb’)r = 1. But by Corollary 7.7.6, since 
there is a unique division quaternion algebra, we conclude that b/b’ € Nmx\r(K*); 
thus bb! = (b’)?(b/b’) € Nmx\r(K%) and (a, bb! | F) = Mo(F) so (a, bb’)p = 1, as 
claimed. o 


Remark 13.4.7. The proof of Corollary 13.4.6 (pairing with any F*/F*%?) shows that 
F*/Nmxjr(K*) = Z/2Z. (13.4.8) 
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Conversely, if we know (13.4.8) then the properties of the Hilbert symbol are immedi- 
ate. Although it was not hard to prove (13.4.8) when char k + 2, to establish its truth 
when chark = 2, one is led to study higher ramification groups (e.g. Serre [Ser79, 
Chapter XV]) eventually leading to local class field theory. 


The norm groups played an important role in the proof above, so we conclude by 
recording the image of the reduced norm nrd(B*) C FX. 


Lemma 13.4.9. We have 
Ro ifB =H; 


F*, otherwise. 


nrd(B™) = | 


Moreover, if F is nonarchimedeanandO C B is a maximal R-order, then nrd(O*) = 
R*. 


Proof. If B = M}(F) is split, then nrd(B%) = det(GL2(F)) = F*. So suppose B is a 
division algebra. If B ~ H then nrd(B*) = R%„, so we suppose F is nonarchimedean. 
Then B ~ (K,z | F) where as above K is the unramified quadratic extension of F 
and z is a uniformizer. But F* = R* x (x), and nrd(K*) = Nmxjr(K*) = R* xn and 
nrd(j) = x. The result then follows by multiplicativity of the norm. 

The second statement follows similarly: if B ~ Mo(F) then O =~ Mo(R) and 
nrd(O*) = det(GL2(R)) = R*; otherwise O = (S, 2 | R) where S is the ring of integers 
of K, and nrd(S*) = Nmxjr(S*) 2 R* and again nrd(j) = z. o 


13.5 Some topology 


In this section, we dive into the basic topological adjectives relevant to the objects we 
have seen and that will continue to play an important role. Throughout, let F be a local 
field. 


13.5.1. F is a locally compact topological field (by definition) but F is not itself 
compact. The subgroup F* = F\{0} is given the topology induced from the embedding 
FX œ FXF 
x= (x,x7"); 


it turns out here that this coincides with the subspace topology F* c F (see Exercise 
13.14(a)). Visibly, F* is open in F so F* is locally compact. 

If F is nonarchimedean, with valuation ring R and valuation v, then F* is totally 
disconnected and further 


R* = {xE R: v(x) =0 CR 


is closed so is a topological abelian group that is compact (and totally disconnected). 


Now let B be a finite-dimensional F-algebra. 
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13.5.2. As an F-vector space, B has a unique topology compatible with the topology on 
F as all norms on a topological vector F-space extending the norm on F are equivalent 
(the sup norm is equivalent to the sum of squares norm, etc.): see Exercise 13.12. In 
particular, two elements are close in the topology on B if and only if their coefficients 
are close with respect to a (fixed) basis: for example, two matrices in M,,(F) are close 
if and only if all of their coordinate entries are close. (Of course, the precise notion 
of “close” depends on the choice of norm.) Consequently, B is a complete, locally 
compact topological ring, taking a compact neighborhood in each coordinate. 


13.5.3. The group B* is a topological group, with the topology given by the embedding 
BX > œ m (a,a7!) € B x B. This topology coincides with the subspace topology 
(see Exercise 13.14(b)). From this, we can see that B* is locally compact: the norm 
Nmagjr: B* — F* is a continuous map, so BX = Nmgjp(F*) is open in B, and an 
open subset of a Hausdorff, locally compact space is locally compact in the subspace 
topology (Exercise 13.14(c)). 


Example 13.5.4. If B = M,,(F), then B* = GL,(F) is locally compact: a closed, 
bounded neighborhood that avoids the locus of matrices with determinant 0 is a 
compact neighborhood. When F is archimedean, this is quite visual: a matrix of 
nonzero determinant is at some finite distance away from the determinant zero locus! 
Note however that GL,,(F) is not itself compact: already F* = GL; (F) is not compact. 


Now suppose F is nonarchimedean with valuation v and valuation ring R. 


13.5.5. We claim that R is the maximal compact subring of F. Indeed, x € F lies 
in a compact subring if and only if v(x) > 0 if and only if x is integral over R. The 
only new implication here is the statement that if v(x) < 0 then x does not lie in a 
compact subring, and that is because the sequence x,, = x” does not have a convergent 
subsequence as |xn| > ov. 


Next, let O be an R-order in B. 


13.5.6. Choosing an R-basis, we have an isomorphism O ~ R”, and this isomorphism 
is also a homeomorphism. Therefore, O is compact as the Cartesian power of a 
compact set. The group O% is therefore also compact because it is closed: for y € O, 
we have y € O* if and only if Nmgjr(y) € R*, the norm map is continuous, and 
R* = {x € R: v(x) = 0} C Ris closed. 


Example 13.5.7. For R = Zp C F = Qp and B = M,(Q,), the order O = M, (Zp) is 
compact (neighborhoods of a matrix can be taken as neighbhoods in each coordinate) 
and the subgroup O* = GL,,(Z,) is compact: there is no way to “run off to infinity”, 
either in a single coordinate or via the determinant. 


13.5.8. Suppose B = D is a division algebra. Then the valuation ring O is the maximal 
compact subring of B, for the same reason as in the commutative case (see 13.5.5, 
details requested in Exercise 13.17(a)). There is a filtration 


ODP DP*>... 
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giving rise to a filtration 
OF S14P 514" 5 cies (13.5.9) 


As in the second proof of Main Theorem 12.3.2, the quotient O/P is a finite extension 
of the finite residue field k, so (O/P)* is a finite cyclic group. The maximal two-sided 
ideal P is principal, generated by an element j of minimal valuation, and multiplication 
by j” gives an isomorphism O/P -æ P” /P”+! of k-vector spaces (or abelian groups) 
for alln > 1. 

Furthermore, for each n > 1, there is an isomorphism of groups 


O/P = P"/P™*! = (1+ P")/(1 +P") 


arPlta. 


(13.5.10) 


Therefore, O* = lim (O/P")* is a projective limit of solvable groups, also called a 
prosolvable group. 


Example 13.5.11. If B is a division quaternion algebra over Qp, with valuation ring 
O and maximal ideal P, then the filtration (13.5.9) has quotients isomorphic to O/P = 
F 2. 


P 


13.5.12. We will also want to consider norm 1 groups; for this, we suppose that B is a 
semisimple algebra. Let 
B! := {a € B: nrd(a) = 1}; 


some authors also write SL (B) := B!. Then B! is a closed subgroup of B*, since the 
reduced norm is continuous. 

If B is a division ring and F is archimedean, then B ~ H and B! ~ H! ~ SU(2) is 
compact (it is identified with the 3-sphere in R4). In a similar way, if B is a division ring 
and F is nonarchimedean, then B! is compact: for B has a valuation v and valuation 
ring O, and if œ € B has nrd(a@) = 1 then v(a@) = 0 and a € O, and consequently 
B! C O* is closed in a compact set so compact. 

If B is not a division ring, then either B is the product of two algebras or B is a 
matrix ring over a division ring, and in either case B! is not compact. 


Remark 13.5.13. The locally compact division algebras over a nonarchimedean field 
are necessarily totally disconnected. On the other hand, it is a theorem of Pontryagin 
[War89, Theorem 27.2] that if A is a connected locally compact division ring, then A 
is isomorphic as a topological ring to either R, C, or H. 


Exercises 


—1,-1 
1. Let B := 7 ; 
| Q 


(a) Show that B is a division ring that is complete with respect to the discrete 
valuation w defined by w(t + xi + yj + zij) = v(t? + x? + y? + 2’) for 
t,x, y,z€Q. 
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(b) Prove that 
l+itjtij 


O := Z2 @ 22i ® Z2j © Z2 7 


cB 


is the valuation ring of B. 


2. Let B be a division quaternion algebra over a nonarchimedean local field F. Give 
another proof that the unramified quadratic extension K of F embeds in B as 
follows. 


Suppose it does not: then for all œ € O, the extension F(a) 2 F is ramified, 
so there exists a € R such that œ -a € P N K(a); let P = jO and write 
«æ = œo =a + ja, and iterate to conclude that a = Doo anj” with an € R. But 
F(j) is complete so O € F(j), a contradiction. 


3. Let F be a local field with F # C, let K the unramified (separable) quadratic 
extension of F (take K = C if F = R), and let (o) = Gal(K | F), so that o is the 
standard involution on K. Let B be a division quaternion algebra B over F. 


Show that 
s= || S (a) 0b EK) EMW, 


nolb) o 


[Hint: Compute the regular representation 2.3.8.] Identify the maximal order O 
its maximal ideal J under this identification. 


4. Let B be a division quaternion algebra over F. Show that œ € B is integral over 
R if and only if nrd(q@), nrd(œ + 1) € R if and only if w(@), w(@ + 1) = 0, where 
w is the valuation on B. 


5. Extend Theorem 13.3.11 as follows. Let R be a complete DVR with field of 
fractions F, and let B be a quaternion division algebra over F. Show that B ~ 


f 


b 
| where K 2 F is an unramified separable quadratic extension of F and 


b ¢ Nmgjr(K*). 


6. Let F be a nonarchimedean local field with residue field k having char k + 2, 
and let K 2 F be a separable quadratic field extension. 


a) Let b € F*. Show that if K is unramified then b € Nmgjr (K*) if and only 
if v(b) is even; and if K = F(a) is ramified, then b € Nmgjr(K*) if and 
only if b = c? or b = —ac? for some c € F*?. 


b) Deduce [F* : Nmgır(K*)] = 2 and Corollary 13.4.6 in this case. 


7. Let R := Q{[f]] be the ring of formal power series over Q; then R is a complete 
DVR with fraction field F = Q((t)), the Laurent series over Q. Let Bo := (a, b | Q) 
be a division quaternion algebra over Q, and let B := Bo &Q F = (a,b | F). 
Show that B is a division quaternion algebra over F, with valuation ring O := 
R+Ri+ Rj + Rij. 
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8. Let B be a division quaternion algebra over a nonarchimedean local field F, and 
let O be the valuation ring. 


(a) Show that every one-sided (left or right) ideal of O is a power of the 
maximal ideal P and hence is two-sided. 
(b) Let 
[O, O] := (aB — Ba: a, B € O) 
be the commutator ideal [O, O] of O, the two-sided ideal generated by 
commutators of elements of O. Show that P = [O, O]. 


9. Let F be a nonarchimedean local field, let B = M2(F) and O = M2(R). Show 
that there are q + 1 right O-ideals of norm p corresponding to the elements of 
P!(k) or equivalently the lines in k?. 


10. Give another proof of Lemma 13.4.9 using quadratic forms. 


11. Let F be a nonarchimedean local field with valuation ring R, maximal ideal 
p, and residue field k. Let D be a division algebra over F with dimp D = n, 
with valuation ring O and maximal two-sided ideal P. Show that O/P is finite 
extension of k of degree n, and P” = pO (cf. Remark 13.4.3). 


12. Show that (13.5.10) is an isomorphism of (abelian) groups. 
13. Let F be a field with absolute value | | and V a finite-dimensional F-vector space. 
(a) Let x1, ..., Xn be a basis for V, and define 
lay x] + +++ + anXnl| := max(laıl, . -, lanl) 
for a; € F. Show that V is a metric space with distance d(x, y) = ||x — yll. 


(b) Show that the topology on V is independent of the choice of basis in (a). 


(c) Finally, show that if F is complete with respect to | |, then V is also complete. 


14. Let F be a topological field. Show that the coarsest topology (fewest open sets) in 
which multiplication on M,,(F) is continuous is given by the coordinate topology. 


15. Let F be a local field. 


(a) The group F™ has the structure of topological group under the embedding 
x m (x, x7!) € F x F (under the subspace topology in F x F). Show that 
this topology coincides with the subspace topology F* c F. 


(b) More generally, let B be a finite-dimensional F-algebra. Show that the the 
topology on B* induced by a > (a,a~!) € B x B coincides with the 
subspace topology B* C B. 


(c) Show that an open subset of a Hausdorff, locally compact space is locally 
compact in the subspace topology. 


16. Let F be a finite extension of Q2. Show that (—1, —1)p = (-1)[F "Q2, 
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17. Finish the proof of Lemma 13.2.9. 


18. Let D be a division algebra over a nonarchimedean local field F. We recall (see 
13.5.2) that D is a complete, locally compact topological ring. 


(a) Verify (as in 13.5.5) that O is the maximal compact subring of B. 
(b) Show that B*/F™ is a compact topological group. 


This chapter is licensed under the terms of the Creative Commons Attribution- 
NonCommercial 4.0 International License (http://creativecommons.org/licenses/by- 
nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and 
reproduction in any medium or format, as long as you give appropriate credit to the 
original author(s) and the source, provide a link to the Creative Commons license and 
indicate if changes were made. 

The images or other third party material in this chapter are included in the chapter’s 
Creative Commons license, unless indicated otherwise in a credit line to the material. If 
material is not included in the chapter’s Creative Commons license and your intended 
use is not permitted by statutory regulation or exceeds the permitted use, you will need 
to obtain permission directly from the copyright holder. 
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Chapter 14 


Quaternion algebras over global fields 


In this chapter, we discuss quaternion algebras over global fields and characterize them 
up to isomorphism. 


14.1 > Ramification 


To motivate the classification of quaternion algebras over Q, we consider by analogy a 
classification of quadratic fields. We restrict to the following class of quadratic fields 
for the best analogy. 


Definition 14.1.1. A quadratic field F = Q(Vd) of discriminant d € Z is mildly 
ramified if 8 { d. 


A quadratic field F is mildly ramified if and only if F = Q(ym) where m + 1 is 
odd and squarefree; then d = m or d = 4m according as m = 1,3 (mod 4). 

Let F = Q(Vd) be a mildly ramified quadratic field of discriminant d € Z and let 
R be its ring of integers. A prime p ramifies in F, i.e. pR = p* for a prime ideal p C R, 
if and only if p | d. 

But a discriminant d can be either positive or negative; to put this bit of data on 
the same footing, we define the set of places of Q to be the primes together with the 
symbol co, and we make the convention that oo ramifies in F if d < 0 and is unramified 
if d > 0. Let F = QNA) be a mildly ramified quadratic field, and let Ram(F) be the 
set of places that ramify in F. The set Ram(F’) determines F up to isomorphism, since 
the discriminant of F is the product of the odd primes in Ram(F), multiplied by 4 if 
2 e Ram(F) and by —1 if co € Ram(F). (For bookkeeping reasons, in this context it 
would probably therefore be better to consider 4 and —1 as primes, but we will resist 
the inducement here.) However, not every finite set of places 2 occurs: the product d 
corresponding to È is a discriminant if and only if d = 0,1 (mod 4). We call this a 
parity condition on the set of ramifying places of a mildly ramified quadratic field: 


2¢€2Z «© there are an odd number of places in X congruent to —1 (mod 4) 


with the convention that oo is congruent to —1 (mod 4). 
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Note that if È is a finite subset of places of Q and 2 ¢ Ł, then precisely one of 
either È or È U {ov} satisfies the parity condition; accordingly, if we define m(X) to be 
the product of all odd primes in X multiplied by —1 if co € X, then we can recover & 
from m(X). 

We have proven the following result. 


Lemma 14.1.2. The maps F œ> Ram(F) and © =œ m(X) furnishes a bijection 


Mildly ramified quadratic fields Finite subsets of places of Q 
oa r 
Q(Vd) up to isomorphism satisfying the parity condition 
= ican odd es 
m#1 ` 


This classification procedure using sets of ramifying primes and discriminants 
works as well for quaternion algebras over Q. Let B be a quaternion algebra over 
Q. When is a prime p ramified in B? In Chapter 12, we saw that the completion 
Bp = B &Q Q, is either a division ring or the matrix ring M>(Q,,). Further, when Bp 
is a division ring, the valuation ring Op C Bp is the unique maximal order, and the 
unique maximal ideal P, C Op satisfies pO, = PZ, By analogy with the quadratic 
case, we say that a place v is ramified in B if the completion B, is a division ring, and 
otherwise v is unramified (or split). 


b 
Let B = |“ } Without loss of generality, we may suppose a, b € Z. There are 


only finitely many places where B is ramified: by the calculation of the Hilbert symbol 
(12.4.12), if p is prime and p { 2ab, then (a, b)g, = 1 and p is split in B. Therefore 
#Ram B < o. 

We say that B is definite if co € Ram B and B is indefinite otherwise. By definition, 


„b 
B is definite if and only if Bo := B&Q R = (% ~ Hif and only if a, b < 0 (Exercise 


2.4). 

Let Ram B be the set of ramified places of B. Not every finite subset È of places 
can occur as Ram B for a quaternion algebra B. It turns out that the parity condition 
here is that we must have #2 even. So again, if È is a finite set of primes, then precisely 
one of either È or È U {co} can occur as Ram B. We define the discriminant of B to be 
the product disc B of primes that ramify in B, so disc B is a squarefree positive integer. 
This notion of discriminant is admittedly strange; we relate it to perhaps more familiar 
notions in Chapter 15. 

The main result of this chapter, specialized to the case F = Q, is the following. 


Main Theorem 14.1.3. The maps B +> Ram B and + [| pex p furnish bijections 


Quaternion algebras over Q ats Finite subsets of places of Q 
up to isomorphism of even cardinality 


© {D E Zs squarefree } . 


The composition of these maps is B > || peram g P = disc B. 


14.2. > HILBERT RECIPROCITY OVER THE RATIONALS 219 


As previewed at the end of section 12.1, Main Theorem 14.1.3 is a local-global 
principle and provides a convenient way to test when quaternion algebras over Q are 
isomorphic: instead of working hard over Q, we can just test for isomorphism over the 
local fields Q, and R. 

We will spend the next two sections giving a self-contained proof of Main Theo- 
rem 14.1.3 following Serre [Ser73, Chapters I-IV], assuming two statements from 
basic number theory (quadratic reciprocity and the existence of primes in arithmetic 
progression), finishing the proof in section 14.3. Although the proofs presented do not 
seem to generalize beyond F = Q, the argument is simple enough and its structure 
is good motivation for the more involved treatment in the Chapter ahead. (It is also 
comforting to see a complete proof in the simplest case.) 


14.2 > Hilbert reciprocity over the rationals 


To begin, we look into the parity condition: it has a simple reformulation in terms of the 
Hilbert symbol (section 12.4). Fora place v of Q, let Q, denote the completion of Q at 
the absolute value associated to v: if v = p is prime, then Q, = Q, is the field of p-adic 
numbers; if v = co, then Q, = R. For a, b € Q*, we abbreviate (a, b)g, = (a, b)y. 


Proposition 14.2.1. (Hilbert reciprocity). For all a,b € Q*, we have 
| |@ b), = 1, (14.2.2) 


where the product taken over all places v of Q. 


When p is odd and divides neither numerator nor denominator of a or b, we have 
(a, b),p = 1, so the product (14.2.2) is well-defined. The following corollary is an 
equivalent statement. 


Corollary 14.2.3. Let B be a quaternion algebra over Q. Then the set Ram B is finite 
of even cardinality. 


The law of Hilbert reciprocity, as it turns out, is a core premise in number theory: 
it is equivalent to the law of quadratic reciprocity 


(2)(2) eae (14.2.4) 
q| \p 
for odd primes p, q together with the supplement 
-1 z 2 2- 
(= =(-1)F and G) =(-1) (14.2.5) 
p p 


for odd primes p. 
We now give a proof of Hilbert reciprocity (Proposition 14.2.1), assuming the law 
of quadratic reciprocity and its supplement. 
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Proof of Proposition 14.2.1. Since each local Hilbert symbol is bilinear, it suffices to 
prove the statement when a,b € Z are equal to either —1 or a prime number. The 
Hilbert symbol is also symmetric, so we may interchange a, b. 

a,b -1, -1 

Ifa = b = -1, then B = |—]| = 

(—1,-1)» = (—1,—-1) = -1 and (-1,-1), = 1 if v + 2,00, by the computation of 
the even Hilbert symbol (12.4.13). Similarly, the cases with a = —1, 2 follow from the 
supplement (14.2.5), and are requested in Exercise 14.1. 


is the rational Hamiltonians, and 


> =], 
So we may suppose a = p and b = q are primes. If p = q then (22) z= (35) 


and we reduce to the previous case, so we may suppose p + q. Since p,q > 0, we have 
(p, q)» = 1. We have (p, q)e = 1 for all primes £ | 2pq, and 


(P,Dp = (4; P)p = (2) and (p,q)q = (2) 


by 12.4.12. Finally, 
(p,q) = —1 if and only if p,q = 3 (mod 4) 


i.e., (p,q) = (-1)?-PE-Y/4, again by the computation of the even Hilbert symbol 
(12.4.13). Thus the product becomes 


| [e qx = cayortre-n(2)(2) = 
v q p 


by quadratic reciprocity. o 


Hilbert reciprocity has several aesthetic advantages over the law of quadratic reci- 
procity. For one, it is simpler to write down! Also, Hilbert believed that his reciprocity 
law is a kind of analogue of Cauchy’s integral theorem, expressing an integral as a 
sum of residues (Remark 14.6.4). The fact that a normalized product over all places is 
trivial also arises quite naturally: if we define for x € Q* and a prime p the normalized 
absolute value 


|x lp = p™ o, 


and |x|.. the usual archimedean absolute value, then 
| [bb =1 
v 


by unique factorization in Z; this is called the product formula for Q, for obvious 
reasons. 

From the tight relationship between quaternion algebras and ternary quadratic 
forms, we obtain the following corollary. 


Corollary 14.2.6. Let Q be a nondegenerate ternary quadratic form over Q. Then the 
set of places v such that Q, is anisotropic is finite and of even cardinality. 
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In particular, by Corollary 14.2.6, if Q, is isotropic for all but one place v of Q, 
then Q,, is in fact isotropic for all places v. 


Proof. In the bijection implied by Main Theorem 5.2.5, the quadratic form Q cor- 
responds to a quaternion algebra B = (a,b | Q), and by Main Theorem 5.4.4, Q is 
isotropic if and only if B is split if and only if (a, b)g = 1. By functoriality, the same is 
true over each completion Q, for v a place of Q, and therefore the set of places v where 
Q,, is isotropic is precisely the set of ramified places in B. The result then follows by 
Hilbert reciprocity. o 


To conclude this section, we show that every allowable product of Hilbert symbols 
is obtained. 


Proposition 14.2.7. Let X be a finite set of places of Q of even cardinality. Then there 
exists a quaternion algebra B over Q with Ram B = È. 


Remark 14.2.8. Albert [Alb34, Theorem 2, Theorem 3] already sought to simplify 
the presentation of a quaternion algebra by a series of transformations, the content 
of which is contained in Proposition 14.2.7; this was further investigated by Latimer 
[Lat35]. 


Just as with Hilbert reciprocity, Proposition 14.2.7 touches on a deep statement in 
number theory concerning primes, due to Dirichlet. 


Theorem 14.2.9. (Infinitude of primes in arithmetic progression). Given a,n € Z 
coprime, there are infinitely many primes p = a (mod n). 


Proof. See e.g. Serre [Ser73, Chapter VI] or Apostol [Apo76, Chapter 7]. We will 
prove this theorem in Exercise 26.11 as a consequence of the analytic class number 
formula. o 


Remark 14.2.10. Theorem 14.2.9 seems to require analysis. (For algebraic proofs in 
special cases, see e.g., Neukirch [Neu99, Exercise I.10.1] and Lenstra—Stevenhagen 
[LS91].) Ram Murty [Mur88] showed that a “Euclidean proof” of the infinitude of 
primes p = a (mod n) is possible if and only if a? = 1 (mod n), and Paul Pollack 
[Pol2010] has shown that Schnizel’s Hypothesis H gives a heuristic for this. This 
crucial role played by analytic methods motivates part III of this monograph. 


We now prove Proposition 14.2.7 assuming Theorem 14.2.9. 


Proof. Let D := [[pex p be the product of the primes in &, and let u := —1 if œ € X 
and u := 1 otherwise. Let D? := uD. We consider quaternion algebras of the form 


7 q°, D? 
B= Q 


with q? = uq (and q prime) chosen to satisfy certain congruence conditions ensuring 
that Ram B = È. To this end, we seek a prime q such that 


> 
(<] = —1 for all odd p | D (14.2.11) 
p 
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and 


hee if 24D; mas 


5(mod8), if2]D. 


There exists a prime satisfying the conditions (14.2.11)—(14.2.12) by Theorem 14.2.9, 
since the condition to be a quadratic nonresidue is a congruence condition on q? and 
hence on q modulo p. 
We now verify that B has Ram B = &. We have (qg°, D®)o. = u by choice of signs 
and (q°, D°)p = 1 for all p { 2dg. We compute that 
> 
(4°, D°)p = (Ż) =-1 forall odd p | D 
P 
by (14.2.11). For p = 2, we find that (q°, D?) = —1 or (q?, D?) = 1 according as 
2 | D or not by the computation of the even Hilbert symbol (12.4.13). This shows that 


x C RamB CÈU {q}. 


The final symbol (q°, D®), is determined by Hilbert reciprocity (Proposition 14.2.1): 
since #Z is already even, we must have (q°, D°), = 1. Therefore the quaternion algebra 


? D? 
B := [= | pas = Ram. (m 


Example 14.2.13. Let B = (a,b | Q) be a quaternion algebra of prime discriminant 
D = p over Q. Then: 


(i) For D = p = 2, we take a = b = —1; 
(ii) For D = p = 3 (mod 4), we take b = —p and a = -1; 
(iii) For D = p = 1 (mod 4), we take b = —p and a = —q where q = 3 (mod 4) is 


prime and (2) =-1. 
p 


In case (iii), by qudaratic reciprocity (2) = -(4) = | so indeed B is not ramified at 
P 


p. In the proof of Theorem 14.2.7 above, we would have required the more restrictive 
condition g = 3 (mod 8), but we can look again at the table of even Hilbert symbols 
(12.4.16): since b = —p = —1,3 (mod 8), we may take a = —q = 1, —3 (mod 8) freely, 
so q = 3 (mod 4). 

Similarly, for discriminant D the product of two (distinct) primes: 


(i) For D = 2p with p = 3 (mod 4), we take a = —1 and b = p; 
(ii) For D = 2p with p = 5 (mod 8), we take a = 2 and b = p; 
(iii) For D = pq with p = q = 3 (mod 4), we take a = —1 and b = pq; 
(iv) For D = pq with p = 1 (mod 4) or q = 1 (mod 4) and (2) + 1, we take a = p 
P 
and b = q. 
For other explicit presentations of quaternion algebras over Q with specified dis- 


criminant, see Alsina—Bayer [AB2004, §1.1.2]. See Example 15.5.7 for some explicit 
maximal orders. 
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14.3 > Hasse—Minkowski theorem over the rationals 


To complete the proof of Main Theorem 14.1.3, we now show that the map B +> Ram B 
is injective on isomorphism classes. 


Proposition 14.3.1. Let B, B’ be quaternion algebras over Q. Then the following are 
equivalent: 


(i) B = B’; 

(ii) Ram B = Ram B’; 
(iii) By = BY for all places v € P\(Q); and 
(iv) B, = By, for all but one place v. 


The statement of Proposition 14.3.1 is a local-global principle: the global isomor- 
phism class is determined by the local isomorphism classes. 


Corollary 14.3.2. Let B be a quaternion algebra over Q. Then B ~ Mo(Q) if and only 
if By ~ M2(Q,) for all primes p. 


Proof. Apply Proposition 14.3.1 (i) © (iv) with B’ = M»(Q) checking all but the 
archimedean place. o 


By the equivalence between quaternion algebras and quadratic forms (see Chapter 
5, specifically section 5.2), the statement of Proposition 14.3.1 is equivalent to the 
statement that a ternary quadratic form over Q is isotropic if and only if it is isotropic 
over all (but one) completions. In fact, the more general statement is true—and again 
we come in contact with a deep result in number theory. 


Theorem 14.3.3. (Hasse—Minkowski). Let Q be a quadratic form over Q. Then Q is 
isotropic if and only if Q, is isotropic for all places v of Q. 


We will prove the Hasse—Minkowski theorem by induction on the number of 
variables. Of particular interest is the case of (nondegenerate) ternary quadratic forms, 
for which we have the following theorem of Legendre. 


Theorem 14.3.4. (Legendre). Let a,b,c € Z be nonzero, squarefree integers that are 
relatively prime in pairs. Then the quadratic form 


ax’ + by* +cz* =0 
has a nontrivial solution x, y, z € Q if and only if a, b,c do not all have the same sign 


and 


—ab, —bc, —ac are quadratic residues modulo |c\, |a], |b|, respectively. 


Proof. First, the conditions for solvability are necessary. The condition on signs is 
necessary for a solution in R. If ax? + by? + cz? = 0 with x, y, z € Q not all zero, then 
scaling we may suppose x, y, z € Z satisfy gcd(x, y, z) = lsifp|cthenp { y (else p | x 
and p | z, contradiction), so (x/ yy? = (—b/a) (mod |c|) and —ba is a quadratic residue 
modulo |c|; the other conditions hold by symmetry. 
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So suppose the conditions hold. Multiplying through and rescaling by squares 
(Exercise 14.8), we may suppose a, b are squarefree (but not necessarily coprime) and 
c = —1, and we seek a nontrivial solution to ax? + by? = 2. If a € Q”, then we are 
done. Otherwise, we need to solve 


2 


z -ax aN) 


= b =Nmgyvay9 | ———— 
y? awe ( y 


for x, y,z € Q and y # 0, i.e., we need to show that b is a norm from F = Q(va). By 
hypothesis, a, b are not both negative and 


b is a square modulo |a| and a is a square modulo |b]. (14.3.5) 


We may also suppose |a] < |b]. 

We use complete induction on m = |a| + |b|. If m = 2, then we must consider the 
equation +x? + y? = z? with the case both negative signs excluded, each of which 
has solutions. Now suppose that m > 2 so |b| > 2, and let p | b be prime divisor. By 
hypothesis, there exist integers t, b’ such that £? = a + bb’; taking a small residue, we 
may suppose |t| < |b|/2. Thus 


bb’ = t? — a = Nmr olt + Va) 
so bb’ is a norm from F. Thus b is a norm if and only if b’ is a norm. But 


|b| 
o 1 
< I~ < |b| 


t-a 


b’| = |—— 
Ib" b 


because |b| > 2. 


Now write b’ = b”u? with b”,u € Z and b” squarefree. Then |b”| < |b’| < |b| and 
b” is a norm if and only if b’ is a norm. With these manipulations, we propagate the 
hypothesis that |a| is a square modulo |b’’| and |b’’| is a square modulo |a|. Therefore, 
the induction hypothesis applies to the equation ax? + by? = z?, and the proof is 
complete. o 


Corollary 14.3.6. Let Q be a nondegenerate ternary quadratic form over Q. Then Q 
is isotropic if and only if Q, is isotropic for all places v of Q (but one). 


Proof. If Q is isotropic, then Q, is isotropic for all v. For the converse, suppose that 
Q, is isotropic for all places v of Q. As in the proof of Legendre’s Theorem 14.3.4, we 
may suppose Q(x, y, z) = ax? + by? — z?. The fact that Q is isotropic over R implies 
that a, b are not both negative. Now let p | a be odd. The condition that Qp is isotropic 
is equivalent to (a, b)p = (b/p) = 1; putting these together, we conclude that b is a 
quadratic residue modulo |a|. The same holds for a, b interchanged, so (14.3.5) holds 
and the result follows. o 


We are now in a position to complete the proof of the Hasse—-Minkowski theorem. 
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Proof of Theorem 14.3.3. We follow Serre [Ser73, Theorem 8, §IV.3.2]. We may sup- 
pose that Q is nondegenerate in n > 1 variables. If n = 1, the statement is vacuous. If 
n = 2, the after scaling we may suppose Q(x, y) = x? — ay? with a € Q*; since Qp is 
isotropic for all primes p, we have a € (Glog so in particular v,(a) is even for all primes 
p; Since Q is isotropic at co, we have a > 0; thus by unique factorization a € Q**, and 
the result follows. If n = 3, the statement is proven in Corollary 14.3.6. 

Now suppose n > 4. Write Q = (a,b)s — Q’ where Q’ = (cj,...,Cn—2) and 
a,b,c; € Z. Let d = 2ab(cı ---Cy_2) # 0. For each prime p | d, since Q is isotropic, 
there exists tp € Q5 represented by both (a, b) and Q’ in Qp. (This requires a small 
argument, see Exercise 6.14.) Similarly, there exists təs € R* represented by these 
forms in R. 

By another application of the infinitude of primes in arithmetic progression (Exer- 
cise 14.10), there exists £ € Q* such that: 


(i) te tQ% for all primes p | d, 
(ii) t and tœ have the same sign, and 
(iii) p { t for all primes p { d except possibly for one prime q { d. 


Now the quadratic form (a, b, —t) is isotropic for all p | d and at œ by construction and 
at all primes p { d except p = q since p { abt. Therefore, by case n = 3 (using the “all 
but one” in Corollary 14.3.6), the form (a, b, —t} is isotropic. 

On the other side, if n = 4, then the form (t) Œ Q’ is isotropic by the same argument. 
If n > 5, then we apply the induction hypothesis to Q’: the hypothesis holds, since Q’ 
is isotropic at co and all p | d by construction, and for all p { d the completion Q; is 
a nondegenerate form in > 3 variables over Zp so is isotropic by the results of section 
12.3, using Hensel’s lemma to lift a solution modulo the odd prime p. 

Putting these two pieces together, we find that Q is isotropic over Q. o 


We conclude with the following consequence. 


Corollary 14.3.7. Let Q, Q’ be quadratic forms over Q in the same number of variables. 
Then Q = Q’ if and only if Q, ~ Q', for all places v. 


Proof. The implication (=) is immediate. We prove (4) by induction on the number 
of variables, the case of n = 0 variables being clear. By splitting the radical (4.3.9), 
we may suppose that Q, Q’ are nondegenerate. Let a € Q* be represented by Q. Since 
Q, = Q!, the quadratic form (—a) Œ Q’ is isotropic at v for all v, so Q’ represents a 
(Lemma 5.4.3). In both cases, we can write Q ~ (—a)#Q) and Q’ ~ (—a)#Q‘ for 
quadratic forms Q1, Q1 in one fewer number of variables. Finally, by Witt cancellation 
(Theorem 4.2.22), from Q, = Q', we have (Q1), = (Q'), for all v, so by induction 
Qı = Q, and thus Q = Q’. o 


We now officially complete our proofs. 


Proof of Proposition 14.3.1. The implications (i) => (ii) and (iii) = (iv) are imme- 
diate. For the implication (ii) = (iii): either v € Ram B, in which case B, = B’, 
is the unique division algebra over Q, (Theorem 12.1.5), or v ¢ Ram B, in which 
case B, ~ M2(Q,) = B’, by definition. For the implication (iv) > (i), recalling 
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Theorem 5.1.1, by Corollary 14.3.6 applied to the ternary quadratic form associated 
to B, we conclude that this form is isotropic, which by Proposition 5.1.2 implies that 
B = M,(Q). o 


Proof of Main Theorem 14.1.3. The map B +» Ram B has the desired codomain, by 
Hilbert reciprocity (Proposition 14.2.1); it is surjective by Proposition 14.2.7; and it 
is injective by Corollaries 14.3.6 and 14.3.7. The second bijection (with squarefree 
integers) is immediate. o 


To summarize these past few sections, the classification of quaternion algebras 
over Q embodies some deep statements in number theory: quadratic reciprocity (and 
its reformulation in Hilbert reciprocity), the Hasse-Minkowski theorem (the local- 
global principle for quadratic forms), and the proofs use the theorem of the infinitude 
of primes in arithmetic progression! It is a small blessing that we can make these 
essentially elementary arguments over Q. In the more general case, we must dig more 
deeply. 

For fun, we conclude this section with a consequence in number theory: Legen- 
dre’s three-square theorem (cf. Lagrange’s four-square theorem, Theorem 11.4.3, and 
Remark 11.4.4). 


Theorem 14.3.8. (Legendre—Gauss). An integer n > 0 can be written as the sum of 
three squares n = x? + y? + z? if and only if n is not of the form n = 4¢(8b + 7) with 
a,b € Z. 


Proof. Looking modulo 8, we see that the provided condition is necessary (Exercise 
14.3(a)). Conversely, suppose n > 0 is not of the form n = 4° (8b + 7), or equivalently 
that -n ¢ Q? (Exercise 14.4). We may suppose a = 0, 1. 

Let B = (-1,-1 | Q) be the rational Hamiltonians. We have Ram B = {2, co}, which 
is to say the associated ternary quadratic form x? + y? + z? is isotropic over Q, for all 
odd primes p. Consider the quadratic form Q(x, y, z, w) = x? + y? + z? — nw’. Then 
Q is isotropic over R since n > 0, and isotropic over all Q, with p odd taking w = 0. 
The form is also isotropic over Q2 (Exercise 14.3), lifting a solution modulo 8 via 
Hensel’s lemma. By the Hasse—Minkowski theorem (Theorem 14.3.3), Q is isotropic 
over Q, so there exist x, y, z, w € Q not all zero such that x + y? +z? = nw”. We must 
have w # 0 by positivity, and dividing through we get x, y, z € Q not all zero such that 
x? +y? +z? =n. Leta = xi + yjt+zij € B. Then @? +n = 0 and « € Bis integral. 

Let O’ C B be a maximal order containing «œ, and let O be the Hurwitz order. By 
Proposition 11.3.7, O’ is conjugate to O; after conjugating, we may suppose a € O. 
But trd(a) = 0, so necessarily a € Zi, j) and x, y, z € Z with nrd(a) = x? +y? +z? =n 


as desired. o 
See also Exercise 14.5 for a variant of the proof of the three-square theorem staying 


in the language of quaternions. 
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14.4 Global fields 


In this chapter and in many that remain, we focus on a certain class of fields of 
arithmetic interest: a global field is either a finite extension of Q (a number field) 
or of F,,(t) (a function field) for a prime p. Global fields are strongly governed by 
their completions with respect to nontrivial absolute values, which are local fields. 
Throughout this text, we will return to this theme that global behavior is governed by 
local behavior. 

For the rest of this chapter, let F be a global field. We quickly introduce in this 
section some basic notions from algebraic number theory: for further reference, see 
e.g. Neukirch [Neu99, Chapters I-l], Cassels [Cas2010, Chapter II], or Janusz [Jan96, 
Chapter IT]. 


Remark 14.4.1. When F is a function field, we will often insist that F is equipped 
with an inclusion Fo —> F where Fo ~ Fp(t) has pure transcendence degree 1 over 
Fp. (For the geometrically inclined, this corresponds to a morphism X — P! of the 
associated curves.) Often this inclusion will not play a role, but it will be important 
to treat certain aspects uniformly with the number field case where there is only one 
inclusion Q => F. 


14.4.2. The set of places of F is the set Pl F of equivalence classes of embeddings 
ly: F > F, where F, is a local field and (F) is dense in F,; two embeddings 
ty: F > F, and u,: F — F; are said to be equivalent if there is an isomorphism of 
topological fields ¢: F, — F’ such that vf, = go w. 


14.4.3. Every valuation v: F — R U {ov}, up to scaling, defines a place y: F > F, 
where v is the completion of F with respect to the absolute value induced by v; 
we call such a place nonarchimedean, and using this identification we will write 
v for both the place of F and the corresponding valuation. For a nonarchimedean 
place v corresponding to a local field F,, we denote by R, its valuation ring, pẹ, its 
maximal ideal, and k, its residue field. If F is a function field, then all places of F are 
nonarchimedean. If F is a number field, a place F —> R is called a real place and a 
place F — C (equivalent to its complex conjugate) is called a complex place. A real 
or complex place is archimedean. 


14.4.4. Let K 2 F be a finite, separable extension of fields, and let v € PI F. We say 
that a place w of K is above v if w|- = v, and we write w | v. The set of places w 
above v are obtained as follows: since K is separable, we have an isomorphism 


K Qf F, = K1 XX K, (14.4.5) 


where each K; 2 F, isa finite extension of local fields. Indeed, writing K = F[x]/(f(x)) 
with f(x) € F[x] the minimal polynomial of a primitive element, we have 


K 8r Fy = Fy[x]/(f) = RAAE x + x NG) 


where f(x) = fi(x) +- f-(x) € F[x] is the factorization of f(x) into irreducibles in 
F,[x], distinct because f is separable. Thus each K; is a local field by the classification 
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in Theorem 12.2.15, and the composition 
K= K8r F, > K; 


defines a place w; of K above v. Conversely, every place w above v is equivalent to w; 
for some i [Jan96, Chapter II, Theorem 5.1; Cas78, §9; Neu99, Chapter II, Proposition 
(8.3)]. 

We say that a nonarchimedean place v ramifies in K if there exists a place w | v 
such that Ky, 2 F, is ramified (see 13.2.3). Only finitely many places of F ramify in 
K. 


A global field F has a set of preferred embeddings u,,: F — F, corresponding to 
each place v € Pl F—equivalently, a preferred choice of absolute values ||, for each 
place v € PI F—such that the product formula holds: for all x € F*, 


[| Fe 1. (14.4.6) 
vePI F 
where m, = 2 if v is complex and m, = 1 otherwise. Admittedly, the extra exponents 
2 for the complex places are annoying (see Remark 12.2.3)! Often what is done is 
to define normalized absolute values ||x||, := |x|?” for v € PIF, so then (14.4.6) 
becomes 


[ | i =1. (14.4.7) 


vePI F 


Preferred absolute values are defined as follows. 


14.4.8. The set of places PI(Q) of Q consists of the archimedean real place, induced 
by the embedding Q —> R and the usual absolute value |x|, and the set of nonar- 
chimedean places indexed by the primes p given by the embeddings Q —> Qp, with 
the preferred absolute value 


—Vp (x) i 


IXlp =p 


The statement of the product formula for x € Q is 


le) [pees (14.4.9) 
p 


rearranging, (14.4.9) is equivalent to [],, p’e™) = |x|, and this follows from unique 
factorization in Z. 


14.4.10. The set of places of F(t) is indexed by monic irreducible polynomials f(t) € 
F,[t] with preferred absolute value 


xÐ = p Gee f) ordro) 


and 1/t, the place at infinity, with preferred absolute value 


deg x 


xij =p?" 
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where if x = f/g is the ratio of relatively prime polynomials f,g € Fp[t], then 
deg x := max(deg f, deg g). 
Then the statement of the product formula for x(t) € F p(t) is 


p~“ [leer = 1; (14.4.11) 
f 


rearranging as over Q, but now also taking the logarithm in base p, (14.4.9) is equivalent 
to») (deg f) ord¢(x) = deg x which follows from unique factorization in F,[t]. 


14.4.12. More generally, let K 2 F be a finite, separable extension of global fields. 
Let v be a place of F with a preferred absolute value and let w be a place of K above 
v. Then the preferred absolute value for w is the unique one extending v, namely 


1/[Kw Fy 
Ixl» = INmx,, r, (x) |p Aw?) 


for x € K. These absolute values fit together, with 


BES = |Nmgir Œx)ly (14.4.13) 


wlv 


for all x € K, a consequence of (14.4.5) [Jan96, Chapter II, Theorem 5.2; Cas78, §11, 
Theorem, p. 59; Neu99, Chapter I, Corollary (8.4)]. 

In particular, if F satisfies the product formula (14.4.6) with respect to preferred 
absolute values, then so does K, since 


| [ew = AT te = | [Nae = 1. (14.4.14) 


wily 


Remark 14.4.15. The definitions for the preferred absolute values are pretty dry— 
sorry! But we will see later that they are natural from the perspective of Haar measure: 
see section 29.3 and ultimately (29.6.3). 


We will also make use of the following notation in many places in the text. Let F 
be a global field. 


Definition 14.4.16. A set S C PI F is eligible if S is finite, nonempty, and contains all 
archimedean places of F. 


Definition 14.4.17. Let S be an eligible set of places. The ring of S-integers in F is 


the set 
Ris) := {x € F : v(x) = 0 forall v ¢ S}. (14.4.18) 


A global ring is a ring of S-integers in a global field for an associated eligible set S. 


The expression (14.4.18) makes sense, since if v ¢ S then by hypothesis v is 
nonarchimedean. When no confusion can result, we will abbreviate R = Rs) for a 
global ring R. 


Example 14.4.19. If F is a number field and S consists only of the archimedean places 
in F then Rs) is the ring of integers in F, the integral closure of Z in F, also denoted 
Ris) = Zr. If F is a function field, corresponding to a curve X, then Rs) is the ring of 
all rational functions with no poles outside S. (So in all cases, it is helpful to think of 
the ring Rs) as consisting of those elements of F with “no poles outside S”.) 
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14.5 Ramification and discriminant 


Let R = Ris) be a global ring, with S c PI F eligible. Let B be a quaternion algebra 
over F. 


Definition 14.5.1. Let v € PIF. We say that B is ramified at v if B, = B 8r F, isa 
division ring; otherwise we say that B is split (or unramified) at v. 
Let Ram B denote the set of ramified places of B. 


If v € PIF is a nonarchimedean place, corresponding to a prime p of R, we will 
also say that B is ramified at p when B is ramified at v. 


Remark 14.5.2. We use the term ramified for the following reason: if Bp is a divi- 
sion ring with valuation ring Op, then pOp = P? for a two-sided maximal ideal P: 
see Theorem 13.3.11. (Eichler [Eic55-56, §1, Theorem 4] called them characteristic 
primes.) 


Lemma 14.5.3. The set Ram B of ramified places of B is finite. 


Proof. Write B = (K,b | F). Since F has only finitely many archimedean places, we 
may suppose v is nonarchimedean. The extension K 2 F is ramified at only finitely 
many places, so we may suppose that K 2 F is unramified at v. Finally, v(b) = 0 for 
all but finitely many v, so we may suppose v(b) = 0. But then under these hypotheses, 
By = (Ky, b | Fy) is split, by Corollary 13.4.1. oO 


Motivated by the fact that the discriminant of a quadratic field extension is divisible 
by ramifying primes, we make the following definition. 


Definition 14.5.4. The R-discriminant of B is the R-ideal 
discr(B) = I] PER 


peRam B 
pgs 


obtained as the product of all primes p of R = Ris) ramified in B. 


Remark 14.5.5. When F is a number field and S consists of archimedean places only, 
so that R = Zr is the ring of integers of F, we abbreviate discr(B) = disc B. The 
discriminant discr(B) discards information about primes in S: only Ram B records 
information about B that is independent of S. 


Remark 14.5.6. One could make the same definitions when R is more generally a 
Dedekind domain. However, unless the residue fields of R are finite, this is not as 
useful a notion: see Exercise 14.14. (In some sense, this is because the Brauer group 
of F = Frac R is not as simply described as when F is a global field, viz. Remark 
14.6.10.) 


As usual, the archimedean places play a special role for number fields, so we make 
the following definition. 


Definition 14.5.7. Let F be a number field. We say that B is totally definite if all 
archimedean places of F are ramified in B; otherwise, we say B is indefinite. 
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14.5.8. If v is a complex place, then v is necessarily split, since the only quaternion 
algebra over C is Mo(C); therefore, if B is a totally definite quaternion algebra over a 
number field F, then F is totally real. 


14.6 Quaternion algebras over global fields 


We now generalize Main Theorem 14.1.3 to the global field F, deducing results 
characterizing isomorphism classes of quaternion algebras. The main result is as 
follows. 


Main Theorem 14.6.1. Let F be a global field. Then the map B > RamB gives a 
bijection 


Quaternion algebras over F Finite subsets of noncomplex places 
up to isomorphism of F of even cardinality 


In other words, if B is a quaternion algebra over a global field, then the set of places 
of F where B is ramified is finite and of even cardinality, this set uniquely determines 
B up to isomorphism, and every such set occurs. 


Proof. We give a proof in section 26.8, which itself relies on an analytic result (Theo- 
rem 26.8.19) proven in Chapter 29. 

Alternatively, this statement can also be viewed a direct consequence of a (hard- 
earned) fundamental exact sequence in class field theory: see Remark 14.6.10. o 


Recall the definition of the Hilbert symbol (as in section 12.4), computed explicitly 
for v an odd nonarchimedean place (12.4.9): for a place v of F, we abbreviate (a, b)r, = 
(a, b)y. We also recall Lemma 14.5.3 that (a, b),, = 1 for all but finitely many places v. 


Corollary 14.6.2. (Hilbert reciprocity). Let F be a global field with char F + 2 and 
let a,b € F*. Then 
I] (a, b)y = 1. (14.6.3) 
vePlF 


Proof. Immediate from Main Theorem 14.6.1: Hilbert reciprocity is equivalent to the 
statement that # Ram B is even. o 


Remark 14.6.4. Stating the reciprocity law in the form (14.6.3) is natural from the 
point of view of the product formula (14.4.6). And Hilbert reciprocity can be rightly 
seen as a law of quadratic reciprocity for number fields (as we saw in section 14.2 for 
F = Q). (For more, see Exercise 14.16.) 

Hilbert saw his reciprocity law (Corollary 14.6.2) as an analogue of Cauchy ’s inte- 
gral theorem [Hil32, p. 367-368]; for more on this analogy, see Vostokov [Vos2009]. 


Corollary 14.6.5. (Local-global principle for quaternion algebras). Let B, B’ be quater- 
nion algebras over F. Then the following are equivalent: 

Gi) B= B’; 

(ii) Ram B = Ram(B’); 
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(iii) By = BY for all places v € PI F; and 
(iv) B, = By, for all but one place v € PI F. 


In particular, B ~ M2(F) if and only if Ram B = @. 


Proof. For the equivalence (i) © (ii) © (iii), combine Main Theorem 14.6.1 and the 
fact that for a noncomplex place v there is a unique division algebra over Fy. The 
equivalence (iii) & (iv) follows from the parity constraint, since if v is a place and 
Ram B\ {v} = È, then v € Ram B or not according as #2 is odd or even. oO 


Remark 14.6.6. Corollary 14.6.5 is a special case of the Albert—Brauer—Hasse—Noether 
theorem [AH32, BHN31]: a central simple algebra A over F such that A, ~ M, (F,) 
for all v e PI F has A ~ M,,(F). See Remark 14.6.10 for further discussion. 


The statement of Corollary 14.6.5 is the local-global principle for quaternion 
algebras: the isomorphism class of a quaternion algebra over a global field is determined 
by its isomorphism classes over the collection of local fields obtained as completions 
of the global field. In a similar way, we have a local-global principle for quadratic 
embeddings as follows. 


Proposition 14.6.7. (Local-global principle for splitting/embeddings). Let K 2 F be 
a finite separable extension of global fields. Then the following are equivalent: 


(i) K splits B, i.e., B@p K ~ M2(K); and 
(ii) For all places w € PIK, the field Ky, splits B. 


If dimr K = 2, then these are further equivalent to: 


(iii) There is an embedding K —> B of F-algebras; 
(iv) For all places v € PIF, there is an embedding K, —> B, of F,,-algebras; and 
(v) Every v € Ram B does not split in K, i.e., K, is a field for all v € Ram B. 


Proof. The equivalence (i) © (ii) is a consequence of Corollary 14.6.5: they are both 
equivalent to Ram Bx = 0, since K splits B if and only if B &r K ~ M)(K) if and only 
if Ram(B 8r K) = 0 if and only if for all places w of K we have B @r Kw ~ M2(Ky). 

The equivalence (i) © (iii) was given by Lemmas 5.4.7 and 6.4.12. 

The implication (iii) > (iv) is clear. For the implication (iv) > (v), if v € Ram B, 
then B, is a division algebra; so if K, is not a field, then we cannot have K, => By. 
Finally, for (v) > (ii), let w € PIK with w | v € PIF. If v ¢ RamB then already F, 
splits B; otherwise, v € Ram B and K, = Kw is a field with [Kẹ : Fp] = 2, so by 
Proposition 13.4.4, Ky, splits B. o 


14.6.8. The equivalences (iii) © (iv) © (v) in Proposition 14.6.7 hold also for the 
separable F-algebra K = F x F: for there is an embedding F x F — B if and only if 
B = M} (F). 


We also record the statement of the Hasse—-Minkowski theorem over global fields, 
generalizing Theorem 14.3.3. 
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Theorem 14.6.9. (Hasse—Minkowski). Let F be a global field and let Q be a quadratic 
form over F. Then Q is isotropic over F if and only if Q, is isotropic over F, for all 
places v of F. 


Proof. The same comments as in the proof of Main Theorem 14.6.1 apply: we give a 
proof in section 26.8. But see also O’ Meara [O’Me73, §$65-66] for a standalone class 
field theory proof for the case when F is a number field. o 


This local-global principle for isotropy of quadratic forms is also called the Hasse 
principle. For a historical overview of the Hasse principle, and more generally Hasse’s 
contributions in the arithmetic theory of algebras, see Fenster-Schwärmer [FS2007]. 


Remark 14.6.10. The fact that quaternion algebras are classified by their ramification 
set (Main Theorem 14.6.1) over a global field F is a consequence of the following 
theorem from class field theory: there is an exact sequence 


0 > Br(F) > B Br(F,) > Q/Z > 0 
(14.6.11) 
([Ay]), e X, inv, [Ay] 


where the first map is the natural diagonal inclusion [A] + ([A ®, F,]), and the 
second map is the sum of the local invariant maps inv,,: Br(k,) > Q/Z from Remark 
13.4.3. The class of a quaternion algebra B in a Brauer group over a field is 2-torsion 
by 8.3.4, and the local invariant inv, B, is equal to 0,1/2 according as B, is split or 
ramified, and in this way we recover the main classification theorem. (In this sense, the 
discriminant of a quaternion algebra captures the Brauer class of a quaternion algebra 
at the finite places, and the ramification set captures it fully.) The exact sequence 
(14.6.11) is sometimes called the fundamental exact sequence of global class field 
theory: see Milne [Milne-CFT, §VIII.4] or Neukirch-Schmidt—Wingberg [NSW2008, 
Theorem 8.1.17]. 


14.7 Theorems on norms 


In the previous sections, we have seen how both local-global principles allow a nice, 
clean understanding of quaternion algebras—and at the same time, the norm groups 
play an important role in this characterization. These themes will continue through 
the book, so we develop them here in an important first case by describing the group 
nrd(B*) < F*. 

We retain our hypotheses that F is a global field and B a quaternion F-algebra. 


14.7.1. First, we recall the calculation of the local norm groups (Lemma 13.4.9): for 
v € PIF, we have 


Rey if v € Ram B is real (i.e., B, = H); 
E otherwise. 


nrd(B%) = | 


Under B — B,, we have nrd(B”) < nrd(B%) for all places v € PIF, and this 
‘places’ a condition on the reduced norm at precisely the real ramified places. 
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14.7.2. Let Q C Ram B be the set of ramified (necessarily real) archimedean places in 
B. (If F is a function field, then Q = Ø.) Let 
F* 


>Q0 


:= {x € F* : v(x) > 0 for all v € Q} (14.7.3) 


be the group of elements that are positive for the embeddings v € Q. For Q the set of 
all real places, we write simply FX) and call such elements totally positive. 


By 14.7.1, we have nrd(B™) < F% o: In fact, equality holds. 


Main Theorem 14.7.4. (Hasse—Schilling). We have nrd(B*) = F% 


>90 


To prove this theorem, we will use two lemmas. 


Lemma 14.7.5. Let v be a noncomplex place of F. Let n, € F3, and if v is real suppose 
n, > 0. Then there exists t, € F, such that x? — t,x + n, is separable and irreducible 
over F,. Moreover, if n, € R, then we may take ty € Ry. 


Proof. We suppose that char F, + 2 and leave the other case as an exercise (Exercise 
14.22). If n, ¢ re. then we can take t, = 0; this includes the case where v is a 
real place. So suppose n, € F*?. Let Ky = F,(Vd,) 2 F, be a separable quadratic 
extension, and in particular d, ¢ Ee. The quadratic form (1, —47,,) = (1,-1) over 
F, is a hyperbolic plane (Definition 5.4.1) so universal; let x, y, € F, be such that 
cod - Any y2 = d,. We cannot have y, = 0, else x2 =d, € FZ, Let tp = xy/yy. Then 
x? — t,x +n, has discriminant d, and so is separable and irreducible. 

For the second statement, multiplying by a square we may suppose without loss of 
generality that d, € R, so the equality A = dy + 4n, implies v(t) > Oand4 E R. D 


Next, we want to show that we can approximate a polynomial over a completion 
F, by a polynomial over the global field F sufficiently well—the reader is invited to 
ignore this on a first reading and accept this intuitively as a consequence of the fact 
that F is dense in F}. 


Lemma 14.7.6. Let v € PIF, let f,(x) = xX -tx +n, € F,[x] be a separable 
polynomial, and let € > 0. Then there exists t,n € F such that |t — ty|,|n — n,| < € and 
such that f(x) = x? — tx +n has 


BIFOO = KA). (14.7.7) 


In particular, f(x) is separable, and if f,(x) is irreducible then so is f(x). 
Further, if already n, € F then we may take n = n,, and similarly with t. 


Proof. If F, is nonarchimedean, the lemma follows from Corollary 13.2.9 and the 
fact that F is dense in Fp. The case where F, is archimedean is straightforward: see 
Exercise 14.23. o 


The same argument can be applied to several local fields at once, as follows. 
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Corollary 14.7.8. Let & C PIF be a finite set of noncomplex places. For each v € È, 
let f,(x) = x —tyx +n, € F,[x] be a separable polynomial, and let € > 0. Then 
there exists t,n € F such that for f(x) = x? — tx +n and for all v € = we have 
|t-—t||a—n,| < eand F,[x]/(f(x)) = Fy[x]/(4A,.Q)). In particular, f(x) is separable, 
and if f,(x) is irreducible for some v then so is f(x). 

Further, ifall n, = m E F for v € È}, then we may take n = m, and similarly with t. 


Proof. We repeat the argument of Lemma 14.7.6, using weak approximation (i.e., F 
is dense in [],, Fy; look ahead to Lemma 28.7.1 and the adjacent discussion) for all 
v € È to find t,n. oO 


We now conclude with a proof of the theorem on norms. 


Proof of Main Theorem 14.7.4. Letn € FR 0° We will construct a separable quadratic 
extension K 2 F with K —> B such that n € Nmx r(K%). To this end, by Proposition 
14.6.7, itis enough to find K 2 F such that K, is a field for all v € Ram B. 

By Lemma 14.7.5, for all v € Ram B, there exists 4, € F, such that the polynomial 
x7-t,x+n € F,[x]is separable and irreducible over F,,; here if v € Q is real we use that 
v(n) > 0. By Corollary 14.7.8, there exists t € F such that x? — tx + n irreducible over 
each F,. Let K be the extension of F obtained by adjoining a root of this polynomial. 
Then K, is a field for each ramified v, and n € Nmgjr(K*%) as desired. oO 


Exercises 


> 1. Complete the proof of Hilbert reciprocity (Proposition 14.2.1) in the remaining 
cases (a, b) = (—1, 2), (2, 2), (-1, p), (2, p). In particular, show that 


-1,2) _ (2,2) _ 
(z Beg ae 


and 


a 
(a, p) = (a, P)p = (z) 
P 
for a = —1,2 and all primes p (cf. 12.4.13). 


2. Derive the law of quadratic reciprocity (14.2.4) and the supplement (14.2.5) 
from the statement of Hilbert reciprocity (Proposition 14.2.1). 


> 3. Letn € Zyo. 


(a) Suppose n is of the form n = 44 (8b + 7) with a, b € Z. Show that there is 
no solution to x? + y? +z =n with x, y,z € Z. [Hint: Look modulo 8. ] 

(b) Suppose n is not of the form n = 4°(8b + 7) with a, b € Z. Show that there 
is a solution to x? + y? +z? = n with x, y,z € Z2. [Hint: lift a solution 
modulo 8 using Hensel’s lemma. ] 


> 4. Let n € Z be nonzero. Show that n is a square in Qo if and only if n is of the 
form n = 4° (8b + 1) with a,b € Z. 
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. Letn > O have -n ¢ Q%*. Let B = (—1, —1 | Q) and let K = Q(V—n). Show that 
2 


K splits B. [Hint: Use the local-global principle for embeddings (Proposition 
14.6.7).] Conclude that there exists a € B such that a? = —n, and conclude as 
in Theorem 14.3.8 that n is the sum of three squares. 


. Let F be a number field. Show that every totally positive element of F is a sum 


of four squares of elements of F. 


. Show that the law of Hilbert reciprocity (Proposition 14.2.1) implies the law of 


quadratic reciprocity; with the argument given in section 14.1, this completes 
the equivalence of these two laws. 


. In the proof of Legendre’s theorem (Theorem 14.3.4), we reduced to the case 


a,b > 0 and c = —1. Show that this reduction is valid. 


. In this exercise, we generalize the proof of Proposition 14.2.7 to give a more 


general construction of quaternion quaternion algebras. Let D be a squarefree 
positive integer and let u = —1 if D has an odd number of prime divisors, 
otherwise u := 1. 


(a) For b € Z squarefree, show that K := Q(vb) embeds in a quaternion 
algebra of discriminant D if and only if: 


e b < Oif B is definite; 
e b #1 (mod 8) if 2 | D; and 
e for all odd primes p | D, we have () #1. 
p 
(b) Suppose b satisfies the conditions in (a) but with the further requirement 
b 
that D | b, i.e., in the third condition we require (?) = 0. Let q be an odd 
p 


prime such that q? := uq has: 


q? 
e (=| = —1 for all odd p | D; 


o 
e (ż) = 1 for all odd p | (b/D); and 
P 
e g° = 1,5 (mod 8) according as 2 ł D or 2 | D. 
Note there exist infinitely many such primes q by the infinitude of primes 
> 
qb 


in arithmetic progression. Then show that B := 


and Ram B = È. 


) has disc B = D 


> 10. Let S C PI(Q) be eligible. For each v € S, let f, € Q% be given. Show that there 


11. 


exists t € Q* such that ¢ € 4,Q*? for all v € S and vp(t) = 0 for all p ¢ S\{oo} 
except (possibly) for one prime p = q. 


Let F = Q(Vd) be a real quadratic field. Find a, b € Q* (depending on d) such 
that (a, b | F) is a division ring unramified at all finite places. 


12. Let F := Q(a@) where œ := 2 cos(27/7) = & + 1/4 with G = exp(27i/7). Let 


B := (-1, -1 | F). Compute Ram(B) and find a maximal order in B. 
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13. Let F be a global field with char F # 2 and let B be a quaternion algebra over 
F. Let L D F be a finite extension. An extension K 2 F is linearly disjoint 
with L over F if the multiplication map K ®¢ L — KL is an isomorphism of 
F-algebras. 


Show that there exists a splitting field K 2 F for B such that K is linearly disjoint 
with L over F. 

14. Show that the notion of discriminant of a quaternion algebra as the product of 
ramified primes is not such a great notion when R is an arbitrary Dedekind 
domain, as follows. 


Let R = Qt]; then R is a Dedekind domain. Let F = FracR = Q(t). Let 
Bo = (a, b | Q) be a division quaternion algebra ove Q and let B = Bo &Q F = 
(a, b | Q(t)). Show that there are infinitely primes at which B is “ramified”: for 
every prime p = (t — c)R, show that the algebra By is a division quaternion 
algebra over Fy ~ Q((t)). [Hint: See Exercise 13.6.] 
15. Using Hilbert reciprocity, one can convert the computation of an even Hilbert 
symbol to the computation of several odd Hilbert symbols, as follows. 
Let F be a number field, let p | (2), and let a,b € F*. Show that there exist 
(computable) a’, b’ € F* such that the following hold: 
(i) (a, b)p = (a’, b')p; and 
(ii) ordg(a’) = ord,(b’) = 0 for all q | (2) with q # p. 
Conclude that 
(aby = | | œh. 


vePI F 

v odd 
16. Let F be a number field with ring of integers Zr. We say an ideal b C Zp is odd 
if Nm(b) is odd, and b € Zr is odd if (b) is odd. For a € Zp and b C Zp odd, 
let 7 be the generalized Jacobi symbol, extending the generalized Legendre 


a 


symbol by multiplicativity, and write ale 
bZr 


5 for a,b € Zr\{0} with b 
odd. 


(a) Leta, b € Zr satisfy aZr + bZr = Zr, with b odd, and suppose a = aga, 


with dı odd. Then 
| | (a b) 
b di ? 4 


y|2c0 


(b) Suppose that F has a computable Euclidean function N and let a,b € 
Zp {0} with b odd. Describe an algorithm using (a) to compute the Leg- 


endre symbol (5). 
17. In this exercise, we give a constructive proof of the surjectivity of the map 


B +> Ram B in Main Theorem 14.6.1 in the spirit of the proof of Proposition 
14.2.7 (assuming two analytic results). 
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Let F be a number field, and let £ c PIF be a finite set of noncomplex places 
of F of even cardinality. Let R = Zr be the ring of integers of F. 


(a) 


(b 


wm 


(c) 


(d) 


Let D := [ [pez p be the product of the primes corresponding to nonar- 
chimedean places in S. Using weak approximation (see Lemma 28.7.1), 
show there exists a € Ð such that: 


e v(a) < 0 for all real places v € X and v(a) > 0 for all real places v ¢ È, 
if there are any; and 
e aR = Db with D +b = Rand2R+b=R. 


In the special case where R has narrow class number | (that is, every ideal 
a C R is principal a = (a) and generated by an element a € R such that 
v(a) > O for every real places v), show that we may take (a) = D and 
b=R. 

Show that there exists t € R coprime to 8aR such that the following hold: 


t 
For all primes p | D with p { 2R, we have | —] = -1; 


For all primes p | © with p | 2R, the extension F, (VÐ) is the quadratic 
t 
unramified extension of Fy, so () = —1 in the sense of the general- 


ized Kronecker symbol; 


t 
For all primes q | b we have | —} = 1; and 


For all prime powers të || 8R with t { D, we have t = 1 (mod t°). 


Show that ft is well-defined as an element of (R/8aR)*, i.e., if t’ = t 


(mod 8a) then ?’ also satisfies these conditions. 
Using the infinitude of primes in arithmetic progression over number fields 


(Theorem 26.8.26), show there exists q € R a prime element (i.e., qR is a 
prime ideal) such that g = t (mod 8a) witht as in (b) and further satisfying 
v(q) < 0 for all real places v € È. 


Show that B := (42) has Ram B = È. 


18. Let F be a global field. Show that two quaternion algebras B, B’ over F are 
isomorphic if and only if they have the same quadratic subfields (for a quadratic 
extension K > F, we have K > B if and only if K = B’). 


[See work of Garibaldi-Saltman [GS2010] for a discussion of the fields F with 
char F # 2 and the property that two division quaternion algebras over F with 
the same subfields are necessarily isomorphic. (Roughly speaking, they are the 
fields for which nonzero 2-torsion elements of the Brauer group can be detected 
using ramification.)] 

19. In this exercise, we consider how ramification sets change under base extension. 
Let F be a global field and let K 2 F be a finite separable extension. 


(a) 


Let B be a quaternion algebra over F with ramification set Ram B and 
consider Bg = B 8r K. Show that 


Ram(Bx) = {w € PI(K) : w lies over v € Ram B and 2 { [Ky : F,]}. 
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20. 


> 21. 


22. 


> 23. 


> 24. 


25. 


(b) Suppose B is a division algebra and [K : F] is odd. Show that Bx is a 
division algebra. 

(c) Asa converse to (a), suppose that Xx C PI(K) is a finite subset of noncom- 
plex places of K of even cardinality with the property that if w € Xx lies 
over v € PIF, then [Kẹ : Fy] is odd and moreover 


{w € PIF : w lies over v and [Kọ : Fy] is odd} C Xx. 


Show that there exists a quaternion algebra B over F with the property that 
Ram(Bx) = Xx. (We say that the quaternion algebra associated to the set 
<x descends to F.) 

(d) As a special case, what do (a) and (c) say when [K : F] = 2? 

(e) Restate (a) and (b) in terms of the kernel of the map Br(F)[2] > Br(K)[2] 
induced by [B] [Bx] (see Remark 14.6.10). 


Let R be a global ring with F = Frac R, and let K 2 F be a finite Galois 
extension with S the integral closure of R in K. Let B be a quaternion algebra 
over F and consider Bx = B &rp K. Then Gal(K | F) acts naturally on Bx via 
o («æ & x)= «æ ®oa(x). (This action is not by K-algebra isomorphism!) 


Show that there exists a maximal S-order O C Bx stable under Gal(K | F), i.e., 
o (O) = O for all o € Gal(K | F). 


Let F be a global field, let vı, . . ., v, be places of F, and for each v; suppose we 
are given the condition ramified, split, or inert. Show that there exists a separable 
quadratic extension K 2 F that K,, satisfies the given condition for each i. [Hint: 
follow the proof of Main Theorem 14.7.4.] 


Let F be a global field, let B4, B2, . . ., B, be quaternion algebras over F, and let 
B := B1 8 B2 8- -- 8 B,. Recalling section 8.2, show (in as many ways as you 
can) that B ~ M,,-:(B’) for a quaternion algebra B’ over F. (Recalling 8.3, by 
Merkurjev’s theorem this shows the class of every element in the 2-torsion of 
the Brauer group Br(F)[2] is represented by a quaternion algebra.) 


Let F, be a local field with char F, = 2. Let n € F,,. Show that there exists t € Fy 
such that x? — tx + n is separable and irreducible. 


Prove Lemma 14.7.6 for v an archimedean place. 


In this advanced exercise following up on Exercise 9.8, we consider features of 
quaternion algebras and orders in the case of a global function field, assuming 
background in algebraic geometry. 


Let X be a smooth, projective, geometrically integral curve over a finite field k; 
then X is a separated, integral Dedekind scheme. Let @x be its structure sheaf. 
Let F be its function field, and let B be a quaternion algebra over F. Define a 
sheaf of @x-orders in B, or simply an Gx-order in B, to be an @x-lattice Z in 
B such that for each open set U € X, the Gx(U)-lattice GV) is a subring of B. 
We recall the local-global dictionary for Oy-lattices (Exercise 9.16(c)). 
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In parts (a)-(c) we work out an example: let X = P! with function field F = k(t) 
where div t = (0) — (co). Suppose that char k # 2, and let u € kX*\k*?. Let B be 
the quaternion algebra with Ram(B) = {(0), (c)}. 


(a) Show that B = (u,t | F). 

(b) Let U = Spec k[t] = X\ {co}. Show that there exists a unique Ox-order Z 
in B with AU) = k[t] + k[tli + k[t]j + k[t]ij and stalk Ai.) a maximal 
CX (co)-order. Describe explicitly Bio) and A(Spec k[1/t]) as orders in B. 

(c) With & from (b), show that A(X) = k[i]. 


Restoring generality, let Z be an Ox-order such that AV) is a maximal Ox(U)- 
order in B for all affine open sets U. 


(d) Show that A(X) has a zero divisor if and only if A(X) =~ Mo(k) if and 
only if B ~ M} (F). 

(e) Show that A(X) is a k-algebra with a nondegenerate standard involution. 

(f) Suppose that B is a division algebra. Show that either A(X) = k or 
B(X) = kz is the quadratic extension of k. 

(g) Still supposing that B is a division algebra, show that if A(X) = k2, then 
every ramified place of B has odd degree. [Hint: show that B = (K,b | F) 
where K = Fk is the constant field extension of F of degree 2, and 
b € F*\k*. Compute the Hilbert symbol at v € Ram(B) to show v(b) is 
odd.] 
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Chapter 15 


Discriminants 


Discriminants measure volume and arithmetic complexity, and they simultaneously 
encode ramification. We devote this chapter to their study. 


15.1 > Discriminantal notions 


Let x1, .. ., Xn € R”, and let A be the matrix with columns x;. Then the parallelopiped 
with edges from the origin to x; has volume |det(A)|. We can compute this volume in 
another way: 
det(A)? = det(A‘A) = det(M) (15.1.1) 
where M has ijth entry equal to the ordinary dot product x; - xj. 

The absolute discriminant of a number field is a volume and a measure of arithmetic 
complexity, as follows. If x1,...,x, is a Z-basis for Zr andi: F > F @g R = R” 
(normalized with an extra factor of V2 at the complex places), then the volume of Zr 
in this embedding is the absolute determinant of the matrix with columns 1(x;), and 
its square is defined to be the absolute discriminant of F. Replacing the dot product in 
the definition of M in (15.1.1) with the trace form (x, y) > Trr/g(xy), we see that the 
absolute discriminant is a positive integer. A prime p is ramified in F if and only if it 
divides the discriminant, so this volume also records arithmetic properties of F. 

More generally, whenever we have a symmetric bilinear form T: V x V > F on 
a finite-dimensional F-vector space V, there is a volume defined by the determinant 
det(T(x;, x;))i,;: and when T arises from a quadratic form Q, this is volume is the 
discriminant of Q (up to a normalizing factor of 2 in odd degree, see 6.3.1). In 
particular, if B is a finite-dimensional algebra over F, there is a bilinear form 


BxBoF 
(a, p) > Trgir (ap) 


(or, when B is semisimple, the bilinear form associated to the reduced trace trd) 
and so we obtain a discriminant—a “squared” volume—measuring in some way the 
complexity of B. As in the commutative case, discriminants encode ramification. 

In this chapter, we establish basic facts about discriminants, including how they 
behave under inclusion (measuring index) and localization. To illustrate, let B be a 
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quaternion algebra over Q and let O C B be an order. We define the discriminant of 
O to be 
disc(O) := |det(trd(a;a;));,;| € Z>0 (15.1.2) 


„b 
where a, ...,@4 is a Z-basis for O. For example, if B = (= with a,b € Z\{0}, 
then the standard order O = Z + Zi + Zj + Zk has 
disc(O) = (4ab)’; 


indeed, this is the discriminant of the quadratic form (1, —a, —b, ab), the reduced norm 
restricted to O. If a,b < 0, i.e. B is definite, then the reduced norm is a Euclidean 
norm on B~ = B @g R ~ H; normalizing with an extra factor v2, the discriminant is 
square of the covolume of the lattice O c Bæ. For example, the Lipschitz order Zi, jY 
(11.1.1) has disc(Z(i, 7)) = 4°, the square of the covolume of the lattice (V2Z)* c R4. 

If O’ 2 O, then disc(O) = [O’ : OF disc(O’); in particular O’ = O if and only if 
disc(O’) = disc(O). It follows that the discriminant of an order is always a square, so 
we define the reduced discriminant discrd(O) to be the positive integer square root, 
and discrd(O)? = disc(O). The discriminant of an order measures how far the order 
is from being a maximal order. We will show (Theorem 15.5.5) that O is a maximal 
order if and only if discrd(O) = disc B, where disc B is the (squarefree) product of 
primes ramified in B. 

In an extension of Dedekind domains, the different of the extension is an ideal whose 
norm is the discriminant of the extension (see Neukirch [Neu99, §III.2]). The different 
is perhaps not as popular as its discriminant cousin, but it has many nice properties, 
including easy-to-understand behavior under base extension. Similar conclusions holds 
in the noncommutative context (presented in section 15.6). 


15.2 Discriminant 


For further reference on discriminants, see Reiner [Rei2003, §10, §14]. 


Let R be a noetherian domain and let F = Frac R. Let B be a semisimple algebra 
over F with dimp B = n. For elements aj,...,a@, € B, we define 


d(a1,..., Qn) = det(trd(ajay))i,j=1,....n- (15.2.1) 


Let J C B be an R-lattice. 


Definition 15.2.2. The discriminant of J is the R-submodule disc(/) € F generated 
by the set 
{d(ay,...,Q@y)! Q1,...,Q% E I}. 


15.2.3. If 7 = O, then for a,...,@, € O we have a;a; € O and so trd(aj;a;) € R for 
all i, j. Thus d(q1,...,@,) € R and therefore disc(O) € R. 
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Remark 15.2.4. When working over Z, it is common to take the discriminant instead to 
be the positive generator of the discriminant as an ideal; passing between these should 
cause no confusion. 


Although Definition 15.2.2 may look unwieldly, it works as well in the commutative 
case as in the noncommutative case. Right away, we see that if O C O’ are R-orders, 
then disc(O’) | disc(O). 

The function d itself transforms in a nice way under a change of basis, as follows. 


Lemma 15.2.5. Let aj,...,@, E€ B and suppose B\,...,Bn € B are of the form 
Bi = ae Mj Qj with Mij E F. Let M = (mj;)ji,j=1,....n- Then 


d(B1,..-,Bn) = det(M)*d(ay,...,@p). (15.2.6) 


Proof. By properties of determinants, if 6),..., 8, are linearly dependent (over F) then 
d(Bi,..-, Bn) = 0 and either a,...,@, are also linearly dependent or det(M) = 0, and 
in either case the equality (15.2.6) holds trivially. 

So suppose that £1, ..., Bn are linearly independent, then a1, . . ., Œn are also linearly 
independent and the matrix M, a change of basis matrix, is invertible. By Gaussian 
reduction, we can write M as a product of elementary matrices (a matrix that coincides 
with the identity matrix except for a single off-diagonal entry), permutation matrices 
(a matrix interchanging rows suffices), and a diagonal matrix; it is enough to check that 
the equality holds when M is a matrix of one of these forms. And for such a matrix, the 
equality can be checked in a straightforward manner using the corresponding property 
of determinants. Oo 


Corollary 15.2.7. If I is free as an R-module, and œ, .. ., œn is an R-basis for I, then 
disc) = d(q,...,@n)R. 


Proof. The matrix M writing any other f£),...,8, € J in terms of the basis has 
M e M, (R) so det(M) € R, and therefore d(6),..., Bn) € d(@1, . . ., Œ@n)R by Lemma 
13.255. o 


15.2.8. More generally, if Z is completely decomposable with 
I = a D: D Anan 
such as in (9.3.7), then from (15.2.6) 
disc(1) = (1 -+ an) dlan,» -p Œn). 


More generally, the discriminant is well-behaved under automorphisms because 
the reduced trace is so. 


Corollary 15.2.9. If ¢: B — B is an F-algebra automorphism, then disc(¢(D) = 
disc(7). 


Proof. By Proposition 7.8.6, we have trd(ġ(a«p)) = trd(a@B) for all a, B € B. There- 
fore, for all a1,...,@, E€ B we have d(G(a1),..., 6(@n)) = d(a1,...,Q@n); the result 
disc(@(/)) = disc(/) follows. o 
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Our primary interest will be in the case J = O. 


Example 15.2.10. Suppose char F + 2. Let B := (a,b | F) with a,b € R. Let O := 
R@®Ri@ Rj @ Rij be the standard order. Then disc(O) is the principal R-ideal generated 
by 


2 0 0 0 
a ae 0 2a 0 O |. 2 
d(,i, j,ij) = det 0 0 2b 0 —(4ab)°. 


The calculation when char F = 2 is requested in Exercise 15.1. 


Example 15.2.11. Let B := M,,(F) and O := M,,(R). Then disc(O) = R (Exercise 
15.2). 


15.2.12. Let B := (K, b | F) be a quaternion algebra over F with b € R and let S be an 
R-order in K. Let O := S ® Sj; then O is an R-order in B by Exercise 10.7. We have 
disc(O) = b? disc(S)*, by Exercise 15.4. 

In particular, let F be a nonarchimedean local field, let R be its valuation ring and 
p = Ra its maximal ideal, and let B be a division quaternion algebra over F. Then 
by Theorem 13.3.11, we have B = (K,a | F) with K 2 F an unramified separable 
quadratic extension of F. The valuation ring S of K has disc(S) = R, so the valuation 
ring O = S @ Sj of B has discriminant disc(O) = p?. 


15.2.13. Equation (15.2.6) and the fact that Ip) = J ®p Rp) implies the equality 
disc([(p)) = disc(1)(p) 


on localizations and for the same reason an equality for the completions disc(/y) = 
disc(/),. In other words, the discriminant respects localization and completion and can 
be computed locally. Therefore, by the local-global principle (Lemma 9.4.6), 


disc(I) = () disc()). 
p 


Lemma 15.2.14. If B is separable as an F-algebra and I is projective as an R-module, 
then disc(1) is a nonzero projective fractional ideal of R. 


Proof. Since I is an R-lattice, there exist elements a1, ...,@, which are linearly inde- 
pendent over F. Since B is separable, by Theorem 7.9.4, trd is a nondegenerate bilinear 
pairing on B so disc(/) is a nonzero ideal of R. It follows from Lemma 15.2.5 that 
disc(/) is finitely generated as an R-module, since this is true of J: we apply d to 
all subsets of a set of generators for J as an R-module. To show that disc(/) is pro- 
jective, by 9.2.1 we show that disc(/) is locally principal. Let p be a prime ideal of 
R. Since J is a projective R-module, its localization Ip) is free; thus from Corollary 
15.2.7, we conclude that disc(J)(p) = disc([(p)) is principal over Rp) and generated by 
disc(a@},...,@,) for an Rp)-basis @ı, . . ., Œn of I, as desired. oO 


15.3. QUADRATIC FORMS 245 


We conclude this section comparing lattices by their index and discriminant as 
follows. We recall the definition of index (section 9.6). 


Lemma 15.2.15. Let I, J C B be projective R-lattices. Then 
disc(I) = [J : I] disc(J). 
Moreover, if I C J, then disc) = disc(J) if and only if I = J. 


Proof. For the first statement, we argue locally, and combine (15.2.6) and Lemma 9.6.4. 
For the second statement, clearly disc(J) € disc(/), and if J = J then equality holds; 
and conversely, from disc(/) = [J : iy disc(J) = disc(J) we conclude [J : I]r = R, 
hence J = I by Proposition 9.6.8. Oo 


Remark 15.2.16. We defined the discriminant for semisimple algebras so that it is given 
in terms of the reduced trace. This definition extends to an arbitrary finite-dimensional 
F-algebra B, replacing the reduced trace by the algebra trace Trg). If B is a central 
simple F-algebra of dimension n?, then n trd = Trgir so when n € F* one can recover 
the discriminant as we have defined it here from the more general definition; but if 
n = 0 € F then the discriminant of B computed with the algebra trace will be zero. 


15.3 Quadratic forms 


Essentially the same definition of discriminant (Definition 15.2.2) applies to quadratic 
modules, as follows. We recall 6.3.1, where the discriminant was defined in all char- 
acteristics. 

Let Q: M —> L be a quadratic module over R (Definition 9.7.3) with rk M = n and 
associated bilinear map T: M X M > L. 


15.3.1. Let xı, . .., Xn € M and f € LY := Homg(L, R). If n is even, we define 


d(x1,...,Xn3 f) := det( f (T (xi, xj)))i,j=1,...n- (15.3.2) 
If n is odd, then by specializing the universal determinant as in 6.3.4, we define 
d(x1,..-,Xn3 f) := (det /D(f T xi, xii j=1,..nn- (15.3.3) 
The discriminant of Q is then the ideal disc(Q) € R generated by the set 
(A, ccnp Xn; FVM. oy E M, f E LY}. (15.3.4) 
15.3.5. If M, L are free with R-basis x), . . ., Xn and e, respectively, then letting f € LY 


the dual to e with f(e) = 1 gives 
disc(Q) = d(x1, . . -, Xn; f)R. 


In particular, since M, L are projective and therefore locally free over R, the discrimi- 
nant of Q is locally free and hence a projective R-ideal. 
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Lemma 15.3.6. The discriminant of a quadratic module is well-defined up to similarity. 


Proof. Let Q: M —> L and Q’: M’ — L’ be quadratic modules over R similar by 
g:M > M’ andh: L = L. It suffices to check the invariance locally, so to this end 
we may suppose that the modules are free; choose a basis M = };_; Rx; and L = Re, 
and let x; = g(x;) and e’ = h(e). Then M’ = X; Rx; and L’ = Re’. Let f, f’ be dual 
to e, e’; then postcomposing Q and Q’ by f, f’ we may suppose L = L’ = R and h is 
the identity. 

We then have Q’(g(x)) = Q(x) for all x € M, so the same is true of the associated 
bilinear forms T, T’. But then d(x» <. X1) = A(X1,..., Xn), and by 15.3.5 this implies 
disc(Q) = disc(Q’) as ideals of R. o 


15.3.7. Let B be a finite-dimensional F-algebra with a standard involution. Then the 
reduced norm is a quadratic form on B with associated bilinear form T(a, 8) = trd(a). 
Although the bilinear form differs by the presence of this standard involution from the 
definition of discriminant in (15.2.1), the resulting discriminants are the same (up to 
R*): see Exercise 15.13. 


Lemma 15.3.8. The quadratic module Q is nonsingular if and only if disc(Q) = L. 

In particular, suppose that M = R” is free with basis e; and L = R, and let 
[T] := (Tei, ej))i j E Mn(R) be the Gram matrix in this basis. Then Q is nonsingular 
if and only if det([T]), (det /2)X([T]) € R* according as n is even or odd. 


Proof. The map T: M — Hompr(M,L) is an isomorphism if and only if it is an 
isomorphism in every localization, so we may suppose that Q is free, with M = R” 
and L = R, which is to say we may prove the second statement in the case where R is 
local, with maximal ideal p and residue field k := R/p. Let Q mod p: M r k > k 
be the reduction of Q; its Gram matrix is [T] mod p € M,,(k). Over the field k, we 
have that Q mod p is nonsingular if and only if it is nondegenerate if and only if 
det[T], (det /2)([T]) + 0 according as n is even or odd; since R is local, these are 
equivalent to asking that these values are in R*. An application of Nakayama’s lemma 
then implies the result. o 


15.4 Reduced discriminant 


In this section, we extract a square root of the discriminant for quaternion orders. 
Indeed, in Example 15.2.10, we saw that the discriminant of the standard R-order 
O € B = (a,b | F) is disc(O) = (4ab)}?R, a square. If O’ is another projective R-order, 
then disc(O’) = [O : oR disc(O) by Lemma 15.2.15, so in fact the discriminant of 
every R-order is the square of an R-ideal. 

In fact, there is a way to define this square root directly, inspired by vector calculus. 


15.4.1. Ifu, v, w € R? then |u-(vxw)|, the absolute value of the so-called mixed product 
(or scalar triple product or box product), is the volume of the parallelopiped defined 
by u, v, w; identifying R? = H? as in section 2.4, from (2.4.10) we can write 


2u:-(v X w) = u; (vw — wy) = — trd(u(vw — wy)). 
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For example, 2 = —2i-(j X k) = -i- (jk — kj) = trd(ijk). 


More generally (and carefully attending to the factors of 2) we make the following 
definition. Let B be a quaternion algebra over F. 


15.4.2. For a1, a2, a3 € B, we define 


m(Q1, @2, @3) := trd((œ1@2 — @2@1)@3) 


= @10203 — A20103 — A302 X1 + A3Q)1 Q2. 


Lemma 15.4.3. The form m: B x B x B > F is an alternating trilinear form which is 
well-defined as a form on B/F. 


Proof. The form is alternating because for all a},a2 € B we have m(aq1, a1, @2) = 0 
and 


m(Q, @2, &1) = trd((@1@2 — a2@1)a7) = trd(nrd(a1)a@z2) — trd(@z nrd(a,)) = 0 


and similarly m(@1, @2,@2) = 0. The trilinearity follows from the linearity of the 
reduced trace. Finally, from these two properties, the descent to B/F follows from the 

computation m(1, a1, a2) = 0 for all a1, @ € B. 
(Alternatively, one can check that the pairing descends to B/F first, so that the 
involution becomes a + F = —a + F, and then the alternating condition is immediate.) 
o 


Definition 15.4.4. Let J C B be an R-lattice. The reduced discriminant of 7 is the 
R-submodule discrd(/) of F generated by 


{m(@1, @2, @3) : @, @2, a3 E I}. 

15.4.5. If a;, 8; € B with 8; = Ma; for some M € M3(F), then 
m(Bı, P2, B3) = det(M)m(a1, a2, a3) (15.4.6) 

by Exercise 15.10. It follows that if J & J are projective R-lattices in B, then 

discrd(7) = [J : I] discrd(J). 
Lemma 15.4.7. If I is a projective R-lattice in B, then disc) = discrd(). 
Proof. First, we claim that 

mii, j,i)? = —d(1,i, j,i). 
If char F + 2, then disc(1, i, j, ij) = —(4ab)* by Example 15.2.10 and 
m(i, jij) = td((ij — jiij) = trd(2ij@j)) = 4ab, 

as claimed. See Exercise 15.1 for the case char F = 2. This computation verifies the 


result for the order O = R @ Ri @ Rj © Rij. 
The lemma now follows using (15.2.6) and (15.4.6), for it shows that 


2 
m(ay, @2, a3)” = —d(1, a, a2, a3) 


for all a1, a2, a3 € B, and the latter generate discrd(/) by Exercise 15.7. o 
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The notions in this section extend more generally to an arbitrary algebra B with a 
standard involution. 


15.5 Maximal orders and discriminants 


We now relate discriminants to maximal orders. Throughout this section, we suppose 
that R is a Dedekind domain. We record the following important principle. 


Lemma 15.5.1. Let O C€ O’ be R-orders. Then O = O' if and only if disc O = disc O’. 
Proof. In the nontrivial direction, by Lemma 15.2.15 we have 
disc O = [O’ : O]}, disc(O’) 
so disc O = disc O’ if and only if O = ©’. o 
First, we ensure the existence of maximal orders (cf. 10.4.2) using the discriminant. 


Proposition 15.5.2. There exists a maximal R-order O C B, and every order O is 
contained in a maximal R-order O’ C B. 


Proof. The algebra B has at least one R-order O as the left- or right-order of a lattice 
10.2.5. If O is not maximal, then there exists an order O’ 2 O with disc(O’) 2 disc(O) 
by Lemma 15.5.1. If O’ is maximal, we are done; otherwise, we can continue in 
this way to obtain orders O = O} € O2 Ç ... and an ascending chain of ideals 


disc(O;) Ç disc(O2) Ç ... of R; but since R is noetherian, the latter stabilizes after 
finitely many steps, and the resulting order is then maximal, by Lemma 15.2.15. oO 


Using the discriminant as a measure of index, we can similarly detect when orders 
are maximal. We recall (10.4.3) that the property of being maximal is a local property, 
so we begin with the local matrix case. 


Lemma 15.5.3. Suppose that R is a DVR, and let O C B := M,(F) be an R-order. 
Then O is maximal if and only if disc O = R. 


Proof. First, suppose O is maximal. Then by Corollary 10.5.5, we conclude O = 
M,,(R) (conjugate in B). By Corollary 15.2.9, we have discO = discM,(R); we 
computed in Example 15.2.11 that disc M,(R) = R, as claimed. The converse follows 
by taking O’ a maximal order containing O (furnished by Proposition 15.5.2) and 
applying Lemma 15.5.1. o 


Example 15.5.4. By 15.2.12, if F is a nonarchimedean local field with valuation ring 
R and B is a division quaternion algebra over F, then the valuation ring O C B is the 
unique maximal order (Theorem 13.3.11) with disc O = p? and discrd O = p. Arguing 
as in Lemma 15.5.3, we find that an R-order in B is maximal if and only if it has 
reduced discriminant p. 


Maximality can be detected over global rings in terms of discriminants, as follows. 
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Theorem 15.5.5. Let R be a global ring with field of fractions F, let B be a quaternion 
algebra over F, and let O C B be an R-order. Then O is maximal if and only if 


discrd(O) = discr(B). (15.5.6) 


Proof. Suppose that O is maximal. Then Op is maximal for all primes p of R. If 
By =~ Mo(Fp) is split, then by Lemma 15.5.3, discrd Op = Rp; if By is a division 
algebra, then discrd Op = pRy. Since discriminants are defined locally, we conclude 
that 
discrd(O) = I] p = discr(B) 
peRam BNS 

if R as a global ring is the ring of S-integers. 

In the other direction, if (15.5.6) holds, we choose O’ 2 O be a maximal 
R-superorder and conclude that disc(O) = discr(B)* = disc(O’) so O = O’ is maximal 
by Lemma 15.5.1. oO 


Example 15.5.7. We recall Example 14.2.13, giving an explicit description of quater- 
nion algebras B = (a, b | Q) of prime discriminant D = p. We now exhibit an explicit 
maximal order in each of these algebras. 

For p = 2, we have B = (—1, -1 | Q) and take O C B the Hurwitz order. 

For p = 3 (mod 4), we took B = (—p, —1 | Q). The order O := Z((1+i)/2, j) = S@Sj 
with S := Z[(1 + 1)/2] has discrdO = p by 15.2.12, so O is maximal by Theorem 
15.5.5: 

For p = 1 (mod 4), we had B = (—p, —q | Q) where q = 3 (mod 4) is prime and 
(2) = —1, so that by qudaratic reciprocity -pq = —qp = 1. In this case, let c € Z be 


P 
such that c? = —p (mod q). Then 


l+j iQ +j) get 
2 2 q 

is a maximal order: one checks that O is closed under multiplication (in particular, the 
basis elements are integral), and then that disc O = p. The order Z(i, (1 + j)/2)} € O has 
the larger reduced discriminant pq, hence the need for a denominator q in the fourth 
element. 

For further discussion of explicit maximal orders over Z, see Ibukiyama [Ibu82, 
pp. 181-182] or Pizer [Piz80a, Proposition 5.2]. For a more general construction, see 
Exercise 15.5. 


O:=ZeZ eZ 


15.6 Duality 


To round out the chapter, we relate the discriminant and trace pairings to the dual and 
the different. For a detailed, general investigation of the dual in the context of other 
results for orders, see Faddeev [Fad65]. 

We continue with the hypothesis that R is a domain with F = Frac R. Let B be an 
F-algebra with n := dimp B < ov. As the trace pairing will play a significant role in 
what follows, we suppose throughout that B is separable (in particular, semisimple) as 
an F-algebra with reduced trace trd. Let J, J be R-lattices in B. 
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Definition 15.6.1. The dual of 7 (over R, with respect to trd) is 
IË := {æ € B : trd(al) C R} = {a € B : trd(Ia) € R}. 
Some properties of the dual are evident. 
Lemma 15.6.2. 


FIC J then IË 2 JË. 
(b) For all B € B*, we have (Ble = IB. 
(c) If p E R is prime, then (Ip)! = Ëp) and the same with the completion. 


Proof. For parts (a) and (b), see Exercise 15.15. The proof of part (c) is similarly 
straightforward. o 


15.6.3. Suppose that / is free over R with basis a1,...,@,. Since the trace pairing on 
B is nondegenerate (Theorem 7.9.4), there exists a dual basis af € B to œ; under the 


reduced trace trd, so that trd(afa;) = 0, 1 according asi + j ori = j. 
Then /# is free over R with basis af, a „až: if B = bia! ++ bnat with 
bi, ..., bn € F, then £ € IË if and only if trd(@;8) = b; € R for all i. 


Lemma 15.6.4. I? is an R-lattice in B. 


Proof. Let œj,..., Œn € I be an F-basis for B, and let J = >); Ra; C I. Then there 


exists nonzero r € R such that r7 C J, so J C I Cr 'J. Let af, fe at € B be the 


dual basis as in 15.6.3. It follows that JË = > i Ra” is an R-lattice, and consequently by 
Lemma 15.6.2(a)—(b) we have rJ§ cic JË; since R is noetherian, 7? is an R-lattice. 
o 


From now on, we suppose that R is a Dedekind domain; in particular, I is then 
projective as an R-module. 


Lemma 15.6.5. The natural inclusion I > (I*)* C B is an equality. 


Proof. If œ € I and £ € I} then trd(@B) C R and a e€ (I#)}}. To show that the map is an 
equality, we argue locally, so we may suppose that J is free over R with basis a;; then 


by applying 15.6.3 twice, (1#)# has basis (at = q;, and equality holds. o 

Proposition 15.6.6. We have OR(I) = O,(/*) and OL(D) = Og(/*). 

Proof. First the inclusion (C). Let a € Or(/); then Ja C I, so T8Ta c IĦI and 
trd(aT*I) = td? Ta) C traQ#* 1) c R 


hence al# c IË and æ € OL(I#). Thus Ok(D © OLU® c Og((#)#) = ORCO) by 
Lemma 15.6.5, so equality holds. A similar argument works on the other side. o 


The name dual is explained by the following lemma. 
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Proposition 15.6.7. Let trd(J) = a € F. Then the map 


rs Homa(/, a) 
BR (ab trd(aB)) 
is an isomorphism of OR(1), OL (I)-bimodules over R. 


(15.6.8) 


Proof. For p € I, let øg: I — a be defined by ¢g(a@) = trd(af) for a € I. The map 
B+ ¢g € Homer, a) from (15.6.8) is an R-module homomorphism. Moreover, it a 
map of Or(/), OL(/)-bimodules: if y € OL) then y € Or(I*) by Lemma 15.6.6, with 
induced map 


bpy(a) = trd(aBy) = trd(yaB) = ġg(ya) = (ypa) (15.6.9) 


and similarly on the other side. 
Finally, we prove that the map (15.6.8) is also an isomorphism. Extending scalars 
to F, the trace pairing gives an isomorphism of F’-vector spaces 


Homrt(/, a) 8r F =~ Homp(B, F) = B 
Be op 


because the pairing is nondegenerate (as B is separable). So immediately the map is 
injective; and it is surjective, because if 6 € Homr(/, a) then ¢ = ġg for some £ € B, 
but then ¢(@) = trd(@B) € R for alla € I, so BE Tt by definition. o 


Remark 15.6.10. The content of Proposition 15.6.7 is that although one can always 
construct the module dual, the trace pairing concretely realizes this module dual as a 
lattice. (And we speak of bimodules in the proposition because Homg(Z, a) does not 
come equipped with the structure of R-lattice in B.) This module duality, and the fact 
that J is projective over R, can be used to give another proof of Lemma 15.6.5. 


The dual asks for elements that pair integrally under the trace. We might also ask 


for elements that multiply one lattice into another, as follows. 


Definition 15.6.11. Let /, J be R-lattices. The left colon lattice of 7 with respect to J 
is the set 
UT: /J)i:={@eBraJ ch 


and similarly the right colon lattice is 


(I: Drs={~aeB: Jacl. 


Note that (7 : J), = O_() is the left order of J (and similarly on the right). The 
same proof as in Lemma 10.2.7 shows that (7: J), and (I : J)p are R-lattices. 


Lemma 15.6.12. We have 
DË = dË: De =Ë: Da. 
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Proof. We have B € (IJ)* if and only if trd(BIJ) C R if and only if Ba € JË for 
all a € I if and only if 6 € (JË : Da. A similar argument works on the other side, 
considering trd(7J £) instead. o 


Corollary 15.6.13. We have OL (1) = (II®Ë and OR(D) = (UË DË. 


Proof. Combining Lemmas 15.6.5 and 15.6.12, 
OL(D =: Di = ESD. = F 


and similarly on the right. o 


Definition 15.6.14. The level of / is the fractional ideal IvI(7) = nrd(/*) c F. 
We now relate the above duality to the discriminant. 


Definition 15.6.15. The codifferent of O is 
codiff(O) := OË. 


Lemma 15.6.16. O, (codiff(O)) = Op(codiff(O)) = O and O € codiff(O). 


Proof. By Proposition 15.6.6, O = Og(O) = O_(codiff(O)) and similarly on the right. 
And O C codiff(O) since trd(OO) = trd(O) € R. o 


The major role played by the codifferent is its relationship to the discriminant, as 
follows. 


Lemma 15.6.17. disc(O) = [codiff(O) : Ol. 


Proof. For a prime p C R we have disc(O)p») = disc(Oyp)) and [Ors : Op lRo) = 
({O# : O] R)(p); and so to establish the equality we may argue locally. Since Oçp) is 
free over Rp), we reduce to the case where O is free over R, say O = 9); Raj. Then 
Of = Di Ra” with af, ee af € B the dual basis, as in 15.6.3. 

The ideal disc(O) is principal, generated by d(@1,..., Œn) = det(trd(a;a;));,;; at 
the same time, the R-index [ot : OJR is generated by det(6) where ô is the change of 


basis from af to a;. But 6 is precisely the matrix (trd(@;æ;));,; (Exercise 15.14), and 
the result follows. Oo 


Remark 15.6.18. In certain circumstances, it is preferable to work with an integral 
ideal measuring the discriminant, so instead of the codifferent instead a different: we 
will want to take a kind of inverse. We study this in the next chapter: see section 16.8. 
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Exercises 
Unless otherwise specified, let R be a noetherian domain with field of fractions F. 


a,b 


1. Let char F = 2 and let F ) be a quaternion algebra over F with a,b € R and 


b + 0. Show that O = R + Ri + Rj + Rij is an R-order in B and compute the 
(reduced) discriminant of O. 

2. Let B = M,(F) and O = M,(R) with n > 1. Show that disc(O) = R. [Hint: 
Compute directly on a basis {e;;};,; of matrix units, which satisfy eijeyj = eij 
if j =i’, otherwise zero.] 

3. Suppose R is a global ring, so F is a global field; let B be a quaternion algebra 
over F and let O C B be an R-order. Prove that for all primes p € R, we have 
Op ~ Mn (Rp) if and only if p { disc O. 

4. Let B := (K,b | F) be a quaternion algebra over a field F with b € F*. Let 
S C K be an R-order with 0 := disc(S); let b C K be a fractional S-ideal (which 
can be but need not be invertible), and finally let O := S @ bj. 


(a) Show that O is an R-order if and only if Nmgjr b S bo! R. 
(b) Compute that discrd O = 0(Nmxjr b)b. 


5. In this exercise, we consider a construction of maximal orders as crossed prod- 


a) 
ucts in the simplest case over Q, continuing Exercise 14.9. Let B := (=) bea 


quaternion algebra of discriminant D, where b € Z is squarefree with D | b and 
q is an odd prime with q? = +q = 1 (mod 4), the minus sign if B is indefinite. 


Let K := Qq?) be the quadratic field of discriminant q?. Let S C K be the 
ring of integers of K, so disc S = q°. 


> 


(a) Show that for all odd primes p | (b/D), we have £) = 1. Conclude there 
p 


exists an ideal b C S such that Nm b = b/D. 
(b) Let q C S be the unique prime above q, and let 
O :=S@(qb)'j 


Show that O is a maximal order in B. 
(c) Let c € Z satisfy c? = q? (mod 4b/D). Show that the order O in (b) can be 
written 


l+i D(c +i)j 
Zj 8 Z———.. 
y ASA- Sig 
6. Let B be a separable F-algebra with dimp B = n. Show that aj,...,a@, € B are 
linearly independent over F if and only if d(@1, . . ., Œn) # 0. 


7. Let O be an R-order. Show that disc(O) is generated by 


O=ZƏZ 


{d(1, a1, Sis -> @n-1) Ejes n-i E O}. 
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. Let J be an R-lattice in B over F, let K be a finite extension field of F, and let S 


be a domain containing R with field of fractions K. Show that 


disc(7 Qr S) = disc(/) @r S = disc(J)S. 


. Let char F + 2 and let B be a quaternion algebra over F. Let a, 8 € B be such 


that F(a) N F(B) = F. Recall the discriminant form A (Exercise 4.3), and let 


trd(a) trd() 


s := trd(« f) — 7 


Show that 
d(1, æ, B, ap) = —(s* - 4AA)? = -Ala p). 
[Hint: reduce to the case where trd(q@) = trd(8) = 0, noting the invariance of s.] 


Let B be a quaternion algebra over F. Define m : B x B x B —> F by 
m(@ı, @2, @3) := trd([@1, &2]@3) for a; € B. If Bj = Ma; for some M € M3(F), 
show that 

m(ß1, P2, 3) = det(M)m(a1, a2, a3). 


Let B be a quaternion algebra over F. Give another proof that 
m(a1, a2, a3)? = d(1, œ, œ2, a3) 


(cf. Brzezinski [Brz82, Lemma 1.1(a)]) for all œ; € B as follows: 


(a) Suppose B = Mo(F). Show that the matrix units 


_(o 1 mo _ (1 0 
e12 = |o 0’? ezn = \4 (0N e2 = |o 0 


span B/F, and m(e12, e21, en)” = d(l, e12, €21, €22). Conclude using Exer- 
cise 15.10. 
(b) Reduce to (a) in general by taking a splitting field for B. 


Suppose R = Rs) is a global ring with 2 € R*. Let K D F be a quadratic field 
extension and S$ C K an R-order. Let Ram(K) be the set of places of F that are 
ramified in K. Show that S is maximal if and only if its discriminant is equal to 


discr(S) = I] PER 
peRam(K)\S 
in analogy with Theorem 15.5.5. 
Let B be a finite-dimensional F-algebra with a standard involution. Compare 
det(trd(a;a;));,; with det(trd(a;a;));,; 


for a; € B, and show that defining the discriminant of an order O C B with 
respect to either pairing gives the same result. 
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> 14. Let B be a semisimple F-algebra with dimp B = n, let J be an R-lattice that is 


free over R with basis aj,...,@,, and let af, p at € B be the dual basis, so 


trd(a%a;) = 1,0 according as i = j or not. Show that the change of basis matrix 


from {a*}; to {aj}; is given by (trd(ai@;))i,;- 
15. Let J C B be an R-lattice in a separable algebra B. 


(a) If J C Bis an R-lattice with J C J, show that 7# 2 JË. 
(b) Show that for all B € BX, we have (B/)# = 1# 67! 


16. Let R be a DVR with maximal ideal p = xR, and let O := (; i for 
e > 0. Compute the codifferent codiff(O): in particular, show that codiff(O) is a 
principal two-sided O-ideal, and find a generator. Verify Lemma 15.6.17. 

17. Let R be a noetherian domain with F = Frac R. Let B be a central simple algebra 
over F. Let O C B be an R-order. We say O is Azumaya if O is R-simple, which 
is to say every two-sided ideal J C O is of the form aO = Oa witha = IARC R. 


(a) Show that O is Azumaya if and only if every R-algebra homomorphism 
O = A is either the zero map or injective. 

(b) Show that O is Azumaya if and only if O/mO is a central simple algebra 
over the field R/m for all maximal ideals m of R. 

(c) Suppose that B is a quaternion algebra. Show that the quaternion order O 
is Azumaya if and only if disc O = R. Conclude that the only Azumaya 
quaternion algebra over the valuation ring R of a local field is M2(R), and 
that the only Azumaya quaternion algebra over Z is M2(2). 


[See Auslander and Goldman [AG60] or Milne [Milne80, §IV.1].] 


18. Let G be a finite group of order n = #G and let R be a domain with F = Frac R. 
Suppose that char F { n. Then B := F[G] is a separable F-algebra by Exercise 
17.6: 


(a) Consider the algebra trace Trg)r and its associated bilinear form. Show 
that in the basis of F[G] given by the elements of G that the trace pairing 
is the scalar matrix n. 

(b) Now write B ~ Bı X--- X B, as a product of simple F'-algebras. Let K; 
be the center of B;, and let dimx, Bi = n. Show that Tr |g; = ni trd. Let 
O = R[G], and suppose that O = O; x--- X O,. Show that 


codiff(O) = nO, X +++ X n7 'O,. 
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Chapter 16 


Quaternion ideals and invertibility 


Much like a space can be understood by studying functions on that space, often the 
first task to understand a ring A is to understand the ideals of A and modules over A 
(in other words, to pursue “linear algebra’ over A). The ideals of a ring that are easiest 
to work with are the principal ideals—but not all ideals are principal, and various 
algebraic structures are built to understand the difference between these two. In this 
chapter, we consider these questions for the case where A is a quaternion order. 


16.1 > Quaternion ideals 


To get warmed up for the noncommutative situation, we consider ideals of quadratic 
rings. An integer d € Z is a discriminant if d = 0, 1 (mod 4). Let S be the quadratic 
order of nonsquare discriminant d € Z, namely, 


S = S(d) := Z @® Zi(d + Vd)/2] c K = Q(Va). 


The set of ideals of S has a natural multiplicative structure with identity element S 
(giving it the structure of a commutative monoid), but we lack inverses and we would 
surely feel more comfortable with a group structure. So we consider nonzero S-lattices 
a C K, and call them fractional ideals of S; equivalently, they are the S-submodules 
d-'a c K with a C S a nonzero ideal and d € Zso, hence the name fractional ideal 
(viz. 9.2.4). To get a group structure, we must restrict our attention to the invertible 
fractional ideals a c K, i.e., those such that there exists a fractional ideal b with 
ab = S. The simplest kind of invertible fractional ideals are the principal ones a = aS 
for a € K*, with inverse a“! = a7'S. If a fractional ideal a has an inverse then this 
inverse is unique, given by 


al={xeK:xaC S$}; 


and for a fractional ideal a, we always have aa”! C S (but equality may not hold). If 
S = Zx is the ring of integers (the maximal order) of K, then all nonzero fractional 
ideals of S are invertible—in fact, this property characterizes Dedekind domains, in 
that a noetherian commutative ring is a Dedekind domain if and only if every nonzero 
(prime) ideal is invertible. (See also the summary in section 9.2.) 
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A fractional ideal a of S is invertible if and only if a is locally principal, i.e., 
a @z Zip) = Up) = a(p)Z(p) is a principal fractional ideal of the localization Sp) 
for all primes p. Every locally principal ideal is invertible, and the extent to which 
the converse holds is something that arises in an important way more generally 
in algebraic geometry. In the language of commutative algebra, a locally principal 
S-module is equivalently a projective S-module of rank 1. 

Suppose that S is not maximal; then S(d) = Z + fZx for a unique integer f € Zs1, 
the conductor of S; accordingly, d = dg f 2 where dx € Z is the discriminant of Zg 
(a fundamental discriminant). In this case, there is always an ideal of S that is not 
invertible. Specifically, consider the ideal 


f= fZ+vVdZcs. (16.1.1) 
Then f is a free Z-module of rank 2 and 
P =(fZ+ZNdy = f°Z + fVdZ = ff 


so if f were invertible, then cancelling we would obtain f = fS, a contradiction. The 
source of this example is that f = fS(dx) since S(dx) = Z + VdxZ, so really this 
fractional ideal belongs to the maximal order S(dx), not to S. For more on the notion 
of invertibility for quadratic orders, see Cox [Cox89, §7], with further connections to 
quadratic forms and class numbers. 

We now turn to the quaternionic generalization, where noncommutativity presents 
some complications. Let B be a quaternion algebra over Q and let O C B be an order. 
To study ideals of O we must distinguish between left or right ideals, and the product of 
two (say) right O-ideals need not be again a right O-ideal! To address this, for lattices 
I, J C B, we say that J is compatible with J if the right order of J is equal to the left 
order of J, so that what comes between J and J in the product J - J “matches up”. 

A lattice 7 C B is right invertible if there exists a lattice Z’ C B such that 


Il’ =O.W) 


with a compatible product, and we call /’ a right inverse. We similarly define notions 
on the left, and we say J C B is invertible if there is a two-sided inverse I’ C B, so 


Il’ = OLD) = OR(T’) and T'I = OLI’) = OR(I) 


with both of these products are compatible. If a lattice 7 has a two-sided inverse, then 
this inverse is uniquely given by 


I! :={~@e€B:lal ch 


(defined so as to simultaneously take care of both left and right): we always have that 
IT! C O.(), but equality is needed for right invertibility, and the same on the left. 

Let O C B be an order. A left fractional O-ideal is a lattice Z C B such that 
O c OL(I); we similarly define on the right. For a maximal order, all lattices are 
invertible (Proposition 16.6.15(b)). 
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Proposition 16.1.2. Let O C B be a maximal order. Then a left or right fractional 
O-ideal is invertible. 


The simplest kind of invertible lattices are the principal lattices 
I = O.(a = aOr() 


with a € B*: its inverse is I7! = a !O_(1) = Og()a™!. 
The major task of this chapter will be to interrelate these notions in the quaternionic 
context. Let 
nrd(/) := gced({nrd(a) : a € I}, 


i.e., nrd(/) is a positive generator of the (finitely generated) subgroup of Q generated 
by nrd(q@) for œ € I. The main result over Q is the following theorem (Main Theorem 
16.7.7). 


Main Theorem 16.1.3. Let B be a quaternion algebra over Q and let I C B be an 
integral lattice. Then the following are equivalent: 


(i) I is locally principal, i.e., Ip) = I ®z Zip) is principal for all primes p; 
(ii) I is invertible; 

(iii) I is right invertible; 

(iii’) I is left invertible; 

(iv) nrd(D? = [OrU): I]; and 

(iv’) nrd(1)? = [OLW) : I]. 


Accordingly, for J integral, we may define the right absolute norm of J by 
NZ) := #(ORW)/D = [OrW) : I] € Z>1 


and similarly on the left; by Main Theorem 16.1.3 (iv) © (iv’), when J is locally 
principal, the left and right absolute norms coincide (called then just absolute norm) 
and are related to the reduced norm by N(/) = nrd(7 y. 


16.2 Locally principal, compatible lattices 


The simplest lattices to understand are those that are principal; but as we saw in 
section 9.4, lattices over Dedekind domains are inherently local in nature. We are led 
to consider the more general class of locally principal lattices. We work first with 
lattices, and later we will keep track of their left and right orders. 

Throughout this chapter, let R be a Dedekind domain with field of fractions F, let 
B be a finite-dimensional algebra over F, and let J C B be an R-lattice. 


Definition 16.2.1. / is principal if there exists œ € B such that 
I = O(a = aOR); 


we say that J is generated by a. 


260 CHAPTER 16. QUATERNION IDEALS AND INVERTIBILITY 


16.2.2. If I is generated by «œ € B, then since / is a lattice (Definition 9.3.1) we have 
IF = Boa = B, so a € B*. 


16.2.3. If I = OL (I)e, then OR(I) = a~!O_()a by Exercise 16.2, so 
I = (atO, (Da) = ORCI). 


Therefore it is sufficient to check for a one-sided generator (and if we defined the 
obvious notions of left principal or right principal, these would be equivalent to the 
notion of principal). 


The notion of principality naturally extends locally. 


Definition 16.2.4. An R-lattice J is locally principal if Ip) = Z 8r Rgp) is a principal 
Rp)-lattice for all primes p of R. 


Now let J, J be R-lattices in B. We define the product ZJ to be the R-submodule of 
B generated by the set 
faB:ael, Be J}. 


The product 7J is an R-lattice: it is finitely generated as this is true of J, J individually, 
and there exists a nonzero r € I (Exercise 9.2) so rJ C IJ and thus 


B=FJ=F(rJ)CFIJ 


so equality holds. 
When multiplication of two lattices matches up their respective left and right orders, 
we give it a name. 


Definition 16.2.5. We say that 7 is compatible with J if Og) = OL(J). 


We will also sometimes just say that the product ZJ is compatible to mean that / is 
compatible with J. The relation “is compatible with” is in general neither symmetric 
nor transitive. 


16.2.6. I has the structure of a right Op(/)-module and J the structure of a left OL (J)- 
module. When Or(/) = OL(J) = O, that is, when J is compatible with J, it makes 
sense to consider the tensor product J &ọ J as an R-module. The multiplication map 
B &g B — B defined by a 8 B |> af restricts to give an isomorphism J 8o J > IJ 
as R-lattices. In this way, multiplication of compatible lattices can be thought of as a 
special case of the tensor product of modules. 


We conclude this section with several other basic properties of lattices. 
Definition 16.2.7. An R-lattice / is integral if 7? C 7. 

In Definition 16.2.7, the product need not be compatible. 
Lemma 16.2.8. Let I be an R-lattice. Then the following are equivalent: 


(i) I is integral; 
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(ii) For all a, B € I, we have ap € I; 

Gii) Z C OLW/, so I is a left ideal of OL (D) in the usual sense; 
Gii’) I C On); and 

(iv) © OLM NA Or). 


If I is integral, then every element of I is integral over R. 


Proof. The equivalence (i) © (ii) follows immediately. For (i) © (iii), we have ZI C I 
if and only if Z € O_() by definition of O_(/), and the same argument gives (i) © 
Gii’), and this then gives (i) © (iv). The final statement follows from Lemma 10.3.2. 

o 


In light of Lemma 16.2.8, we need not define notions of left integral or right 
integral. 

For an R-lattice 7, there exists nonzero d € R such that dI is integral, so every 
R-lattice I = (dI)/d is fractional in the sense that it is obtained from an integral lattice 
with denominator. 


Definition 16.2.9. Let O C B be an R-order. A left fractional O -ideal is a lattice 
I C B such that O € O_(J); similarly on the right. 

If O, O’ C B are R-orders, then a fractional O, O’ -ideal is a lattice 7 that is a left 
fractional O-ideal and a right fractional O’-ideal. 


Remark 16.2.10. A left ideal J C O in the usual sense is an integral left O-ideal 
in the sense of Definition 16.2.9 if and only if IF = B, i.e., I is a (full) R-lattice. 
(Same for right and two-sided ideals.) If J is nonzero and B is a division algebra, then 
automatically J is full and the two notions coincide. 

Indeed, suppose J € O is a left ideal of O (in the usual sense). Then O € O_(J) so 
in particular 7 has the structure of an R-module, and since O is finitely generated as 
an R-module and R is noetherian, it follows that J is finitely generated. Consequently, 
a left ideal J C O is a left fractional O-ideal if and only if IF = B. 


Definition 16.2.11. Let / be a left fractional O-ideal. We say that / is sated (as a left 
fractional O-ideal) if O = O(I). We make a similar definition on the right and for 
two-sided ideals. 


Example 16.2.12. By Lemma 15.6.16, codiff(O) is a two-sided sated O-ideal. 


Remark 16.2.13. Our notion of sated is sometimes called proper: we do not use this 
already overloaded term, as it conflicts with the notion of a proper subset. 


16.3 Reduced norms 


Next, we extend the reduced norm to lattices; see also Reiner [Rei2003, §24]. To this 
end, in this section we suppose that B is semisimple. 


Definition 16.3.1. The reduced norm nrd(/) of J is the R-submodule of F generated 
by the set {nrd(@) : a € J}. 
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Lemma 16.3.2. The reduced norm nrd(J) is a fractional ideal of F: i.e., it is finitely 
generated as an R-module. 


Proof. We first give a proof when B has a standard involution, and nrd is a quadratic 
form. Since J is an R-lattice we have JF = B; since nrd(B) + {0}, we have nrd(/) + {0}. 
And J is generated by finitely many a; as an R-module; the R-module nrd(J/) is then 
generated by the values aj; = nrd(q;) and a;; = nrd(q; + œj) — nrd(q;) — nrd(q;), since 
then 

nrd(X;ciai) = Di jaijCiCj & Dij Raij 
for all c; € R. 

Now for the general case. Replacing J by rI with r € R nonzero, we may suppose 
that J is integral, and hence nrd(J) C R. Since J is a lattice, there exists r € INR 
with r + 0. For all p such that ordy(r) = 0, we have 1 € Kp) so nrd(lg)) = Rip). 
For each of the finitely many primes p that remain, we choose an element œ € J such 
that ord,(nrd(q@)) is minimal; then nrd(q@) generates nrd(I(p)), and by the local-global 
dictionary, these finitely many elements generate nrd(/). o 


16.3.3. For a prime p of R we have nrd(/)(p) = nrd(/(p)), so by the local-global property 
of lattices (Lemma 9.4.6), 


nrd(I) = (| ard(Do) = (| nrag). (16.3.4) 
p 


p 


16.3.5. If / is a principal R-lattice generated by œ € I then nrd(/) = nrd(@)R; more 
generally, if J is an R-lattice and a € B* then nrd(a@J) = nrd(q) nrd(/) (Exercise 16.4). 


Now suppose that Z, J are lattices. Then nrd(/J) 2 nrd(/) nrd( J). However, we need 
not have equality, as the following example indicates. 


Example 16.3.6. It is not always true that nrd(7J) = nrd(/) nrd(J). For example, if 
‘ . ; _ {aR R _ (aR aR . 
a € Ris neither zero nor a unit, then J = k R J and J = R R ) are R-lattices 
in M2 (F) with nrd(7) = nrd(J) = aR but IJ = M2(R) and so nrd(JJ) = R. 
We have Or(J) = M2(R) = O_(/), so J is compatible with J, and nrd(JI) = a? R = 
nrd(J) nrd(/); but 


R a'R 


ORC) = & R 


) and ou =( r me 


a'R R 
so I is not compatible with J. 


The issue present in Example 16.3.6 is that the product is not as well-behaved 
for noncommutative rings as for commutative rings; we need the elements coming 
between J and J to match up. 


Lemma 16.3.7. Suppose that I is compatible with J and that either I or J is locally 
principal. Then nrd(IJ) = nrd(Z) nrd(J). 
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Proof. By the local-global property for norms (16.3.4) and since localization com- 
mutes with multiplication, i.e., 


(ab)(p) = Ap) Pep) for all (finitely generated) R-modules a, b C F, 


we may localize and suppose that either 7 or J is principal. Suppose J is (right) 
principal. Then J = œO for some a € B where O = Or(/) = OL(J). Then 


IJ =(a@O)J = a(OJ) = aJ 
and so nrd(JJ) = nrd(q@) nrd(J) = nrd(Z) nrd(J) by 16.3.5. The case where J is principal 


follows in the same way. o 


Principal lattices are characterized by reduced norms, as follows. 


Lemma 16.3.8. Let I be locally principal and let a € I. Then a@ generates I if and only 
if nrd(œ)R = nrd(J). 


Proof. If I = œO then nrd(/) = nrd(a@)R by Lemma 16.3.7. 

For the converse, let O = Org(/). We want to show that J = «O, and we know that 
I 2 aO. To prove that equality holds, it suffices to show this locally, so we may suppose 
that J = BO. Then a = By with u € O, and nrd(@) = nrd(Gy) = nrd(Z) nrd(w). By 
hypothesis, nrd(w) € R*, and thus u € O*, so BO = aO. o 


16.4 Algebra and absolute norm 


The reduced norm of an ideal is related to its algebra norm, as follows. We continue to 
suppose that B is semisimple, so the definitions of left and right norm coincide. 


Definition 16.4.1. The (algebra) norm Nmajr(J) of J is the R-submodule of F gen- 
erated by the set {Nmgjr(q@) : a € J}. 


Remark 16.4.2. The definition of algebra norm by necessity depends on the choice of 
domain R; indeed, J is an R-lattice. 


Proposition 16.4.3. The following are equivalent: 


(i) Z is locally principal; 
(ii) Nmgir (1) = [OL : Tr; and 
(iii) Nmgjr(/) = [Or(/): Zr. 


If B is simple with dimp B = n?, then these are further equivalent to 


(iv) nrd(J)” = [O_C) : T]R. 
(v) nrd(Z)" = [Or() : Tr. 
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Proof. Let O = O.(). Let œ € J. Right multiplication by «œ gives an R-module 
isomorphism O — Oa (change of basis between two free R-modules), so by Lemma 
9.6.4 we have [O : Oa] = det(a)R = Nmgjr(a)&, thinking of a € Endr (B). 

We now prove (i) © (ii). We may suppose R is local, so R is a DVR. For all a € J, 
the following equality holds: 


[O s Nr : Oa]r = [O s Oa]r = Nmgjr(@)R. (16.4.4) 


To show (i) > (ii), if J = Oa then Nmgjr (I) = Nmgjr(a)R and so by cancelling 
[Z : Oa]r = R in (16.4.4) we obtain (ii). To show (ii) > (i), suppose that Nmgjr(/) = 
[O : I]r. Let œ € I be such that Nmgjr(@) has minimal valuation; then Nmajr(a) 
generates Nmgjr(/). By (16.4.4), cancelling on both sides [J : Oa]r = R, and since 
Oa C I we conclude J = Oa. A similar argument holds on the right, proving (i) © 
(iii). Finally, (iii) = (iv) since Nmgjr(@) = nrd(@)”, and the same on the right. o 


16.4.5. Recalling the proof of Proposition 16.4.3 and the definition of R-index, we 
always have the containment 
Nmgjr(/) 2 [OL(D : Zr 


and the same on the right; by Proposition 16.4.3, equality is equivalent to J being 
locally principal. 


To conclude this section, we suppose for its remainder that F is a local with 
valuation ring R or a global number field. Then the reduced norm is also related to the 
absolute norm, an absolute measure of size, as follows. 


16.4.6. For a fractional ideal a of R, we define the absolute norm N(a) to be 
N(a) := [R : a]z < œ (16.4.7) 
the index taken as abelian groups, recalling Example 9.6.6. Then 
N(a) = |Nmr Q0) 


and if a C R then 
N(a) = #(R/a), 


so this norm is also called the counting norm. 
We extend this definition to elements a € F* by defining N(a) := N(aR). 


16.4.8. Similarly, if IZ C B is a locally principal R-lattice, we define the absolute norm 
of I to be 
NCD) := [OLM) : Mz = lOrW): Iz, (16.4.9) 


the latter equality by taking R = Z in Proposition 16.4.3. If I is integral then 
NZ) = #OL(D)/D = #ORW)/D. 
The absolute norm of J is compatible with the absolute norm on R via 


NC) = [Or() : Mz = [R : Nmgr()]z = NNmzgir (D); 
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and if B is simple with dimp B = n? then 
N(Z) = NWNmair()) = N(nrd(/))”. (16.4.10) 


Remark 16.4.11. The absolute norm may also be defined for a global function field, 
but there is no canonical ‘ring of integers’ as above. 


16.5 Invertible lattices 


We are now in a position to investigate the class of invertible lattices. Let Z C B be an 
R-lattice. 


Definition 16.5.1. / is invertible if there exists an R-lattice 7” C B that is a (two-sided) 
inverse to J, i.e. 


IT’ = OL(D) = Or’) and FI = OL’) = OrV). (16.5.2) 
In particular, both of the products in (16.5.2) are compatible. 


16.5.3. If I, J are invertible lattices and J is compatible with J, then ZJ is invertible 
(Exercise 16.10). 


16.5.4. If I is a principal lattice, then Z is invertible: if Z = Oa with a € B* and 
O = OL(I), then I’ = a~!O has 


IT = (Oa)(a7'0) = O(aa™')O = OO = O 
so I’ is a right inverse, and 
T'I = (a 'O)(Oa) = a7!Oa = Op() 
so I’ is also a left inverse. 


A candidate for the inverse presents itself quite naturally. If I” = O(I) and 
II = Or(), then IFI = J. 


Definition 16.5.5. We define the quasi-inverse of Z as 
I! :={@e€B:lal ch. (16.5.6) 
Lemma 16.5.7. The following statements hold. 
(a) The quasi-inverse I`! is an R-lattice and 
7 el, 
(b) If O is an R-order, then O`! = O. 


Proof. Statement (a) follows as in the proof of Lemma 10.2.7, and the inclusion is by 
the definition of /~'. For statement (b), if a € O, then OaO C O since O is an order; 
conversely, if O@O C O, then taking 1 € O on left and right we conclude œ € O. O 
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We now consider the quasi-inverse as an inverse. 
Proposition 16.5.8. The following are equivalent: 


(i) I"! is a (two-sided) inverse for I; 
(ii) ITH = OC) and I! = OL); 
(iii) Z is invertible; 
(iv) There is a compatible product II~'I = I and both 1 € IT" and 1 € T'I. 


Proof. The implication (i) > (ii) is clear. For (ii) > (i), we need to check the compat- 
ibility of the product: but since I7! = Og(J) we have OL(I7!) c Og(J), and from the 
other direction we have the other containment, so these are equal. 

The implication (i) > (iii) is clear. For (iii) = (i), suppose that 7’ is an inverse to 
I. Then I = ITI so I’ C I"! by definition. Therefore J c IJ~'T C I and equality holds 
throughout. Multiplying by 7’ on the left and right then gives 


rD! = (IDOI) =I =f. 


Again the implication (i) => (iv) is immediate. To prove (iv) => (ii), we need to 
show that ZI! = O,(J) and I!I = Op(J); we show the former. By compatibility, 
Ord!) = OL(D) = O. If II! = J then J = IIT! = OUT"')O = OJO, so J C Oisa 
two-sided ideal of O containing 1 hence J = O. o 


Invertibility is a local property, as one might expect. 
Lemma 16.5.9. J is invertible if and only Ig) is invertible for all primes p. 
Proof. We employ Proposition 16.5.8(iv): We have IJ~'I = I if and only if 


-1 -1 
UT D) = I Npp) = Lp) 


for all primes p and e.g. 1 € JI~! if and only if 1 € Toly o 
Corollary 16.5.10. Jf I is locally principal, then I is invertible. 
Proof. Combine 16.5.4 with Lemma 16.5.9. m 


A compatible product with an invertible lattice respects taking left (and right) 
orders, as follows. 


Lemma 16.5.11. Jf I is compatible with J and J is invertible, then OL (IJ) = OL (D). 


Proof. We always have OL (I) € O_(/J) (even without J invertible). To show the other 
containment, suppose that a € OL(IJ), so that @IJ © IJ. Multiplying by J7}, we 
conclude aJ C J anda € OL(D). (m 


Finally, not every lattice is invertible, and it is helpful to have counterexamples at 
hand (see also Exercise 16.12). 
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Example 16.5.12. Let p € Z be prime. Let B := (22) and 


O := Z 6 pZi © pZj ® Zij 
I:= p’Z@ZieZj eZij. 
Then O C B is an order and O,(/) = Or(/) = O. We compute that 


I! =pZeZieoZj eZij (16.5.13) 
and 
OLI!) = Og") = Z + Zi+ Zj + Zij =Z+ 50% (16.5.14) 
so in the product 
I! =I = pZo@ pZie pZj eZij CO (16.5.15) 


we see J is not invertible and the product is not compatible. 
Seen a different way, we have J = J and in the compatible product 


P =TI =11 = pZ pZi  pZj @Zij (16.5.16) 
we have i, j € O,(I*) = Og(/’) but i, j ¢ O; therefore, I is not invertible by Lemma 
16.5.11. Indeed, 

1 1 
OL) = Og”) = -P =Z + O. 
Pp P 


Finally, it will convenient to consider invertibility in the context of ideals, labelling 
left and right orders as follows. 


Definition 16.5.17. Let O, O’ C B be R-orders and let J be a fractional O, O’-ideal. 
We say Z is invertible if J is invertible as a lattice and 7 is sated (i.e., O = O_(/) and 
O’ = ORCI). 


16.5.18. The condition that / is sated in Definition 16.5.17 is important: we must be 
careful to work over left and right orders and not some smaller order. Indeed, if J is 
invertible as an R-lattice then it is invertible as a fractional O_(J), Or(/)-ideal, but not 
for any strictly smaller orders. If 7’ is an R-lattice and IJ’ = O for some O € O_(J), 
then multiplying on both sides on the left by OL(/) gives 


and the same on the right. In other words, if we are going to call out an invertible 
fractional ideal by labelling actions on left and right, then we require these labels to be 
the actual orders that make the inverse work. 


Remark 16.5.19. Equation 16.1.1 suggested the ‘real issue’ with noninvertible modules 
for quadratic orders: as an abelian group, 


f=fZ+ fvdZ = f - S(d), 
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so f is principal and hence certainly invertible as an ideal of S(d)—but not as an ideal 
of the smaller order S(d). More generally, if a c K = Qvd) is a lattice in K (free 
Z-module of rank 2), we define its multiplicator ring as 


S(a):= {xe K:xaCa}; 


the ring S(a) is an order of K and so is also called the order of a. In the example above, 
S(f) = Sf(Z+VdxZ)) = S(dx) 2 S(d). It turns out that every lattice in K is invertible 
as an ideal of its multiplicator ring [Cox89, Proposition 7.4], and this statement plays 
an important role in the theory of complex multiplication. (Sometimes, an ideal a C S 
is called proper or regular if S = S(a); both terms are overloaded in mathematics, so 
we will mostly resist this notion.) 

Unfortunately, unlike the quadratic case, not every lattice J C B is projective as 
a left module over its left order (or the same on the right): this is necessary, but not 
sufficient. In Chapter 24, we classify orders O with the property that every lattice J 
having O_(/) = O is projective as an O-module: they are the Gorenstein orders. 


Remark 16.5.20. Invertible lattices give rise to a Morita equivalence between their 
corresponding left and right orders: see Remark 7.2.20. 


16.6 Invertibility with a standard involution 


In section 16.6, we follow Kaplansky [Kap69], considering invertibility in the presence 
of a standard involution. The main result of this chapter is as follows. 


Main Theorem 16.6.1. Let R be a Dedekind domain with field of fractions F, and let 
B be a finite-dimensional F -algebra with a standard involution. Then an R-lattice I is 
invertible if and only if I is locally principal. 


Remark 16.6.2. We can relax the hypothesis that R is a Dedekind domain and 
instead work with a Priifer domain, a generalization of Dedekind domains to the 
non-noetherian context. 


We have already seen (Corollary 16.5.10) that the implication (=) in Main Theorem 
16.6.1 holds without the hypothesis of a standard involution; the reverse implication 
is the topic of this section. This implication is not in general true if this hypothesis is 
removed (but is true again when B is commutative); see Exercise 16.18(a). 


Remark 16.6.3. The provenance of the hypothesis that R is a Dedekind domain is the 
following: if a C R is not invertible as an R-module, and O C B is an R-order, then 
aO is not invertible as an R-lattice. To make the simplest kind of arguments here, we 
would like for all (nonzero) ideals a C R to be invertible, and this is equivalent to the 
requirement that R is a Dedekind domain (see section 9.2). 


Throughout this section, let R be a Dedekind domain with field of fractions F, 
let B be a finite-dimensional F-algebra, and let J C B be an R-lattice. The following 
concept will be useful in this section. 


Definition 16.6.4. We say J is a semi-order if 1 € J and nrd(/) € R. 
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(For a semi-order J, we necessarily have nrd(J) = R since 1 € Z.) 


Lemma 16.6.5. An R-lattice I is a semi-order if and only if 1 € I and every a € I is 
integral over R. 


Proof. We have that a € I is integral over R if and only if trd(a@) € R and nrd(@) € R 
(by Corollary 10.3.6, since R is integrally closed) if and only if nrd(a@) € R and 
nrd(@ + 1) = nrd(a@) + trd(a) + 1 € R. o 


In particular, Lemma 16.6.5 implies that an order is a semi-order (by Corollary 
10.3.3); we will see that semi-orders behave enough like orders that we can deduce 
local principality from their structure. 


16.6.6. Let T := {Œ : æ € J}. Then T is an R-lattice in B. If I, J are R-lattices then 
IJ = J I (even if this product is not compatible). 

If I is a semi-order, then J = J (Exercise 16.15). In particular, if O is an R-order 
then O = O. 


Lemma 16.6.7. We have OL(I) = Og(1) and Og(1) = OL (D). 


Proof. We have «œ € O,(/) if and only if œI C I if and only if af = Tœ C T if and only 
if @ € O(/) if and only if a € ORT) = ORD). o 


Corollary 16.6.8. [f I is a semi-order, then OL (I) = Or). 
Proof. Apply Lemma 16.6.7 with T = I. o 


By Lemma 16.6.7, the standard involution gives a bijection between the set of 
lattices J with OL (7) = O and the set of lattices with Or(J) = O. 


16.6.9. Suppose that R is a DVR (e.g., a localization of R at a prime ideal p). We will 
show how to reduce the proof of Main Theorem 16.6.1 to that of a semi-order. 

Since R is a DVR, the fractional R-ideal nrd(7) C R is principal, generated by an 
element with minimal valuation: let œ € J achieve this minimum reduced norm. Then 
the R-lattice J = a~'J now satisfies 1 € J and nrd(J) = R. Thus J is a semi-order, and 
J is (locally) principal if and only if Z is (locally) principal. 


16.6.1 
Proof of Main Theorem. The proof is due to Kaplansky [Kap69, Theorem 2]. The 


statement is local; localizing, we may suppose R is a DVR. By 16.6.9, we reduce to the 
case where / is a semi-order. In particular, we have | € Z. Let aj, . . ., Œn be an R-basis 
for I. 
We claim that 
P+ = (16.6.10) 


Since 1 € Z, we have J” C J"*!. It suffices then to prove that a product of n + 1 basis 
elements of J lies in Z”. By the pigeonhole principle, there must be a repeated term a; 
among them. We recall the formula (4.2.16) 


aB + Ba = trd(B)a + trd(a)B — trd(@B) (16.6.11) 
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for all a, 8 € B. We can use this relation to “push” the second instance of the repeated 
element until it meets with its mate, at the expense of terms lying in J”. More precisely, 
in the R-module /”/T, by (16.6.11), 
aja; =—a;a; (mod T) 
for all i, j; it follows that in J"*!/7", 
H(a;a;)v = —u(aj;a;)v (mod I”) 

for all u, v appropriate products of basis elements. Therefore we may suppose that 
the repetition a? is adjacent; but then a; satisfies a quadratic equation and a? = 
trd(a;)a; — nrd(a;) € I, so in fact the product belongs to 7”, and the claim follows. 

Now suppose J is invertible; we wish to show that J is principal. From the equality 
I"*! = J", we multiply both sides of this equation by (J~!)” and obtain J = O = 
OL(I) = Or(/). In particular, / is principal, generated by 1. o 


The above proof has the following immediate corollary. 


Corollary 16.6.12. An R-lattice I is an R-order if and only if 1 € I, every element of 
I is integral, and I is invertible. In particular, an invertible semi-order is an order. 


We conclude with two consequences. 


16.6.13. Let J, J be invertible R-lattices such that J is compatible with J. Then 
nrd(IJ) = nrd(/) nrd(J), since it is enough to check this locally, and locally both 7 
and J are principal and we have proved the statement in this case (Lemma 16.3.7). 


16.6.14. In the presence of a standard involution, we can write the inverse in another 
way: if J is invertible, then 
TI =nrd(1)Og(1) and TI = nrd(1)O,(/) 


by checking these statements locally (where they follow immediately by computing 
the norm on a local generator). Since nrd(J) is a fractional R-ideal and thus invertible 
(R is a Dedekind domain), it follows that if J is invertible, then 


I! = Inra}. 
In view of 16.6.14, the following important proposition is natural. 


Proposition 16.6.15. Let B be a quaternion algebra over F and let I C B be an 
R-lattice. Then the following statements hold. 


(a) We have II = nrd(I)O, where O C B an R-order satisfying OL(I) © O, and 
similarly TI = nrd(1)O’ with Og(1) € O’. 

(b) If either O (I) or Og() is maximal, then I is invertible, and both O, (I) and 
Or() are maximal. 
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Proof. We follow Kaplansky [Kap69, Theorems 6-7]. We again may suppose R is a 
DVR and 7 is a semi-order, so 1 € J and nrd(/) = R; and T = 7. 

First we prove (a). We need to show that 7 ? is an order. We showed in (16.6.10) 
(without extra hypothesis) that /* = 74; with B a quaternion algebra, we will improve 
this to J? = I, whence (J”)? = I* = I° and consequently 7° is closed under multipli- 
cation and hence an R-order. 

Let J = I’; then J? = (PY = I6 = P = J, so J is an R-order. 

Let p be the maximal ideal of R and consider the 4-dimensional algebra J/pJ 
over k = R/p. Then I/pI © J/pJ is a k-subspace containing 1. If (1/pl)* = I/pI, 
then by dimensions we contradict (I/pl)° = J/pJ; therefore (I/pl)* 2 I/pl. If 
dim;,(//pl) < 2, then I/p/ is a proper k-subalgebra, impossible. Thus dim, (//pJ) 2 3 
and dim,(//pl)” > 4, and so (I/pl)* = J/pJ. By Nakayama’s lemma, it follows that 
P = J = P. The containments follow directly, e.g. O.(J) € OLUT = OL(O) = O. 

For part (b), applying part (a) we have O = O_(/) by maximality and the same on 
the right. But since J is a semi-order, from Corollary 16.6.8 we have IT = O,(1) = 
OR(D) = IT so by definition, 7 has inverse T. o 


16.7 One-sided invertibility 


In this section, we pause to consider one-sided notions of invertibility. We refresh 
our notation, recalling that R is a Dedekind domain with F = Frac R and B is a 
finite-dimensional algebra over F with J C B an R-lattice. 


Definition 16.7.1. / is right invertible if there exists an R-lattice /’ C B, a right 
inverse, such that the product //’ is compatible and JJ’ = O,(J/). 

A right fractional O-ideal / is right invertible if / is right invertible and sated (viz. 
16.5.18). 


We similarly define left invertible and left inverse. Applying the same reasoning 
as in Lemma 16.5.9, we see that one-sided invertibility is a local property. 


Remark 16.7.2. For rings, the (left or) right inverse of an element need not be unique 
even though a two-sided inverse is necessarily unique. Similarly, left invertibility does 
not imply right invertibility for lattices in general, and so the one-sided notions can be 
a bit slippery: see Exercise 16.18(b). 


Remark 16.7.3. The compatibility condition in invertibility is important to avoid 
trivialities. Consider Example 16.3.6: we have IJ = M2(R) = O_(J), and if we let 
J= 6 a for any nonzero b € R, the equality IJ = M»(R) remains true. Not 


every author requires compatibility in the definition of (sided) invertibility. 


A natural candidate for the right inverse presents itself: if I” = O(I), then J’ 
maps J into O_(/) on the right. We recall the definition of the colon lattices (Definition 
15.6.11). Let 1’ := (OC) : Dp. Then II’ C O_() by definition; however, in general 
equality need not hold and the product need not be compatible. Similarly, since ZI! C 
I we have IJ“! C O,(J), but again equality need not hold. 

The sided version of Proposition 16.5.8 also holds. 
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Proposition 16.7.4. The following are equivalent: 


(i) I~! is a right inverse for I; 
(ii) I is right invertible; 
(iii) There is a compatible product III = I and 1 € IT}. 


Similar equivalences hold on the left. 


Proof. This is just a sided restriction of the proof of Proposition 16.5.8. For example, 
to show (ii) > (i), we always have IIIc Iso ID COM; if I’ isa right inverse to 
I, then II’J = OL(DI = I and I’ c I~}, and therefore II! D IJ’ = OL(I). Therefore 
II! = O(I) and I! isa right inverse for I. o 


Returning to the setting of the previous section, however, we can show that the 
one-sided notions of invertibility are equivalent to the two-sided notion. 


Lemma 16.7.5. Suppose B has a standard involution. Then an R-lattice I is left invert- 
ible if and only if I is right invertible if and only if I is invertible. 


Proof. We will show that if Z is right invertible then Z is left invertible; the other 
implications follow similarly. By localizing, we reduce to the case where R isa DVR. By 
the results of 16.6.9, we may suppose that 7 is a semi-order, so that OL (D) = Og) = O 
and J = T. Suppose IJ’ = O. Then T'I = O = O, and I’ is compatible with J since 


O = Or) = OLU’) = ORU’) 


as desired. oO 


Corollary 16.7.6. Suppose R is a Dedekind domain and that B has a standard 
involution. Then an R-lattice I is right invertible with II’ = O (I) if and only if 
I = (0D) : De = D !. 


A similar statement holds for the left inverse; in particular, this shows that a right 
inverse is necessarily unique. 


Proof. The implication (=) is immediate, so we prove (4). Let O = O_(/). Then 
O= CKO: DRrCcO 


so equality must hold, and ZI’ = 1(O : Dr. By 16.7.5, I is invertible, and multiplying 
both sides by J! gives I’ = (O : Dp. Oo 


We collect the results of this section in the following theorem. 


Main Theorem 16.7.7. Let R be a Dedekind domain with F = Frac R, let B be a 
quaternion algebra over F, and let I C B be an R-lattice. Then the following are 
equivalent: 


(i) Z is locally principal; 
(ii) I is invertible; 
(iii) Z is left invertible; 
Gii’) I is right invertible; 
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(iv) nrd(1)? = [O.() : Ir; and 
(iv’) nrd(D? = [Og() : Tr. 
Proof. Main Theorem 16.6.1 proves (i) © (ii). For the equivalence (ii) © (iii) © 


Gii’), apply Lemma 16.7.5. Finally, the equivalence (i) © (iv) © (iv’) is supplied by 
Proposition 16.4.3. o 


16.8 Invertibility and the codifferent 
To conclude this chapter, we pick up a remaining thread concerning the (co)different. 


Definition 16.8.1. We define the different of O to be the quasi-inverse of the codiffer- 
ent: 

diff(O) := codiff(O)"! = {a € B : Ofa0# c OF}. 
Lemma 16.8.2. The different diff(O) is an integral two-sided O-ideal. 
Proof. By Lemma 15.6.16, we have OO#O = OË and so if a € diff(O) then 
O#(Oa@0)O# = OfaO# = OF and diff(O) is a two-sided O-ideal. To prove that 
diff(O) c O, referring to Lemma 15.6.2, starting with OfaO# c OË taking 1 € OF 


we have aO# c OË so (aO#)# = (Of Fag! 2 (OME. By Lemma 15.6.5, we have 
(O#)* = O, so Oa~! 2 O, so Oa C O and again taking 1 we get a € O. o 


16.8.3. If codiff(O) is locally principal (section 16.2), then so is diff(O), and by 
Proposition 16.4.3 we have 


Nmap) (diff(O)) = [O : diff(O)]r = [codiff(O) : O]r = disc(O); 
so when further B is a quaternion algebra, we have 
nrd(diff(O)) = discrd(O). (16.8.4) 
Invertibility of ideals is detected by the (co)different [Fad65, Proposition 24.1]. 


Proposition 16.8.5. Zf codiff(O) is right invertible, then all sated left fractional O- 
ideals are right invertible. Similarly, if codiff(O) is left invertible, then all sated right 
fractional O-ideals are left invertible. 


Proof. To get started, we refresh a few things: by Corollary 15.6.13, we have (ZI hi = 
OL(I) = O. The product IY is compatible by Proposition 15.6.6. By Lemma 15.6.5 
we have IJ = OË = codiff(O). 

Now by hypothesis of invertibility, O#(O*)-! = O,(O#) = O is a compatible 
product. Therefore the product 7 HOR! is compatible, and 


1(1#(08)-!) = aIo = OF (OF)! = O. (16.8.6) 
A similar argument holds on the right. o 


We have the following corollary of Proposition 16.8.5, phrased in terms of the 
different. 
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Corollary 16.8.7. Suppose that B has a standard involution. Then the following are 
equivalent: 


(i) codiff(O) is invertible; 
(ii) diff(O) is invertible; 
(iii) All sated left fractional O-ideals I are invertible, with inverse r! =]} diff(O); 
and 
Gii’) All sated right fractional O-ideals I are invertible, with inverse I~! = diff(O)Ë. 


Proof. Combine Proposition 16.8.5 and (16.8.6) with Lemma 16.7.5 and Corollary 
16.7.6. o 


We conclude with a criterion to determine invertibility; it is not used in the sequel. 


Proposition 16.8.8. (Brandt’s invertibility criterion). Let I C B be an R-lattice. Then 
I is invertible if and only if 


nrd(7#) discrd(7) € nrd(J). 
Proof. See Kaplansky [Kap69, Theorem 10] or Brzezinski [Brz82, Theorem 3.4]. o 


Exercises 


Unless otherwise specified, throughout these exercises let R be a Dedekind domain 
with field of fractions F, let B be a finite-dimensional F-algebra, and let J C B be an 
R-lattice. 


1. Let d € Z be a nonsquare discriminant, and let S(d) = Z[(d + Vd)/2] be the 
quadratic ring of discriminant d. 
(a) Suppose that d = dx f? with f > 1. Show that the ideal (f, Vd) of S(d) is 
not invertible. 
(b) Consider d = —12, and S = S(-12) = Z[-V—3]. Show that every invertible 
ideal of S is principal (so S has class number 1), but that S is not a PID. 


>2. Show that if J = OL (I)a with a € B*, then OR(I) = a !OL(Da. 

>3. Show that if J is an R-lattice in B and u € B*, then uJ = J if and only if 
H E OLJ). 

>4. Show that if a € B then nrd(a/) = nrd(q@) nrd(/). Conclude that if I is a principal 
R-lattice, generated by a € Z, then nrd(/) = nrd(q@)R. 


5. Let a1,...,@, generate J as an R-module. Give an explicit example where nrd(/) 
is not generated by nrd(a;) (cf. Lemma 16.3.2). Moreover, show that for an R- 
lattice J, there exists a set of R-module generators œ; such that nrd(/) is in fact 
generated by nrd(q;). 


>6. Suppose that R is a Dedekind domain, and let O C B be an R-order. Let I be 
a locally principal right fractional O-ideal. Show that J can be generated as a 
right O-ideal by two elements, and in fact for a € nrd(/) nonzero we can write 
I = aO + BO with £ € B*. 
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>10. 


11. 


12. 


>13. 


14. 


>15. 


16. 


. Let F be a number field, let R C F be a (Z-)order, and let a C R be a nonzero 


ideal. Show that a is projective as an R-module if and only if a is invertible if 
and only if a is locally principal. [These are all automatic when R is a Dedekind 
domain 9.4.5.] 


. If I,J C B are R-lattices with J C J, is it true that Z7! 2 J71? 
. Let Z, J, K C B be R-lattices. Show that 


(2: Ji: Kx =(M:K)R: J). 


Let I, J C B be R-lattices and suppose that J is compatible with J. Show that ZJ 
is invertible (with (ZJ)! = J~'I~!) if both J, J are invertible, but the converse 
need not hold. 


Let I, J C B be R-lattices, and suppose that J is invertible. Show that (7: J). = 
IJ! and (I : Dg = J. 
Let p be prime, let B = (p, p | Q), and let O := Ziti, j) = Z® Zi © Zj @® Zij. 

(a) Let I = {a € O : p | nrd(qa)}. Show that J = pZ ẹ Zi ® Zj ® Zij. 


(b) Show 7 = Oi + Oj, that O is a two-sided O-ideal, and that [O : 7] = p. 
(c) Show that Ip) # Opa for all œ € I). [Hint: show that if a € I, then 


p° | [O : Oa].] 
(d) Compute that O (D = Z + Zi + Zj + Z(ij/p) 2 O, and that J = O, (Di = 
OLD). 


(e) Compute codiff(O) and diff(O) and show they are invertible. 
[Compare Lemurell [Lem2011, Remark 6.4].] 


Let K be a separable quadratic field extension of F and let Z C K be an R-lattice. 
Let O = OL) = Or(U). 


(a) Show that IT=11= nrd(1)O. [Hint: argue as in Proposition 16.6.15.] 
(b) Conclude that J is invertible as a O-module. 


Show that if 7, J C B are locally principal (hence invertible) R-lattices, then 
U: J]r = [J! : TYR. 


Let B be an F-algebra with a standard involution ~. Show that if / is a semi-order 
then J = I. 


Let R be a Dedekind domain with field of fractions F, let K > F be a separable 
quadratic field extension and let S be an R-order in K. Let Sx be the integral 
closure of R in K. 


(a) Show that there exists a (unique) ideal f = f(S) c Sx (called the conductor) 
such that S = R + fSx. 

(b) Now let b c K be a fractional S-ideal. Show that the following are equiva- 
lent: 


(i) 6 is a locally principal S-ideal; 
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(ii) b is invertible as a fractional S-ideal, i.e., there exists a fractional ideal 
b7! such that 667! = S (necessarily bT! = (S : 6)); 

(iii) There exists d € K* such that db + fN S = S; and 

(iv) b is proper, i.e., S = O(b) = {x € K : xb C b}. 


>17. Let O C B be an R-order. 


(a) Let a € B*. Show that J = Oa is a lattice with OL (I) = Or(/) = O if and 
only if a € B* and Oa@ = œO. Conclude that the set of invertible two-sided 
principal lattices J with OL (Z) = OrRr(I) = O forms a group. 


(b) Show that the normalizer of O, 
Npgx(O) = {a € B* : «Oa™! = O} 
is the group generated by a € B* such that Oa is a two-sided O-ideal. 


18. The following example is due to Kaplansky [Kap69, pp. 220, 221]. Let R be a 
DVR with field of fractions F and maximal ideal p = xR. 


(a) Consider the R-lattice 


mR nR R 
T=|7R nR R|CB=M3(F) 
R R R 


Show that J is invertible but is not principal. 
(b) Consider the R-lattice 


mR nR R 
T=|7?R wR R|CB=M3(F) 
R R R 


Show that J is left invertible but is not right invertible. 
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Chapter 17 


Classes of quaternion ideals 


Having investigated the structure of lattices and ideals in Chapter 16, we now turn to 
the study of their isomorphism classes. 


17.1 > Ideal classes 


For motivation, let K be a quadratic number field and S C K an order. We say that 
two invertible fractional ideals a,b c K of S are in the same class, and write a ~ b, 
if there exists c € K* such that ca = b; we denote the class of a fractional ideal a as 
[a]. We have a ~ b if and only if a and b are isomorphic as S-modules. The set Cl S of 
invertible fractional ideals is a group under multiplication, measuring the failure of S 
to be a PID. The class group CI S is a finite abelian group, by Minkowski’s geometry of 
numbers: every class in Cl S is represented by an integral ideal a C S whose absolute 
norm is bounded (depending on S, but independent of the class), and there are only 
finitely many such ideals. For an introduction to orders in quadratic fields and their 
class numbers, with further connections to quadratic forms, see Cox [Cox89, §7]. 

The first treatment of isomorphism classes of quaternion ideals was given by Brandt 
[Bra28]. Let B be a quaternion algebra over Q. In the consideration of classes of lattices 
I C B, we make a choice and consider lattices as right modules—considerations on 
the left are analogous, with the map J +> T allowing passage between left and right. 
We say that lattices J, J C B are in the same right class, and write J ~p J, if there 
exists a € B* such that a/J = J; equivalently, J ~r J if and only if J is isomorphic to 
J as right modules over Op(/) = Op(J). The relation ~g is evidently an equivalence 
relation, and the class of a lattice J is denoted [/]p. 

Let O C B be an order. We define the right class set of O as 


Clsr O := {[I]r : Z C B invertible and Or(/) = O} 


equivalently, Clsg O is the set of isomorphism classes of invertible right O-modules 
in B. The standard involution induces a bijection between Clsg O and the analogously 
defined left class set Cls_ O; working on the right from now on, we will often abbreviate 
Cls O := Clsp O. 
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Unfortunately, the class set Cls O does not have the structure of a group: only 
a pointed set, with distinguished element [O]r. One problem is the compatibility 
of multiplication discussed in the previous chapter. But even if we allowed products 
between incompatible lattices, the product need not be well-defined: the lattices ZJ and 
TaJ for a € B* need not be in the same class, because of the failure of commutativity. 
(This is the reason we write ‘Cls’ instead of ‘Cl’, as areminder that it is only a class set.) 
In Chapter 19, we will describe the structure that arises naturally instead: a partially 
defined product on classes of lattices, a groupoid. 

In any case, using the same method of proof (geometry of numbers) as in the 
commutative case, we will show that there exists a constant C (depending on O) such 
that every class in Cls O is represented by an integral ideal J C O with NU) = #(O/D < 
C. As a consequence, we have the following fundamental theorem. 


Theorem 17.1.1. Let B be a quaternion algebra over Q and let O C B be an order. 
Then the right class set Cls O is finite. 


Accordingly, we call #Cls O € Zs, the (right) class number of O. 

Right class sets pass between orders as follows. Let O,O’ C B be orders. If 
O = O’ are isomorphic as rings, then of course this isomorphism induces a bijection 
ClsO — ClsO’. In fact, O =~ O’ if and only if there exists œ € B* such that 
O’ =a '!Oa by the Skolem—Noether theorem; for historical reasons, we say that O, O’ 
are of the same type. 

Note that J = Oa = aO’ has OL L(I) = O and Or) = O (recalling 10.2.5). 
With this in mind, more generally, we say that O’ is connected to O if there exists 
an invertible lattice J with OL (J) = O and Or(J) = O’, called a connecting ideal. 
Because invertible lattices are locally principal, two orders are connected if and only 
if they are locally of the same type (i.e., locally isomorphic). If O’ is connected to 
O, then right multiplying by a O, O’-connecting ideal J yields a bijection 


Cls O & Cls O’ 


17.1.2 
Wr HJ]r í ) 


We define the genus of an order O C B to be the set Gen O of orders in B locally 
isomorphic to O, and the type set Typ O of O to be the set of R-isomorphism classes 
of orders in the genus of O. The map 


Cls O — TypO 


[rb class of OL) (17.1.3) 


is a surjective map of sets, so the type set is finite: in other words, up to isomorphism, 
there are only finitely many types of orders in the genus of O. All maximal orders in B 
are in the same genus, so in particular there are only finitely many conjugacy classes 
of maximal orders in B. In this way, the right class set of O also organizes the types of 
orders arising from O. 

The most basic question about the class number is of course its size (as a function 
of O). In the case of quadratic fields, the behavior of the class group depends in a 
significant way on whether the field is imaginary or real: for negative discriminant 
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d < 0, the Brauer-Siegel theorem provides that #Cl S is approximately of size yjd]; 
in contrast, for positive discriminant d > 0, one typically sees a small class group and 
a correspondingly large fundamental unit, but this statement is notoriously difficult to 
establish unconditionally. 

The same dichotomy is at play in the case of quaternion algebras, and to state the 
cleanest results we suppose that O is a maximal order. Let D := disc B = discrd(O) 
be the discriminant of B. If B is definite, which is to say oo € Ram B, then B is like 
an imaginary quadratic field K: the norm is positive definite. In this case, # Cls O is 
approximately of size D, a consequence of the Eichler mass formula, the subject of 
Chapter 25. On the other hand, if B is indefinite, akin to a real quadratic field, then 
#Cls O = 1, this time a consequence of strong approximation, the subject of Chapter 
28. Just as in the commutative case, estimates on the size of the class number use 
analytic methods and so must wait until we have developed the required tools. 


17.2 Matrix ring 


To begin, we first consider classes of ideals for the matrix ring; here, we can use 
methods from linear algebra before we turn to more general methods in the rest of the 
chapter. 


17.2.1. Let R be a PID with field of fractions F, and let B = M,(F). By Corollary 
10.5.5, every maximal order of B = M,,(F) is conjugate to M,(R). Moreover, every 
two-sided ideal of M,,(R) is principal, generated by an element a € F* (multiplying 
a candidate ideal by matrix units, as in Exercise 7.5(b)), so the group of fractional 
two-sided M,,(R)-ideals is canonically identified with the group of fractional R-ideals, 
itself isomorphic to the free abelian group on the (principal) nonzero prime ideals of R. 


Just as in the two-sided case, the right class set for M,,(R) is trivial. 


Proposition 17.2.2. Let R be a PID with field of fractions F, and let B = M,(F). Let 
I C B be an R-lattice with either O (I) or Og(I) maximal. Then I is principal, and 
both O(I) and Or() are maximal. 


Proof. We may suppose Z is integral by rescaling by r € R. Replacing J by the 
transpose I‘ = {at : œ € J} interchanging left and right orders (Exercise 10.12) if 
necessary, we may suppose that O(I) is maximal. Then, by Corollary 10.5.5, we 
have O.(/) = a~'!M,(R)a with œ € B*, so replacing J by a~'I we may suppose 
OL(I) = M, (R). 

Now we follow Newman [New72, Theorem II.5]. Let aj,...,@, be R-module 
generators for J. Consider the nm x n matrix A = (a1, . . ., @m)*. By row reduction over 
R (Hermite normal form, proven as part of the structure theorem for finitely generated 
modules over a PID), there exists Q € GL„m(R) such that QA = (£, 0)' and 8 € M,,(R). 
We will show that J = M,,(R)G. Let v11, ...,Yım E Mn(R) be the block matrices in 
the top n rows of Q. Then B = via +--+: + Vim@m so B E I and M,(R)B C I. 
Conversely, let 411, ...,Hm1 E Mn(R) be the block matrices in the left n columns 
of Q7! € GLjm(R). Since O7'(B, 0} = A, we have p;8 = a; so a; € M, (R) for 
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i = 1,...,m, thus Z C M, (R)£. Therefore J = M,,(R)£, and so Or(/) is maximal 
(16.2.3). o 


Returning to the case of quaternion algebras, we have the following corollary of 
Proposition 17.2.2. 


Corollary 17.2.3. Let R be a Dedekind domain and let B be a quaternion algebra over 
F = Frac R. Let I C B be an R-lattice with either O, (1) or Og() maximal. Then I is 
locally principal and both O, (1) and OR(I) are maximal. 


Proof. For each prime p of R, we have that Ry is a DVR and one of two possibilities: 
either By ~ M2(Fp), in which case we can apply Lemma 17.2.2 to conclude Jy is 
principal, or By is a division algebra, and we instead apply 13.3.10 to conclude that J, 
is principal. o 


17.3 Classes of lattices 


For the rest of this chapter, let R be a Dedekind domain with field of fractions F = 
Frac R, and let B be a simple F-algebra. 


Definition 17.3.1. Let 7, J C B be R-lattices. We say I, J are in the same right class, 
and we write I ~p J, if there exists œ € B* such that a/ = J. 


17.3.2. Throughout, we work on the right; analogous definitions can be made on the 
left. When B has a standard involution, the map J > Z interchanges left and right. 


Lemma 17.3.3. Let I, J C B be R-lattices. Then the following are equivalent: 


(i) I ~rJ; 
(ii) I is isomorphic to J as a right module over Og() = Or(J); and 
Gii) (J : D. is a principal R-lattice. 


Proof. For (i) > (ii). If I ~g J then J = al with a € B*, so OR(J) = OR(I) and the 
map left-multiplication by @ gives a right O-module isomorphism J — J. Conversely, 
for (i) = (ii), suppose that ¢: I — J is an isomorphism of right O-modules. Then 
or : 1@r F = B > J ®p F = B is an automorphism of B as a right B-module. 
Then as in Example 7.2.14, such an isomorphism is obtained by left multiplication by 
a € BX, so by restriction ¢ is given by this map as well. 

Next, for (i) > (iii), suppose aJ = J with a € B*. Then 


(J: DL. ={BeB: BI CJ =al} = «0D 
is principal. The converse follows similarly. o 


The relation ~g defines an equivalence relation on the set of R-lattices in B, and 
the equivalence class of an R-lattice J is denoted [J/]p. If Z is an invertible R-lattice, 
then every lattice in the class [/]p is invertible and we call the class invertible. 

In view of Lemma 17.3.3(b), we organize classes of lattices by their right orders. 
Let O C B be an R-order. 
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Definition 17.3.4. The (right) class set of O is 
Clsr O := {[Z]r : Z an invertible right fractional O -ideal}. 


In view of 17.3.2, we will soon abbreviate Cls O := Clsp O and drop the subscript 
r from the classes, when no confusion can result. 


Remark 17.3.5. The notation Cl O is also used for the class set, but it sometimes means 
instead the stably free class group or some other variant. We use “Cls” to emphasize 
that we are working with a class set. 


17.3.6. The set Clsp O has a distinguished element [O]r € Clsr O, so it has the 
structure of a pointed set (a set equipped with a distinguished element of the set). 
However, in general it does not have the structure of a group under multiplication: for 
classes [I]r, [J]r, we have [æJ]r = [J]r for a € B* but we need not have [IaJ]z = 
[IJ ]p, because of the lack of commutativity. 


17.3.7. An argument similar to the one in Proposition 17.2.2, either arguing locally or 
with pseudobases (9.3.7), yields the following [CR81, (4.13)]. 

Let R be a Dedekind domain with F = Frac R, and let J C B be an R-lattice with 
OL(D = M,,(R). Then there exists 8 € GL,(F) and fractional ideals a),--- ,a, such 
that 

IT=M,(R) diag(a1,...,an)6 (17.3.8) 


where diag(aj,...,d,) is the R-module of diagonal matrices with entries in the given 
fractional ideal. The representation (17.3.8) is called the Hermite normal form of 
the R-module /, because it generalizes the Hermite normal form over a PID (allowing 
coefficient ideals). 

By 9.3.10, the Steinitz class [a ---a,] € Cl Ris uniquely defined. Switching to the 


right, this yields a bijection 
CIR æ Clsr(Mn (R 
rRMn(R)) (17.3.9) 
[a] > [diag(a, 1, .. ., 1) Ma (R)]R 


17.4 Types of orders 


Next, we consider isomorphism classes of orders. Let O, O’ C B be R-orders. 


Definition 17.4.1. We say O, O’ are of the same type if there exists a € B* such that 
Ol =a On, 


Lemma 17.4.2. The R-orders O, O' are of the same type if and only if they are iso- 
morphic as R-algebras. 


Proof. If O,O’ are of the same type, then they are isomorphic (under conjugation). 
Conversely, if 6: O — O’ is an isomorphism of R-algebras, then extending scalars 
to F we obtain df: OF = B — B = O’F an F-algebra automorphism of B. By 
the theorem of Skolem—Noether (Corollary 7.7.4), such an automorphism is given by 
conjugation by a € B*, so O, O’ are of the same type. o 
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17.4.3. If O, O’ are of the same type, then an isomorphism O æ> O’ induces a bijection 
Cls O = Cls O’ of pointed sets. By Lemma 17.4.2, such an isomorphism is provided 
by conjugation O’ = a~'Oa for some a € B*. The principal lattice I = Oa = a0’ 
has O_L(/) = O and Or(/) = O’. 


Generalizing 17.4.3, the class sets of two orders are in bijection if they are con- 
nected, in the following sense. 


Definition 17.4.4. O is connected to O’ if there exists a locally principal fractional 
O, O’-ideal J € B, called a connecting ideal. 


The relation of being connected is an equivalence relation on the set of R-orders. 
If two R-orders O, O’ are of the same type, then they are connected by a principal 
connecting ideal (17.4.3). 


Definition 17.4.5. We say that O, O’ are locally of the same type or locally isomor- 
phic if O, and O, are of the same type (i.e., Op ~ Op) for all primes p of R. 


Lemma 17.4.6. The R-orders O,O’ are connected if and only if O,O' are locally 
isomorphic. 


Proof. Let J be a connecting ideal, a locally principal fractional O, O’-ideal. Then for 
all primes p of R we have Jp = Opay with œp € By, and consequently O, = Or(Up) = 
a, 'Oyap. Therefore O is locally isomorphic to O’. 

Conversely, if O, O’ are locally isomorphic, then for all primes p of R we have 
O; = ap Opa with ay € Bp. Since R is a Dedekind domain, O% = Oy for all but 
finitely many primes p, so we may take ay € Op = O; for all but finitely many primes 
p. Therefore, there exists an R-lattice J with Jp = Oyay by the local-global principle 
for lattices, and / is a locally principal fractional O, O’-ideal. o 


Lemma 17.4.7. If O,O’ C B are maximal R-orders, then OO’ is a O, O' -connecting 
ideal. 


The product in Lemma 17.4.7 is not necessarily compatible. 


Proof. Since O, O’ are R-lattices, their product J := OO’ is an R-lattice. We visibly 
have O € O_(/) and the same on the right; but O, O’ are maximal, so equality holds 
and / is a fractional O, O’-ideal. Finally, Z is invertible by Proposition 16.6.15(b), hence 
locally principal by Main Theorem 16.6.1. oO 


In analogy with the class set, we make the following definitions. 


Definition 17.4.8. Let O C B be an R-order. The genus Gen O of O is the set of R- 
orders in B locally isomorphic to O. The type set Typ O of O is the set of isomorphism 
classes of orders in the genus of O. 


17.4.9. The orders in a genus have a common reduced discriminant, since the discrim- 
inant can be defined locally and is well-defined on (local) isomorphism classes, by 
Corollary 15.2.9. 
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17.4.10. Recalling section 15.5, there is a unique genus of maximal R-orders in a 
quaternion algebra B—that is to say, every two maximal orders are locally isomorphic— 
and this genus has a well-defined reduced discriminant equal to discr B. 


The importance of connected orders is attested to by the following result. 


Lemma 17.4.11. Let O, O’ be connected R-orders, and let J be a connecting O, O'- 
ideal. Then the maps 
Clsp O > Clsp O’ 
Ur WJIr 
WJ Irk GIR 
are mutually inverse bijections. In particular, if O’ € Gen O then # Clsr O = #Clsp O’. 


Proof. By definition, J is invertible with O, (J) = O and OR(J) = O’. Therefore the 
map / + TJ induces a bijection between the set of invertible right O-ideals and the set of 
invertible right O’ -ideals (Lemma 16.5.1 1), with inverse given by I’ + I’J~!, and each 
of these products is compatible. This map then induces a bijection Cls O æ Cls O’, 
since is compatible with left multiplication in B, i.e., (@DJ = a(/J) for all œ € B*. 

o 


Remark 17.4.12. The equivalence in Lemma 17.4.11 is a form of Morita equivalence: 
see Remark 7.2.20. 

Lemma 17.4.11 says that the cardinality of the right class set is well-defined on the 
genus Gen O; and of course the cardinality of the type set is also well-defined on the 
genus (as it is the number of isomorphism classes). 


Lemma 17.4.13. The map 


Clsp O > TypO 


(17.4.14) 
[Z]r }> class of OL(/) 


is a surjective map of sets. 


Proof. If O’ is connected to O, then there is a connecting O’, O-ideal J, and [/]r € 
Clsp O has O, (J) = O’. oO 


Remark 17.4.15. The fibers of the map (17.4.14) is given by classes of two-sided 
ideals: see Proposition 18.5.10. 
17.4.16. Let B = M2(F) and O = M2(R). From the bijection (17.3.9), the classes in 


aoa 
R for [a] € Cl R. Consequently 


O(a) = T 


a! 


Cla(M2(R)) are represented by Ia = 
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We will see later (28.5.1 1) that there is a bijection 


CI R/(C1 R} = Typ M2(R) 
(17.4.17) 
class of [a] up to squares +> class of (2 4 : 


17.5 > Finiteness of the class set: over the integers 


Over the next two sections, we will show that the set Cls O of invertible right (fractional) 
O-ideals is finite using the geometry of numbers. In this section, we carry this out for 
the simplest case, when B is definite over Q; we consider the general case in the next 
section. For further reading on the rich theory of the geometry of numbers, see Cassels 
[Cas97], Gruber—Lekkerkerker [GrLe87], and Siegel [Sie89]. 

Our strategy is as follows: if J is an invertible right O-ideal, we will show there 
exists œ € J~! with the property that aJ = I C O has bounded absolute norm 
NU) = #(O/D < C where C € Ryo is independent of J. The result will then follow 
from the fact that there are only finitely many right O-ideals of bounded absolute norm. 

We begin with some definitions (generalizing Definition 9.3.1 slightly). 


Definition 17.5.1. A Euclidean lattice is a Z-submodule A C R” with A = Z” such 
that RA = R”. The covolume of a Euclidean lattice A is covol(A) = vol(R”/A). 


17.5.2. Equivalently, a Euclidean lattice A c R” is the Z-span of a basis of R”, and if 
A = @; Za;, then covol(A) = |det(a;;); jl. 


Lemma 17.5.3. A subgroup A C R” is a Euclidean lattice if and only if A is discrete 
and the quotient R"/A is compact. 


Proof. Exercise 17.6. oO 


Definition 17.5.4. Let X c R” be a subset. 


(a) X is convex if tx + (1 — t)y € X for all x, y € X andt € [0, 1]. 
(b) X is symmetric if —x € X for all x € X. 


The main result of Minkowski’s geometry of numbers is the following convex body 
theorem. 


Theorem 17.5.5. (Minkowski). Let X c R” be a closed, convex, symmetric subset of 
R”, and let A C R” be a Euclidean lattice. If vol(X) = 2" covol(A), then there exists 
O#a@EANX. 


The following proposition can be seen as a generalization of what was done for the 
Hurwitz order (11.3.1). 
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Proposition 17.5.6. Let B be a definite quaternion algebra over Q and let O C B be 
an order. Then OX = O! is a finite group, and every right ideal class in ClsO is 
represented by an integral right O-ideal with 


N(I) < = discrd(O) 
m 


and the right class set Cls O is finite. 


,b 
Proof. Let B = k l with a,b € Zo. Since B is definite, there is an embedding 


B => Bo = B®gR ~ H. Inside Bo ~ R* with Euclidean norm nrd, the order O 
sits as a Euclidean lattice. The set O! is therefore a discrete subset of the compact set 
BŁ ~ H!, so it is finite. 

Explicitly, we identify 


Bo > R* 
(17.5.7) 
t+xi+yj+ zij V2(t,xvlal, yvidl zylab]) 
Then 2nrd(œ) = |la||* for a € B in this identification, and we have covol(O) = 


discrd(O) (Exercise 17.7). 

Let J C B be an invertible right fractional O-ideal. To find J with [Z] = [J] and 
I integral, we look for a small a € J~! so that J = aJ c O will do. As a measure of 
(co)volume, counting cosets and applying the definition (16.4.9), we obtain 


covol(J~!) = [O : J7}]z covol(O) = N(J~') diserd(O). (17.5.8) 
Let c > 0 satisfy c* = (32/n7) covol(J~!), and let 
X ={xER?*: |[xl] < c}. 


Then X is closed, convex, and symmetric, and vol(X) = n?ct/2 = 16 covol(J~!). Then 
by Minkowski’s theorem (Theorem 17.5.5), there exists 0 # œ € J! AX, and 


N(@J) = Nmgig(@)N(J) = nrd(a)N(J) = Hlal ÉN) 
4 
1 8 (17.5.9) 
< —c*N(J) = — discrd(O). 
4 7 


Since œ is nonzero and B is a division algebra, œ € B*. Since a € J~!, the integral 
right fractional O-ideal J = aJ € O is as desired. 


If 7 c O has NV) = #(O/D) < C for C € Zso, then CO C I C O hence there are 
only finitely many possibilities for Z, and the second statement follows. o 


17.6 > Example 


We pause for an extended example. We steal the following lemma from the future. 
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Lemma 17.6.1. Let e € Zso. Then every principal right M2(Zp)-ideal I with nrd(J) = 
p° is of the form I = aM2(Zp) where 


awe y > :u,v E€ Z>0,u + v = e,andc € zip"2| ; (17.6.2) 
Proof. The lemma follows from the theory of invariant factors: a more general state- 
ment is proven in Lemma 26.4.1. o 

—1,-23 
Example 17.6.3. Let B = , and let 
1+] 1+] 
G2 74747 — en" 
2 2 


We have discrd(O) = disc B = 23, so O is a maximal order, and 6 = (1 + j)/2 satisfies 
B — B + 6 = 0. For convenience, let a = i, so O = Zia, B}. Then 


ap + pa =a. (17.6.4) 


By Proposition 17.5.6, it is sufficient to compute the (invertible) right O-ideals 
I C O such that 


nrd( DÊ = NI) < 2o < 18.7 
T 


so nrd(/) < 4. For nrd(/) = 1, we can only have J = O, and the class [71] = [O]. Let 
O; =O. 

We move to nrd(/) = 2, and refer to Lemma 17.6.1. Since B is split at 2, there is 
an embedding 


O a M2(Z2) 
QO -l1\ fl 0 
oa (i Mo ale 
where bo = 2+8+164+32+4+---€ Z2 satisfies bj — bo + 6 = 0 and bp = 0 (mod 2). 
We have 
_ fil 0) (0 O\ {1 O 
P.B +1,(@« +1) = o o i o) (mod 2) 

so we obtain the three right ideals 

Ia:0) =20+ BO, To:1) =20+ (£ = 1)O, la: =20+ (a + 1)BO (17.6.5) 


labelled by the corresponding nonzero column. If one of these three ideals is principal, 
then it is generated by an element of reduced norm 2. We have 


nrd(t + xa + yB + zap) 


=P +ty+x +xz+6y +62 


; A (17.6.6) 
aipe + r ee a 22 2 
= z? x+ 5% re tT 
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So nrd(y) = 2 with y € O has t, x, y,z € Z and therefore y = z = 0 and t = x = 1, i.e., 
laa) = (a + 1)O is principal, and the ideals T(1:0); To:1) are not. But [a:o] = [Zo:1] 
because aJ(1:9) = 1:1) because a € O* and by (17.6.4) 


a(20 + (£ - 1)O) = 2a0 + a(B - 1)O = 20 - BaO = Io:1) 


(We have alq: + Ia:0) precisely because a ¢ O,(J/).) In this way, we have found 
exactly one new right ideal class, [2] = [/(1.9)]. We compute its left order to be 


i(1 + 37) 
4 

and we also have a new type [O2] + [O;] € Typ O. 
In a similar way, we find 4 right ideals of reduced norm 3, and exactly one new 


right ideal class, represented by the right ideal J; = 30 + (a + 1)8O. For example, we 
find that the right ideal 7’ = 30 + GO is not principal using (17.6.6): letting 


O2 := OL Un) = Z + BZ + Z+(2ij)Z #0 


1 — 
2 by a1 = ge 
and we find a shortest vector 


(1-f/2eU': hy, 


so [7] = [4]. 
Repeating this with ideals of reduced norm 4 (Exercise 17.8), we conclude that 


Cls O = {[7], [2], U3 ]} 


and letting O3 := O, (B), checking it is not isomorphic to the previous two orders, we 
have 


Typ O = {[01], [02], [O3]}. 


17.7 Finiteness of the class set: over number rings 


We now turn to the general case. 


Main Theorem 17.7.1. Let F be a number field, let SC P1 F be eligible and R = Rs) 
be the ring of S-integers in F. Let B be a quaternion algebra over F, and let O C B be 
an R-order in B. Then the class set Cls O and the type set Typ O is finite. 


We call #Cls O the (right) class number of O. (By 17.3.2, the left class number 
suitably defined is equal to the right class number.) This result will be drastically 
improved upon in Part II of this text from analytic considerations; the proof in this 
section, using the geometry of numbers, has the advantage that is easy to visualize, it 
works in quite some generality, and it is the launching point for algorithmic aspects. 
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17.7.2. Before we begin, two quick reductions. The finiteness of the type set follows 
from finiteness of the right class set by Lemma 17.4.13. And if R = Zr is the ring of 
integers of F, then the general case follows from the fact that the map 

Cls O > Cl(O 8R Ris) 


(17.7.3) 
[I] > [1 8r R] 


is surjective for an eligible set S. 


Let F be a number field of degree n = [F : Q], let R = Zr be the ring of integers 
in F, and let B be a quaternion algebra over F. 


17.7.4. Suppose that F has r real places and c complex places, so that n = r +2c. Then 


F > Fo =F 8&QR= | |F, =R" xC.. (17.7.5) 
ylco 
Taking the basis 1,i for C, we obtain Fœ ~ R”, and then in the embedding (17.7.5), 
the ring of integers R ~ Z” sits discretely inside F» ~ R” as a Euclidean lattice. 


,b 
17.7.6. Suppose B = (=) and let 1, i, j, k be the standard basis for B with k = ij, so 
B=F@®Fi@®Fj@® Fk ~ F* as F-vector spaces. Then 

B > Bo = B Q&Q R = B 8p Fo = FE (17.7.7) 


in this same basis. Via (17.7.5) in each of the four components, the embedding (17.7.7) 
then gives an identification B» ~ (R")* = R*”. 

The order R«i, j,k) = R+Ri+Rj+Rk is discrete in B% exactly because R is discrete 
in F. But then implies that an R-order O is discrete in Bæ, since [O : RU, j, k)|]z < ©. 
Therefore O <> R^” has the structure of a Euclidean lattice. 


In the previous section, the real vector space Bœ was Euclidean under the reduced 
norm. In general, that need no longer be the case. Instead, we find a positive definite 
quadratic form Q: B» — R that majorizes the reduced norm in the following sense: 
we require that 

INmp/o(nrd(@))| < O(a)" (17.7.8) 


for all œ € BC Bo. 


Remark 17.7.9. With respect to possible majorants (17.7.8): in general, there are 
uncountably many such choices, and parametrizing majorants arises in a geometric 
context as part of reduction theory. As it will turn out, the only “interesting” case to 
consider here is 17.7.10, by strong approximation (see Theorem 17.8.3). 


17.7.10. Let B be a totally definite (Definition 14.5.7) quaternion algebra over F, a 
totally real number field. Then the quadratic form 


Q:B>Q 
a Trrg(nrd(@)) = $` v(nrd(@)) (17.7.11) 


vloo 
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is positive definite: B, ~ H and so v(nrd(q)) > 0 with equality if and only if a = 0. We 
call this quadratic form the absolute reduced norm. In this case, by the arithmetic- 
geometric mean, 


l/n 
Nmr olnrd(a))!” = g vot) 


v 


(17.7.12) 
< — J v(nrd(a)) = O(a) 


(with equality if and only if v(nrd œ) agrees for all v). 
We pause to note the following important consequence of 17.7.10. 


Lemma 17.7.13. Let B be a totally definite quaternion algebra over a totally real field 
F and let O C B be a Zp-order. Then the group of units of reduced norm 1 


O! = {y € O : nrd(y) = 1} 
is a finite group. 


In Lemma 17.7.13, if F = Q then OX = O!, so we have captured the entire unit 
group. 


Proof. As in 17.7.10, we equip Br := B ®g R = H” ~ R^” with the absolute reduced 
norm giving O —> Bp the structure of a Euclidean lattice (17.7.7). We have 


O! ={yeO: Oy) =n} (17.7.14) 


by the arithmetic-geometric mean (17.7.12). But the set {x € Br : Q(x) = n} is an 
ellipsoid in R” so compact, and O is a lattice so discrete. Therefore the intersection 
O! is finite. o 


17.7.15. We now generalize 17.7.10 to the general case. For v an infinite place of F, 
define 


Qy: Bp > R 
t+xi+ yj +zij = WOP +MP +OP + vab); 
then Q, is a positive definite quadratic form on B,, and 
\v(nrd(a@))| = |v(t? — ax? — by? + abz’)| 
< WOP + MDO + OIO + vabo? 07.7.16 
= 0,(a). 


Let m, = 1,2 depending on if v is real or complex, and define 


Q: Bo = | |B, >R 


v|oo 


(ayy > $ mOva). 


vloo 


(17.7.17) 


290 CHAPTER 17. CLASSES OF QUATERNION IDEALS 


Then Q is a positive definite quadratic form on Boo, again called the absolute reduced 


a, 


norm (relative to a, b); it depends on the choice of representation B = . Never- 
theless, (17.7.16) and the arithmetic-geometric mean yield 
1 
N d(a))|'/" < = y{v(ard 
INmpja(nrd(a))|!"" < = 2m |v(nrd(a’)| 
(17.7.18) 


1 
< — J mQx(a) = Ola). 


yloo 
We are now ready to prove the main result of this section. 


Proposition 17.7.19. There exists an explicit constant C € Rso such that for all R- 
orders O, every right ideal class in Cls O is represented by an integral right O-ideal I 
with 

NC) < CN(discrd(O)). 


Proof. If B ~ M2(F), then we appeal to 17.3.7, where such a bound comes from the 
finiteness of Cl R. So we may suppose that B is a division ring. 
Let 
X = (x); € R” : O(a) < 1}. (17.7.20) 


Then X is closed, convex, and symmetric. 
Let O be an R-order in B and let J be an invertible right fractional O-ideal. As in 
(17.5.8), counting cosets gives 


covol(J~!) = N(J)! covol(O). (17.7.21) 
Let 


aoe (17.7.22) 


Then vol(cX) = c^" vol(X) = 2*" covol(J~!). By Minkowski’s theorem, there exists 
0 ža e J! A cX, so O(a) < c°. By (17.7.18), 


(a) i 
c := 2 | ——— 


1 cn 
INmprg(nrd(@))| < zn O(a)" < oo 


Consequently 


4n 
N(@J) = [Nmp/o(nrd(a))PN(J) < ŽNO) 
n 


24"N(J)~! covol(O) 24" covol(O) (17.7.23) 
i 2n NJ) = 2n 
n” vol(X) n-” vol(X) 
= CN(discrd(O)) 


with ‘ 
pan covol(O) 
C:= : 17.7.24 
n?” vol(X) N(discrd(O)) ( ) 
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The ratio covol(O)/N(discrd(O)) is a constant independent of O: if O’ is another 
R-order then 
N(discrd(O’)) _ [O : O’]zN(discrd(O)) _ N(discrd(O)) 
covol(O’) — [O:O’]zcovol(O) ~~ covol(O) ` 


Since @ is nonzero and B is a division algebra we conclude that a € B*, and since 
a € J~!, the ideal J = aJ is as desired. oO 


Remark 17.7.25. For an explicit version of the Minkowski bound in the totally definite 
case, with a careful choice of compact region, see Kirschmer [Kir2005, Theorem 
3.3.11]. 


Lemma 17.7.26. For all C > 0, there are only finitely many integral right O-ideals 
with NW) < C. 


Proof. We may suppose C € Z. If J c O then N(D) = [O : I]z < C,soCO CIC 
O. But the group O/CO is a finite abelian group and there are only finitely many 
possibilities for Z. o 


We now have the ingredients for our main theorem. 


Proof of Main Theorem 17.7.1. Combine Proposition 17.7.19, the reductions in 17.7.2, 
and Lemma 17.7.26. o 


Remark 17.7.27. The finiteness statement (Main Theorem 17.7.1) can be generalized 
to the following theorem of Jordan—Zassenhaus. Let R be a Dedekind domain with 
F = Frac(R) a global field, let O C B be an R-order in a finite-dimensional semisimple 
algebra B, and let V be a left B-module. Then there are only finitely many isomorphism 
classes 7 C B with O c O, (D). Specializing to V = B a quaternion algebra, we recover 
the Main Theorem 17.7.1. For a proof, see Reiner [Rei2003, Theorem 26.4]; see also 
the discussion by Curtis—Reiner [CR81, §24]. 


17.8 Eichler’s theorem 


In this section, we state a special but conceptually important case of Eichler’s theorem 
for number fields: roughly speaking, the class set of an indefinite quaternion order is 
in bijection with a certain class group of the base ring. 

Let F be a number field with ring of integers R = Zp and let B be a quaternion 
algebra over F. 


Definition 17.8.1. We say B satisfies the Eichler condition if B is indefinite. 


Definition 17.8.1 introduces a longer (and rather opaque) phrase for something 
that we already had a word for, but its use is prevalent in the literature. There are two 
options: either B is totally definite (F is a totally real field and all archimedean places 
of F are ramified in B) or B is indefinite and satisfies the Eichler condition. 
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17.8.2. Recall 14.7.2 that we define Q C Ram B to be the set of real ramified places of 
B and F% 0 to be the positive elements for v € Q. 
We now define the group Clo R as 


the group of fractional ideals of F under multiplication 
modulo 


the subgroup of nonzero principal fractional ideals 
generated by an element in F% 0 


If Q is the set of all real places of F, then Clo R = CI* R is the narrow (or strict) class 
group. On the other hand, if Q = Ø, then Clo R = Cl R. In general, we have surjective 
group homomorphisms Cl* R > Clo R and Clo R — CIR. In the language of class 
field theory, Clo R is the class group corresponding to the cycle given by the product 
of the places in Q. 


Theorem 17.8.3. (Eichler; strong approximation). Let F be a number field and let B 
be a quaternion algebra over F that satisfies the Eichler condition. Let O C B be a 
maximal Z-order. Then the reduced norm induces a bijection 


Cls O & Clo R 


(17.8.4) 
[I] > [nrda(]. 


where Q C Ram B is the set of real ramified places in B. 


Proof. Eichler’s theorem is addressed by Reiner [Rei2003, $34], with a global proof 
of the key result [Rei2003, Theorem 34.9] falling over several pages. We will instead 
prove a more general version of this theorem as part of strong approximation, when 
idelic methods allow for a more efficient argument: see Corollary 28.5.17. o 


Eichler’s theorem says that when B is not totally definite, the only obstruction for 
an ideal to be principal in a maximal order is that its reduced norm fails to be (strictly) 
principal in the base ring. In particular, we have the following corollary. 


Corollary 17.8.5. If #Cl* R = 1, then #ClsO = 1: ie., every right O-ideal of a 
maximal order in an indefinite quaternion algebra is principal. 


Proof. Immediate from Eichler’s theorem and the fact that Cl* R surjects onto Clo R, 
by 17.8.2. o 


Corollary 17.8.6. There is a bijection Cls Mo(Zr) > CI ZF. 


Proof. Immediate from Eichler’s theorem; we proved this more generally for a matrix 
ring (17.3.9) using the Hermite normal form. o 


17.8.7. It is sensible for the class group Clo R to appear by norm considerations. Let 
v € Q; then B, = H, and if œ € B* then v(nrd(@)) > 0, as the reduced norm is positive. 


The class sets of totally definite orders are not captured by Eichler’s theorem, 
and for good reason: they can be arbitrarily large, a consequence of the Eichler mass 
formula (Chapter 25). 
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Exercises 


Unless otherwise specified, throughout these exercises let R be a Dedekind domain 
with field of fractions F, let B be a quaternion algebra over F, and let O C B be an 
R-order. 


1. Argue for Proposition 17.2.2 directly in a special case as follows. Let Z € M2(F) 
be a lattice with Orn(J) = M2(R). 


(a) By considering Z &gr F show that 


Ic k 0 M2(R) ® E 4 M2(R). 


(b) Suppose that R is a PID. Conclude that J is principal. 


2. Let O, O’ C B be R-orders. Show that the map in Lemma 17.4.11 is a bijection 
of pointed sets if and only if O is isomorphic to O’. 


>3. Let O,O’ C B be R-orders with O C O’. 


(a) If J is an invertible right O-ideal, show that JO’ is an invertible right 
O’-ideal. (The product JO’ is not necessarily compatible.) 


(b) Show that the map 


Cls O > Cls O’ 
[Z] => UO’) 


is well-defined, surjective, and has finite fibers. /Hint: let r € R be nonzero 
such that O' C r'O. If IO' =T, then I’ = 10 Cr'icr'l' so 
rl’ CICI, and conclude there are only finitely many possibilities for I.] 


4 Let O,O’ C B be maximal R-orders. In this exercise, we prove the following 
statement: 


There is a unique integral connecting O,O’ ideal J of minimal 
reduced norm; moreover, we have nrd(J) = [O : ON O’]. 


(a) Show that this statement is local, i.e., the statement is true over R if and 
only if it is true over Ry for all primes p of R. 

(b) Suppose R is a DVR. Show that the statement is true if B is a division 
algebra. 

(c) Suppose R is a DVR with maximal ideal p, and that B ~ M2(F). Show that 
there is a unique a € O\pO such that O’ = a~!Oa up to left multiplication 
by O%, and conclude that J = Oa is the unique integral connecting O, O’ 
ideal of minimal reduced norm. /Hint: Nor,(r)\(M2(R)) = F* GL2(R).] 

(d) Continuing (c), show that nrd(a@) = [O : ON O’). [Hint: the statement is 
equivalent under left or right multiplication of a by O* =~ GLa(R), so 
consider invariant factors.] [For another perspective, see section 23.5.] 
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>5. Let O C B be an R-order and let J be an invertible fractional right O-ideal. Let 
a C R be a nonzero ideal. Show that there exists a representative J € [/]p (in the 
same right ideal class as 7) such that J © O and nrd(J) is coprime to a. [Hint: 
look for a € (O : Dr and then look locally. ] 


>6. Prove Lemma 17.5.3: a subgroup A C R” is a Euclidean lattice if and only if A is 
discrete (every point of A is isolated, i.e., every x € A has an open neighborhood 
U > x such that A N U = {x}) and the quotient R”/A is compact. 


>7. Let B be a definite quaternion algebra over Q and let O C B be an order. 


(a) Let Bo = B ®g R. Show that nrd is a Euclidean norm on Bæ, and O is 
discrete in B,, with covol(O) = 4 discrd(O). [So it is better to take V2 nrd 
instead, to get covol(O) = discrd(O) on the nose.] 

(b) Let Kı, K2 C B be quadratic fields contained in B with Kı N K2 = Q. Let 
Si := K; O O and d; = disc S;. Show that 


(ldi| — 1)(ld2| — 1) > 4 discrd(O). 


[Hint: write S; = Z[a;] and consider the order Zla, a2). ] 
(c) Prove that if 1, œ2 € O have 


4/discrd(O) 


nrd(@1), nrd(a2) < 7 


then œa = a2Q1. 


8. Complete Example 17.6.3 by showing explicitly that all right O-ideals of reduced 
norm 4 are in the same right ideal class as one of 1), h, B. 


9. Let R be a global ring with #C1 R = 1, i.e., every fractional R-ideal is principal 
a = aR. Suppose further that #ClsO = 1. Let a € O have nrd(œ) + 0, and 
factor nrd(@) = mm +- a, € R where 7; € R are pairwise nonassociate nonzero 
prime elements (equivalently 2;R are pairwise distinct nonzero prime ideals). 


(a) Show that there exist w1, w2, . . ., W, € O such that a = ww? w, and 
nrd(w;)R = 7;R for alli = 1,...,r. 


(b) Show that every other such factorization is of the form 
@ = (DY, 2V2) V1 Gr) 
where y1, .. ., Yr € O”. 
(c) Suppose that nrd(O*) = R*. Refine part (a) and show that the stronger 


conclusion that there exist w; such that nrd(@;) = 7; for all i. 


[This generalizes Theorem 11.4.8.] 


10. We have seen that maximal orders in (definite) quaternion algebras of discrim- 
inant 2 (the Hurwitz order) and discriminant 3 (Exercise 11.11) are Euclidean 
with respect to the norm, and in particular they have trivial right class set. 
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(a) Show that maximal orders O in quaternion algebras of discriminants 5, 7, 13 
have #Cls O = 1. 
(b) Conclude that the quaternary quadratic forms 
t? + tx tty +tz +x? + xy +xz+2y?— yz + 2z, 
t? + tz +x? + xy + 2y? +2, 
t + ty +tz+2x? + xy +2xz+2y? + yz + 4z? 


are multiplicative and universal, i.e., represent all positive integers. 
(c) Show that for discriminant 7, 13 the maximal orders are not Euclidean with 
respect to the norm. 


[We discuss the maximal orders of class number 1 in Theorem 25.4.1. The 


maximal order for discriminant 5 is in fact norm Euclidean: see Fitzgerald 
[Fit2011].] 


11. In this exercise, we show that the group of principal two-sided ideals PIdl(O) 
need not be normal in the group of invertible fractional O-ideals Idl(O) of an 
order. 

Let B = (—1,-1 | Q), and let O C B be the Hurwitz order. Let O’ = Z + 50 = 
O(5) (cf. Exercise 18.6). Show that 


I’ =100' +(1 -2i + j)O’ 
is a two-sided invertible O’-ideal, and that 
rj@’y'! =50' +4437 + k)O' 
is not principal. 


12. The finiteness of the class group (see Reiner [Rei2003, Lemma 26.3]) can be 
proven replacing the geometry of numbers with just the pigeonhole principle, as 
follows. Let B be a division algebra over a number field F with ring of integers 
R, and let O C B be an R-order. 


(a) To prove the finiteness of Cls O, show that without loss of generality we 
may take F = Q. 

(b) Show that Nmgig(x1@1 +--+ + Xn@n) € Qlxı, ..., Xn] is a homogeneous 
polynomial of degree n. 

(c) Show that there exists C € Zs such that for all t > O and all x € Z” with 
lxi| < t, we have INmpjr(x1Q] + +++ + Xn@n)| < Ct”. 

(d) Let 7 C O be a lattice. Let s € Z be such that 


s” < N(D) =#(O/D < (s +1)". 


Using the pigeonhole principle, show that there exists a = )}; xa; € I 
with x; € Z and |x;| < 2(s + 1) for all i. 
(e) Show that N(@O) < 2”(s + 1)"C, and conclude that 


#([/a@O) < 4"C. 
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(f) Let M = (4” C)! and show that MI C œO, whence 
MOCI’cO 


where J’ = (Ma™')/. Conclude that the number of possibilities for 7’ is 
finite, hence the number of right classes of lattices J C O is finite, and 
hence #Cls O < o. 
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Chapter 18 


Two-sided ideals and the Picard group 


In this chapter, we treat maximal orders like noncommutative Dedekind domains, and 
we consider the structure of two-sided ideals (and their classes), in a manner parallel 
to the commutative case. 


18.1 > Noncommutative Dedekind domains 


Let R be a Dedekind domain with field of fractions F: then by definition R is noetherian, 
integrally closed, and all nonzero prime ideals of R are maximal. Equivalently, every 
ideal of R is the product of prime ideals (uniquely up to permutation). To establish this 
latter property of unique factorization of ideals, there are two essential ingredients: 
first, every proper ideal contains a finite product of prime ideals, and second, every 
nonzero prime ideal p C R is invertible. The first of these uses that R is noetherian 
and that nonzero prime ideals of R are maximal; the second uses that R is integrally 
closed. 

Here, the theorems are no easier to prove in the case of a quaternion algebra, so we 
might as well consider them in more generality. Let B be a simple F-algebra and let 
O C B be an R-order. 

To draw the closest analogy with Dedekind domains, we suppose that O c B 
is maximal: this is the noncommutative replacement for integrally closed. Since O 
is finitely generated, if Z C O is a two-sided O-ideal, then / is a finitely generated 
R-submodule, so the noetherian condition on R automatically implies that every chain 
of ideals of O stabilizes. We say a two-sided ideal P C O is prime if P + O and for 
all two-sided ideals J, J C O, we have 


IJCP > ICPorJs CP. 
Running parallel to the above, we have the following initial lemma. 


Lemma 18.1.1. A nonzero two-sided O-ideal is prime if and only it is maximal, and 
every two-sided O-ideal contains a product of prime two-sided O-ideals. 


Completing the analogy with the commutative case, we then have the following 
theorem. 
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Theorem 18.1.2. Let R be a Dedekind domain with field of fractions F = Frac R, let 
B be a simple F-algebra and let O C B be a maximal R-order. Then the following 
statements hold. 


(a) If I C B is an R-lattice such that OL(I) = O or Og() = O, then I is invertible 
and both OL (T) and Or(1) are maximal R-orders. 


(b) Multiplication of two-sided O-ideals is commutative, and every nonzero two- 
sided O-ideal is the product of finitely many prime two-sided O-ideals, uniquely 
up to permutation. 


Let Idl(O) be the group of invertible two-sided fractional O-ideals. Put another 
way, Theorem 18.1.2 says that if O is maximal, then Idl(O) is isomorphic to the free 
abelian group on the set of nonzero prime two-sided O-ideals under multiplication. 

We now consider classes of two-sided ideals, in the spirit of section 17.1. Two 
candidates present themselves. On the one hand, inside the group Idl(O) of invertible 
fractional two-sided O-ideals, the principal fractional two-sided O-ideals (those of 
the form Oa@O = Oa = aO for certain a € B*) form a subgroup PIdl(O), and we 
could consider the quotient. On the other hand, for a commutative ring S, the Picard 
group Pic(S) is defined to be the group of isomorphism classes of rank one projective 
(equivalently, invertible) S-modules under the tensor product. When S is a Dedekind 
domain, there is a canonical isomorphism Cl $ = Pic(S). 

For simplicity, suppose now that R = Z. In this noncommutative setting, we 
analogously define Pic O to be the group of isomorphism classes of invertible O- 
bimodules (over Z) under tensor product. If 7, J € Idl(O), then J, J are isomorphic as 
O-bimodules if and only if J = al with a € Q*, and this yields an isomorphism 


Pic O = Idl(O)/Q*. 
Let 
Npx(O) = {a € BX : «O = Oa} 
be the normalizer of O in B. By the Skolem—Noether theorem, 


Ngx(O)/Q* = Aut(O) 


is the group of Z-algebra (or ring) automorphisms of O. 


Theorem 18.1.3. Let B be a quaternion algebra over Q of discriminant D := disc B, 
and let O C B be a maximal order. Then 


PicO = I] Z/2Z 
p|\D 


generated by (unique) prime two-sided O-ideals with reduced norm p | D, and there 
is an exact sequence 
0 — Ngx(O)/(Q*O*%) > Pic O > Idl(O)/ PIdl(O) > 0 


Xx (18.1.4) 
a(Q*O*) > [OaO]. 
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In particular, Pic O is a finite abelian 2-group. As an application of Theorem 18.1.3, 
we revisit the map (17.1.3): 


Cls O > TypO 
[I]r t class of OL(D 


We recall that this map is surjective. The fibers are given by Theorem 18.1.3 (see 
Proposition 18.5.10): the fiber above the isomorphism class of O’ is in bijection with 
the set PIdl(O’)\ Id1(O’). 


Remark 18.1.5. The structure of Pic O is more complicated when O is not necessarily 
a maximal order: in general, the group Pic O is finite but it may be nonabelian (see 
Exercise 18.6); worse still, in general the subgroup PIdl(O) may not be a normal 
subgroup in Id1(O). 


18.2 Prime ideals 


Throughout this chapter, let R be a Dedekind domain with field of fractions F = Frac R, 
let B be a simple finite-dimensional F-algebra, and let O C B be an R-order. 


18.2.1. Let J C O be a nonzero two-sided ideal. In view of Remark 16.2.10, we see 
that J is automatically an R-lattice: ZJF C B is a two-sided ideal of B, so since B is 
simple and J + {0} we must have IF = B. 


Definition 18.2.2. A two-sided ideal P C O is prime if P + O and for all two-sided 
ideals J, J C O we have 


IJCP > ICPorJlcCP. 


A two-sided O-ideal M C O is maximal if M + O and M is not properly contained 
in another two-sided ideal. 


Example 18.2.3. The zero ideal P = {0} is prime: see Exercise 18.2. 


18.2.4. Let P C O be a two-sided ideal. Then the two-sided O/P-ideals are in bijection 
with the two-sided O-ideals containing P. If P # O, then P is prime if and only if for 
all two sided O/P-ideals 7/P, J/P, we have 


U/P)\(J/P)={0} = I/P ={0}or J/P = {0}. (18.2.5) 
Lemma 18.2.6. If M is a maximal two-sided O-ideal, then M is prime. 


Proof. Suppose IJ © M. Then 7+ MXJ + M) € M. But 1+ M 2 M so either 
I+M =Morl+M =O by maximality, and the same is true for J. Since M + O we 
must have either Z + M = M or J + M = M, whichis to say 7G MorJ CM. o 


Proposition 18.2.7. 


(a) A nonzero two-sided O-ideal is prime if and only it is maximal. 
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(b) If P C O is a nonzero prime two-sided O-ideal, then p = P A R is a nonzero 
prime ideal of R, and O/P is a finite-dimensional simple algebra over the field 
R/p. 


Proof. We follow Reiner [Rei2003, Theorem 22.3]. The implication (=) in (a) follows 
from Lemma 18.2.6. Conversely, let P be a nonzero prime two-sided O-ideal, and let 
p = PA R. We show p is a nonzero prime. By 18.2.1, P is an R-lattice, so p + {0}; 
since 1 ¢ P, we have p + R and p is nontrivial. If a,b € R, then ab € p implies 
(aO)(bO) € P; since P is prime, we have aO C P or bO C P,soaeporbe p. 

Now let J/P = rad(O/P) be the Jacobson radical of O/P (see section 7.4). By 
Lemma 7.4.8, the ideal J/P is nilpotent; by (18.2.5), we conclude J/P = {0}. Thus 
O/P is semisimple by Lemma 7.4.2 and thus is a product of simple R/p-algebras by 
the Wedderburn—Artin theorem (Main Theorem 7.3.10). But the simple components 
of O/P are two-sided ideals that annihilate one another; again by (18.2.5), there can 
be only one component, and O/P is simple. Thus O/P has no nontrivial ideals, and P 
is maximal. o 


Lemma 18.2.8. Every nonzero two-sided ideal of O contains a (finite) product of 
nonzero prime ideals. 


Proof. If not, then the set of ideals which do not contain such products is nonempty; 
since O is noetherian, there is a maximal element M. Since M cannot itself be prime, 
there exist ideals /, J, properly containing M, such that JJ c M. But both J, J contain 
products of prime ideals, so the same is true of M, a contradiction. oO 


We now turn to notions of invertibility. 


18.2.9. Let I be an invertible two-sided fractional O-ideal (cf. Definition 16.2.9 and 
16.5.17). In particular, OL() = OrU) = O. If J is another invertible two-sided 
fractional O-ideal, then so is ZJ, by Lemma 16.5.11: we have OL(IJ) = OL(I) = O 
and Og(J) = Or(J) = O. Let Idl(O) be the set of invertible two-sided fractional 
O-ideals. Then Idl(O) is a group under multiplication with identity element O. 


The structure of Idl(O), and quotients under natural equivalence relations, is the 
subject of this chapter. 


18.3 Invertibility 


We now consider invertibility first in the general context of orders, then for maxi- 
mal orders. The general theory of maximal orders over Dedekind domains in simple 
algebras was laid out by Auslander-Goldman [AG60]. One of the highlights of this 
theory are the classification of such orders: they are endomorphism rings of a finitely 
generated projective module over a maximal order in a division algebra. For a quite 
general treatment of maximal orders, see the book by Reiner [Rei2003]; in particular, 
the ideal theory presented here is also discussed in Reiner [Rei2003, §§22—23]. 


Lemma 18.3.1. Let J be a two-sided O-ideal, not necessarily invertible. If J Ç O, 
then J7! 9 O. 
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Proof. The R-lattice J7! has J7! 2 O and O.(J~!) C Og(J) = O and the same result 
holds interchanging left and right. 

We follow Reiner [Rei2003, Lemma 23.4] (who calls the proof “mystifying”’). 
Assume for the purposes of contradiction that J7! = O. Since J Ç O, there exists a 
maximal two-sided O-ideal M 2 J. Thus M7! c J7! = O. By Lemma 18.2.6, M is 
prime. Let a € RN J7! be nonzero. By Lemma 18.2.8, aO contains a product of prime 
two-sided O-ideals, so 

M 2 aO 2 P;P2:::P,, 


with each P; prime. We may suppose without loss of generality that r € Z>o is minimal 
with this property. Since P;---P, © M and M is prime, we must have P; C M, so 
P; = M by Proposition 18.2.7. Thus 


M 2a02JıMh) 


with Jı, J2 two-sided O-ideals. From a7! J;MJo C O, we have J(a™!MJ)Ji € Ji, 
so by definition aM hJı © OL (Jı) = O. Thus M(a7! hJ) M c M and a™! hJ € 
M`! c Oso Jo J; C aO. This shows that aO contains the product J2 J1 of r — 1 prime 
two-sided O-ideals, contradicting the minimality of r. o 


Using this lemma, we arrive at the following proposition for maximal orders. 


Proposition 18.3.2. Let I C B be an R-lattice such that O, (I) is a maximal R-order. 
Then I is right invertible, i.e., IIT! = O, (D). 


Of course, one can also swap left for right in the statement of Proposition 18.3.2. 
Using the standard involution, we proved Proposition 18.3.2 when B is a quaternion 
algebra (Proposition 16.6.15(b)). 


Proof of Proposition 18.3.2. We follow Reiner [Rei2003, Theorem 23.5]. Let O = 
OL(I). Let J = II! c O. Then JI = III C I, so J C O (D = O and J is a 
two-sided O-ideal. We have 


JJ! = ITJ! c O, 
sol~!J~! c I~! and therefore J7! c Og(“!). Additionally, 
Or(I7') 2 OLD) = O; (18.3.3) 


but O is maximal, so equality holds in (18.3.3) and therefore J7! C O. But O c J! 
as well, so J7! = O. If J Ç O, then we have a contradiction with Lemma 18.3.1; so 
J = O, and the proof is complete. o 


Putting these ingredients together, we have the following theorem. 


Theorem 18.3.4. Let R be a Dedekind domain with F = Frac R, let B be a simple 
F-algebra, and let O C B be a maximal R-order. Then: 
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(a) Multiplication of two-sided ideals is commutative: if I, J are two-sided O-ideals, 
then IJ = JI. 

(b) Every nonzero two-sided O-ideal is invertible and uniquely expressible as a 
product of prime two-sided ideals in O. 


Proof. For (b), invertibility follows from Proposition 18.3.2. For (b) without unique- 
ness, assume for purposes of contradiction that there is a two-sided ideal of O that is 
not the product of prime ideals; then there is a maximal counterexample J. Since J is 
not prime, there exists a prime Q with JC QC O, so J c JQ! CO.IfJ=JQ"', 
so by cancelling Q = O, a contradiction. Therefore JQ! = P| --- P, is the product of 
primes by maximality, and J = P, --- P,Q is the product of primes, a contradiction. 

We now prove (a). If P,Q © O are distinct nonzero prime two-sided ideals, and 
we let Q’ = P~'QP, then Q’ c P~'OP = O is prime and PO’ = OPC O,so PCO 
or Q’ C Q; but equality would hold in each case by maximality, and since P + Q, we 
must have Q’ = Q, and multiplication is commutative. 

Finally, uniqueness of the factorization in (b) follows as in the commutative case. 
If Pi- P, = Q,---Qs, then P; = Q; for some i; multiplying by Pe and repeating 
the argument, we find that {P}, . . ., P+} = {Q1,...,Qs}, and the result follows. oO 


Corollary 18.3.5. With hypotheses as in Theorem 18.3.4, the group Id\(O) is isomor- 
phic to the free abelian group on the set of nonzero prime ideals. 


With these arguments in hand, we have the following foundational result for quater- 
nion orders. 


Theorem 18.3.6. Suppose that R is a Dedekind domain. Let B be a quaternion algebra 
over F and let O C B be a maximal R-order. Then the map 


{Prime two-sided O-ideals} e {Prime ideals of R} 


(18.3.7) 
P= PAR 
is a bijection. 
Moreover, if R is a global ring, then there is an exact sequence 
0 > IdI(R) > Idl(O) > Fi Z/2Z > 0 
pID (18.3.8) 


a > OaO 
where D = discr(B). 


Proof. The map (18.3.7) is defined by Proposition 18.2.7, and it is surjective because 
pO C P is contained in a maximal therefore prime ideal. 

Next we show that the map is injective. Let P be a prime ideal, and work with 
completions at a prime p. Then Pp = P r Rp S Op is a maximal ideal of Oy. If 
By =~ M2(Fp), so Op ~ M2(Rp), then the only maximal two-sided ideal is pO,; if 
instead By is a division algebra, then there is a unique maximal two-sided ideal Pp 
with fas = pO, by Theorem 13.3.11. We can also describe this uniformly, by the proof 
of Proposition 18.2.7: in all cases, we have P, = rad(O,). 
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There is a natural group homomorphism 


Idl(R) — Idl(O) 
at OaO = aO 


This map is injective, since if aO = O then a? = nrd(aO) = nrd(O) = R, soa = R. 
The cokernel of the map is determined by the previous paragraph. oO 


Remark 18.3.9. Many of the theorems stated in this section (and chapter) hold more 
generally for hereditary orders: this notion is pursued in Chapter 21. To see what this 
looks like in a more general context, see Curtis—Reiner [CR81, §26B]. A very general 
context in which one can make an argument like in section 18.3 was axiomatized by 
Asano; for an exposition and several references, see McConnell—Robson [McCR87, 
Chapter 5]. 


18.4 Picard group 


We now proceed to consider classes of two-sided ideals. We begin with a natural but 
abstract definition, in terms of bimodules. (Recall 20.3.7, that a bimodule is over R if 
the R-action on left and right are equal.) 


Definition 18.4.1. The Picard group of O over R is the group Picr(O) of isomorphism 
classes of invertible O-bimodules over R under tensor product. 


Remark 18.4.2. Some authors also write Picent(O) = Picz(q)(O) when considering 
the Picard group over the center of O, the most important case. To avoid additional 
complication, in this section we suppose that B is central over F, so Picr(O) = 
Picent(O). 


18.4.3. If Z C B is an R-lattice that is a fractional two-sided O-ideal, then Z is a 
O-bimodule over R. Conversely, if J is a O-bimodule over R then J &r F = B as 
B-bimodules, and choosing such an isomorphism gives an embedding J — B as an 
R-lattice. 


Lemma 18.4.4. Let I, J C B be R-lattices that are fractional two-sided O-ideals. Then 
I is isomorphic to J as O-bimodules over R if and only if there exists a € F* such that 
J=al. 


Proof. See Exercise 18.9. Oo 


18.4.5. By 18.4.3, there is a natural surjective map 
Idl(O) > Picr(O); 


we claim that the kernel of this map is PIdl(R) < Idl(O). By Lemma 19.5.1, every 
isomorphism class of invertible O-bimodule is represented by an invertible R-lattice 
I C B, unique up to scaling by F*, and if a € F* then aO = O if and only if 
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a € RA O* = R%, so the ideal aR € PIdl(R) is well-defined. Thus, we obtain a natural 
isomorphism 
Idl(O)/ PIdl(R) — Picr(O). (18.4.6) 


Equivalently, the sequence 
1 > R* > F* > Idl(O) > Picr(O) > 1 
is exact. One might profitably take (18.4.6) as the definition of Picr(O). 


18.4.7. If O’ is locally isomorphic O (so they are in the same genus), then there is a 
O, O’-connecting ideal J, and the map 


Idl(O) = Idl(O’) 
Testis 


is an isomorphism of groups restricting to the identity on Cl R, so from (18.4.6) we 
obtain an isomorphism 
Picr(O) = Picr(O’), 


analogous to Lemma 17.4.11. 


Our remaining task in this section is to examine the structure of Picr(O), and to 
this end we suppose that B is a quaternion algebra over F. 


18.4.8. Suppose that O is a maximal R-order with F = Frac R a global field. Then 
taking the quotient by PIdI(R) in the first two terms in (18.3.8) yields an exact sequence 


0 — CIR > Picr(O) > | [zz > 0. (18.4.9) 
plD 
Although this sequence need not split, it does show that the Picard group of the maximal 


order O is not far from the class group Cl R, the difference precisely measured by the 
primes that ramify in B. 


In general, for a quaternion R-order O we have the following result. 
Proposition 18.4.10. Picr(O) is a finite group. 


Proof. If O is maximal, we combine (18.4.9) with the finiteness of Cl R and the fact 
that there are only finitely many primes p dividing the discriminant D. 

Now let O be an R-order. Then there exists a maximal R-order O’ 2 O. We argue 
as in Exercise 17.3. We define a map of sets: 


Picr(O) > Picr(O’) 
[Z] => [0'10] 
The class up to scaling by F* is well-defined, and I’ := O'IO’ 2 I an R-lattice with 


left and right orders containing O’, but since O’ is maximal these orders equal O’ and 
I’ is invertible. 
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By the first paragraph, by finiteness of Picr(O’), after rescaling we may suppose 7” 
is one of finitely many possibilities. But there exists nonzeror € RsuchthatrO’ C O, so 


I = O'IO' c (rO LO) = r° cr 


sor?’ CI C T; since T’ /r?T is a finite group, this leaves only finitely many possibil- 
ities for J. o 


Remark 18.4.11. The study of the Picard group is quite general. It was studied in detail 
by Frohlich [Frö73]; see also Curtis-Reiner [CR87, §55]. 


18.5 Classes of two-sided ideals 


In this section, we compare the Picard group to the group of “ideals modulo principal 
ideals”. 
Let PIdl(O) < Idl(O) be the subgroup of principal two-sided fractional O-ideals 
(invertible by 16.5.4). Let 
Npx(O) = {a € BX : a 'Oa = O} 


be the normalizer of O in B*. 


Lemma 18.5.1. There is an exact sequence of groups 


1 > O* > Ngx(O) > PId\(O) > 1 


18.5.2 
ath OaO. ( ) 


Proof. We have a € Ngx(O) if and only if @O0 = Oa if and only if Oa@O is a 
principal two-sided fractional O-ideal, as in Exercise 16.17; this gives a surjective 
group homomorphism Ngx(O) — PIdl(O). The kernel is the set of a € B* such that 
aO = O, and this normal subgroup is precisely O%. o 


Proposition 18.5.3. There is an isomorphism of groups 
Npx(O)/(F*O*%) æ PIdI(O)/ PIdl(R) 


18.5.4 
aF*O* > class of OaO. ( ) 


If PIdl(O) < Idl(O) is normal, then the isomorphism (18.5.4) induces a natural exact 
sequence 
0 > Ngx(O)/(F*O*) > Picr(O) > Idl(O)/ PIdl(O) > 0 


18.5.5 
aF*O* > class of OaO. ; ) 


Proof. There is an isomorphism Ngx(O)/O* = PIdl(O) by (18.5.2). The image of 
F* < Ngx(O) in PIdl(O) under this map consists of two-sided ideals of the form OaO 
with a € F*; we have OaO = O if and only if a € O% if and only if a € R*, so this 
image is isomorphic to the group PId1(R) of principal fractional R-ideals via the map 
aR +> OaO. The first isomorphism follows. The exact sequence (18.5.5) is then just 
rewriting the natural sequence 


0 — PIdl(O)/ PIdl(R) — Idl(O)/ PIdl(R) — Idl(O)/ PIdl(O) > 0. o 
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Remark 18.5.6. The moral of Proposition 18.5.3 is that, unlike the commutative case 
where the two notions coincide, the two notions of “isomorphism classes of invertible 
bimodules” and “ideals modulo principal ideals” are in general different for a quaternion 
order. These notions coincide precisely when Ngx(O)/F* =~ O*/R*, or equivalently 
(by the Skolem—Noether theorem) that every R-algebra automorphism of O is inner, 
which is to say Autr(O) = Inng(O) = O*/R*. 


18.5.7. Unfortunately, the subgroup PIdl(O) < Idl(O) need not be normal in general 
(Exercise 17.11), so statements like Proposition 18.5.3 depend on the order O having 
good structural properties. If O is a maximal order, then Id1(O) is abelian, so the result 
holds in this case. 

In general, from the proof but using cosets one still obtains the equality 


#(Idl(O)/ PIdl(O)) - #(Ngx(O)/(F*O%)) = #Picr(O). (18.5.8) 


Remark 18.5.9. If O,O’ are connected, then Picr(O) =~ Picr(O’) by 18.4.7 but this 
isomorphism need not respect the exact sequence (18.5.5). Each order O “balances” the 
contribution of this group between the normalizer Ngx(O)/(F*O*) and the quotient 
Idl(O)/ PIdl(O)}—and these might be of different sizes for O’. We will return to examine 
more closely this structure in section 28.9, when strong approximation allows us to be 
more precise in measuring the discrepancy. 


We conclude with an application to the structure of (right) class sets. We examine 
from Lemma 17.4.13 the fibers of the surjective map (17.4.14) 


Cls O > TypO 
[7] > class of O.(). 


Refreshing our notation, let B be a central simple F'-algebra and let O C B an R-order. 


Proposition 18.5.10. The map I > [I] induces a bijection 
PIdl(O)\ Idl(O) e {[7] € Cls O : OL(D = O}. 


Proof. Let O’ be an order of the same type as O. Since (17.4.14) is surjective, there 
exists [J] € Cls O such that O_(J) ~ O’. We are free to replace O’ by an isomorphic 
order, so we may suppose O(I) = O’. For all [/’] € Cls O with O_U/’) ~ O’ (running 
over the fiber), since O,(a@/’) = aO’a™! for œ € B* we may suppose without loss of 
generality that the representative J’ has O,(/’) = O’. 

We then define a map 


PIdl(O’)\ Idl(O ) > G ]eClsO : OL’) = 0} (18.5.11) 

J = [J'I] 
The map is surjective, because if J’ = J’I~! then OL(J^) = OR(J’) = O, so J’ is a two- 
sided invertible O’-ideal. It is injective because if [J'7] = [K’/] for J’, K’ € Idl(O’) 
then K’ = a’ J’ with a’ € B*, but further we need OL(K’) = a'O'a’™! = O', so in fact 
[J’T] = [K’T] if and only if a’ € Ngx(O’), and the result then follows from Lemma 
18.5.1. Oo 
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We have the following corollaries. 


Corollary 18.5.12. We have 


#ClsO = > [Idl(O’) : PIdl(O’)] = #Picr O 2 a 


[O’JeTyp O [O’JeTyp 0 2O 
where zo = [Ngx(O’) : FXO” ]. 


Proof. For the first equality, combine Lemma 17.4.13 and Proposition 18.5.10, com- 
puting the size of the fibers. For the second, substitute (18.5.8) and use 18.4.7. o 


Corollary 18.5.13. Let O; be representatives of Typ O. For each i, let I; be a connecting 
O;, O-ideal, and let J;,; be representatives of PIdI(O;)\ Id1(O;). Then the set {Ji ;1i}i,; 
is a complete set of representatives for Cls O. 


Proof. We choose representatives and take the fibers of the map (17.4. 14). o 


Remark 18.5.14. When PIdl(O) < Id1(O), then in Proposition 18.5.10 we have written 
the class set Cls O as a disjoint union of abelian groups. The fact that the bijection is 
noncanonical is due to the fact that we choose a connecting ideal, so without making 
choices we obtain only a disjoint union of principal homogeneous spaces (i.e., torsors) 
under the groups PIdl(O’)\ Idl(O’). 


Exercises 


Unless otherwise specified, let R be a Dedekind domain with field of fractions F = 
Frac R, let B be a simple finite-dimensional F-algebra, and let O C B be an R-order. 


1. Show that the following are equivalent: 


(i) Ois a maximal R-order; 
(ii) OL) = Or() = O for all fractional two-sided O-ideals 7; and 
(iii) OL.) = Or() = O for all two-sided O-ideals 7 C O. 


2. Show that the zero ideal is a prime ideal of O. 

3. Let J C O be a nonzero two-sided ideal of O in the ring-theoretic sense: J is an 
additive subgroup closed under left and right multiplication by O. Show that J 
is an R-lattice. 


4. Let R be a DVR with maximal ideal p, and let O = i 


p R 
Show that the two-sided ideal p M2(R) € O is not a prime ideal. 
5. Let R := Z[V-6] and F := Q(V—6). Let B := (2, V—6 | F). 


) C B = M,(F). 


(a) Show that 2R = p5 and 3R = p3 for primes p2, p3 C R. 

(b) Show that Ram(B) = {p2, p3}. 

(c) Let O be a maximal order in B. Show that there is a unique two-sided ideal 
P such that P? = p0. 
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(d) Prove that [P2] € Picr(O) has order 4, and conclude that the sequence 
(18.4.9) does not split. 
(e) Show that we may take 


O = R +p3'(V-6 +i) + Rj + pz'(V-6 +; 
as the maximal order, and then that J is generated by i and V—6ij/2, and 
finally that P = (v-6 + i)/2. 


6. Let B = M, (F) with n > 2, let O = M, (R), let p C R be prime with k = R/p, 
and let O(p) = R + pO. 


(a) Show that O(p) is an order of reduced discriminant p°. 
(b) Show that OX =~ GL,,(R) normalizes O(p) S O, so that 


O(p)* < O* = GL, (R), 
and that the map 
O* => Idl(O) 
y= Oy = yO 


induces an injective group homomorphism PGL,,(k) — Idl(O). Conclude 
that Idl(O) is not an abelian group. 


7. Show that Theorem 18.3.4 holds more generally for B a semisimple F-algebra 
(but still O © B maximal). [Hint: Decompose B into a product of simple F- 


algebras. ] 
8. Let O be maximal, and let P;,...,P; C O be distinct prime two-sided ideals. 
Let : 
I:= Gee and J:= | 
i=l i=l 
with ej, fi EZ. 


(a) Prove that J C O if and only if e; > 0 for alli = 1,...,n, and in this case 
there is a ring isomorphism 


o/I = orë. 
i=l 
(b) Prove that J 2 J if and only if e; < fi; for all i. 
(©) Show I +J = J; Pr" and In J = TI, Pee. 


> 9. Prove Lemma 18.4.4: Show that fractional two-sided O-ideals [, J C B are 
isomorphic as O-bimodules over R if and only if there exists a € F* such that 
J = al. [Hint: Peek at Lemma 19.5.1.] 


10. Let K 2 F be a finite, separable extension and let S be the integral closure 
of R in K. Show that the map J + I &pr S defines a group homomorphism 
Pic O — Pic(O @p S). 
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Chapter 19 


Brandt groupoids 


In this chapter, we study the relationship between multiplication and classes of quater- 
nion ideals. 


19.1 > Composition laws and ideal multiplication 


To guide our investigations, we again appeal to the quadratic case. Let d € Z be a 
nonsquare discriminant. A subject of classical interest was the set of integral primitive 
binary quadratic forms of discriminant d, namely 


Q(d) = {ax? + bxy + cy” : a,b,c € Z, b? — 4ac = d, and gcd(a, b,c) = 1}. 


Of particular interest to early number theorists (Fermat, Legendre, Lagrange, and 
Gauss) was the set of primes represented by a quadratic form Q € Q(d); inquiries of 
this nature proved to be quite deep, giving rise to the law of quadratic reciprocity and 
the beginnings of the theory of complex multiplication and class field theory. 

An invertible, oriented change of variables on a quadratic form Q € Q(d) does 
not alter the set of primes represented, so one is naturally led to study the equivalence 
classes of quadratic forms under the (right) action of the group SL2(Z) given by 


Q |g), y) = Q, y)-g) for g € SLp(Z). (19.1.1) 


The set Cl(d) of SL2(Z)-classes of forms in Q(d) is finite, by reduction theory (see 
section 35.2): every form in Q(d) is equivalent under the action of SL2(Z) to a unique 
reduced form, of which there are only finitely many. To study this finite set, Gauss 
defined a composition law on Cl(d), giving Cl(d) the structure of an abelian group 
by an explicit formula. Gauss’s composition law on binary quadratic forms can be 
understood using 2 x 2 x 2 Rubik’s cubes, by a sublime result of Bhargava [Bha2004a]. 

Today, we see this composition law as a consequence of a natural bijection between 
Cl(d) and a set equipped with an obvious group structure. Let S = S(d) be the quadratic 
ring of discriminant d. Define the narrow class group Cl*(S) as 


the group of invertible fractional ideals of S under multiplication 
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modulo 


the subgroup of nonzero principal fractional ideals 
generated by a totally positive element 


(i.e., one that is positive in every embedding into R, so if d < O then this is no 
condition). (Alternatively, Cl*(S) can be thought of as the group of isomorphism 
classes of oriented, invertible S-modules, under a suitable notion of orientation.) Then 
there is a bijection between Cl(d) and Cl*(S): explicitly, to the class of the quadratic 
form Q = ax? + bxy + cy? € Q(d), we associate the class of the ideal 


-b+ Vd 


=aZ 
eee 


je c S(d). (19.1.2) 


Conversely, the quadratic form is recovered as the norm form on K = Qvd) restricted 


t 7 
i -b+ vd 
r 


5 | = ax? + bay toy” (19.1.3) 


Ninxy(as + 


where c = bd EZ. 

Much of the same structure can be found in the quaternionic case, with several 
interesting twists. It was Brandt who first asked if there was a composition law for 
(integral, primitive) quaternary quadratic forms: it would arise naturally from some 
kind of multiplication of ideals in a quaternion order, with the analogous bijection 
furnished by the reduced norm form. Brandt started writing on composition laws 
for quaternary quadratic forms in 1913 [Bra13], tracing the notion of composition 
back to Hermite, who observed a kind of multiplication law (bilinear substitution) for 
quaternary forms x$ + F(x1, x2, x3) in formulas of Euler and Lagrange. He continued 
on this note during the 1920s [Bra24, Bra25, Bra28, Bra37], when it became clear that 
quaternion algebras was the right framework to place his composition laws; in 1943, 
he developed this theme significantly [Bra43] and defined his Brandt matrices (that 
will figure prominently in Chapter 41. 

However, in the set of invertible lattices in B under compatible product, one cannot 
always multiply! However, this set has the structure of a groupoid: a nonempty set with 
an inverse function and a partial product that satisfies the associativity, inverse, and 
identity properties whenever they are defined. Groupoids now figure prominently in 
category theory (a groupoid is equivalently a small category in which every morphism 
is an isomorphism) and many other contexts; see Remark 19.3.12. 

Organizing lattices by their left and right orders, which by definition are connected 
and hence in the same genus, we define 


Brt(O) = {I : I C B invertible R-lattice and OL (7), Or(/) € Gen O}; (19.1.4) 


visibly, Brt(O) depends only on the genus of O. Organizing lattices according to the 
genus of orders is sensible: after all, we only apply the composition law to binary 
quadratic forms of the same discriminant, and in the compatible product we see 
precisely those classes whose left and right orders are connected. In other words, the 
set of invertible lattices in the quadratic field K = Q(Vd) has the structure of a groupoid 
if we multiply only those lattices with the same multiplicator ring. 
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Theorem 19.1.5. Let B be a quaternion algebra over Q and let O C B be an order. 
Then the set Brt(O) has the structure of a groupoid under compatible product. 


We call Brt(O) the Brandt groupoid of (the genus of) O. 

We now consider classes of lattices. A lattice 7 C B has the structure of a 
OL), Or()-bimodule. Two invertible lattices 7, J with the same left and right orders 
OL(D = OL(J) and Or) = Or(J) are isomorphic as bimodules if and only if there 
exists a € Q* such that J = al. Accordingly, we say two lattices Z, J c B are homo- 
thetic if there exists a € Q* such that J = al. 

For connected orders O, O’ c B, we define 


Pic(O, O’) := {[I] : J c B invertible and O(I) = O and Or(/) = O’} (19.1.6) 


to be the set of homothety classes of lattices with left order O and right order O’, or 
equivalently the set of isomorphism classes of O, O’-bimodules over R. Restricting to 
the subset of lattices with O = O’, and the lattices J c B are O-bimodules, we recover 
Pic(O, O) = Pic O the Picard group from the previous chapter. 
Now let O C B be an order and let O; be representative orders for the type set 
Typ O. Let 
BrtClO :=| | Pic(O;, O;). (19.1.7) 


i,j 


Theorem 19.1.8. Let B be a quaternion algebra over Q and let O C B be an order. 
Then the set BrtClO has the structure of a groupoid that, up to isomorphism, is 
independent of the choice of the orders Oj. 


In particular, BrtCl O depends only on the genus of O. We call the set BrtCl O the 
Brandt class groupoid of (the genus of) O. 

Returning to quadratic forms, to each R-lattice J with nrd(/) = aZ and a > 0, we 
associate the quadratic form 


nrd: I> Z 
nrdz(4) = nrd(q)/a 


Alternatively, up to similarity we can just take the quadratic module nrd |z: Z — nrd(/) 
remembering that the quadratic form takes values in nrd(/). The discriminant of an 
invertible lattice 7 C B is equal to the common discriminant N? of the genus of its left 
or right order. The quadratic forms nrd; are all locally similar, respecting the canonical 
orientation 5.6.7 on B. Therefore, there is a map 
Quaternary quadratic forms over Z 
BrtClO > locally similar to nrd |o 
| up to oriented similarity | 


[7] }> nrdz 


is (well-defined and) surjective. Unfortunately, this map is not injective (a reflection 
of the lack of a natural quotient groupoid homomorphism): the Brandt class is a kind 
of rigidification of the oriented similarity class. Nevertheless, Theorem 19.1.8 can be 
viewed as a generalization of Gauss composition of binary quadratic forms, defining a 
partial composition law on (rigidified) classes of quaternary quadratic forms. 
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19.2 Example 
SLEI Vis Lad a bs 
Consider the quaternion algebra B := | ——— |] with standard basis 1, i, j, k = ij, and 
the maximal order O of reduced discriminant 37 defined by 
l+i+j 2+i+k 
i+j Z4 +i+ 
2 4 


The type set Typ O of orders connected to O has exactly two isomorphism classes, 
represented by O; = O and 


O:=Z+iZ+ Z. (19.2.1) 


Z. 


-7i+j 2-3i+k 
Opie zane 2 tig 2 Ett 


4 


These orders are connected by the O2, O1-connecting ideal 


I:=3Z+3iZ + 


3—-i+j 2+3i-k_ | 3-i+j 
5 Z+—7—Z =30+ 5 O. 


There are isomorphisms 
Pic(O) = Pic(O2) =~ Z/2Z 


with the nontrivial class in Pic(O;) represented by the principal two-sided ideal J; = 
jO = Oj with j € Ngx(O), and the nontrivial class in Pic(O2) represented by the 
nonprincipal (but invertible) ideal 


111 -259i + j 
Jh := IJI! = 3702 F ——* 0». 


In particular, 
Clsp(O1) = {[01], U1, [27} and Clsa(O2) = {[02], [J2], UI}, 
with [J1] = [01]. 


We can visualize this groupoid as a graph as in Figure 19.2.2, with directed edges 
for multiplication: 


[I Jo] = [AT] 


Figure 19.2.2: BrtCl O, for discrd O = 37 
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The Brandt class groupoid 
BrtCl O = Pic(O,) u C1(O1, O2) U Cl(O2, O1) U Pic(O2) 


has 2+4+4+2 = 12 elements; it is generated as a groupoid by the elements 
[J1], [J2], [Z], with relations 


[J1] = [0], [2] = [02], [JJ] = Wil. 


Restricting the reduced norm to these lattices, we obtain classes of quaternary 
quadratic forms of discriminant 377: 


nrdo, = t? + ty +t¢ + 2x* + xy + 2xz +5y? + yz + 102” 
nrdo, =f +tx+tz+4x? — xy +4xz + Sy” + 2yz + 627 
nrdy = 3t? —tx + ty + tz + 3x* — 3xy — xz + 4y* — yz + 5z” 
nrd; = 3t? + tx — ty — tz + 3x” — 3xy — xz + 4y? — yz + Sz? 
nrd;, = 20? —tx + ty + 2x* —2xy + xz + 3y* + 2yz + 102" 


The quadratic forms nrd7 and nrd; are isometric but not by an oriented isometry. 


19.3 Groupoid structure 
We begin with some generalities on groupoids. 


Definition 19.3.1. A partial function f: X — Y is a function defined on a subset of 
the domain X. 


Definition 19.3.2. A groupoid G is a set with a unary operation ~! : G > G anda 
partial function * : G x G — G such that * and ~! satisfy the associativity, inverse, 
and identity properties (as in a group) whenever they are defined: 


(a) [Associativity] For all a,b,c € G such that a x b is defined and (a * b) * c is 
defined, both b + c and a * (b x c) are defined and 


(ax b)*c=ax(b*c). 


(b) [Inverses] For all a € G, there exists a~! € G such that a x a7! and a7! * a are 
defined (but not necessarily equal). 
(c) [Identity] For all a, b € G such that a * b is defined, we have 


(axb)*xb!'=a and a!«(axb)=b. (19.3.3) 


A homomorphism ¢: G — G” of groupoids is a map satisfying 
p(a * b) = o(a) * ob) 
for all a,b € G. 
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19.3.4. Let G be a groupoid. Then the products in the identity law (19.3.3) are defined 
by the associative and inverse laws, and it follows that e = a * a`! , the left identity of 
a, and f = a"! x a the corresponding right identity of a, satisfy e *a = a = a * f 
for all a € G. (We may have that e + f, i.e., the left and right identities for a € G 
disagree.) The right identity of a € G is the left identity of a~! € G, so we call the set 


{e=axa!:aeG} 
the set of identity elements in G. 


19.3.5. Equivalently, a groupoid is a small category (the class of objects in the category 
is a set) such that every morphism is an isomorphism: given a groupoid, we associate 
the category whose objects are the elements of the set S := {e = axa! : a € G} of 
identity elements in G and the morphisms between e, f € S are the elements a € G 
such that e * a and a * f are defined (see Proposition 19.3.9 below). Conversely, to 
a category in which every morphism is an isomorphism, we associate the groupoid 
whose underlying set is the union of all morphisms under inverse and composition. 


Example 19.3.6. The set of homotopy classes of paths in a topological space X forms 
a groupoid under composition: the paths y1, y2 : [0, 1] —> X can be composed to a path 
y2 0 yı : [0,1] —> X if and only if y2(0) = yı (1). 


Example 19.3.7. A disjoint union of groups is a groupoid, with the product defined if 
and only if the elements belong to the same group; the set of identities is canonically 
in bijection with the index set of the disjoint union. 


19.3.8. Let G be a groupoid and let e, f € G be identity elements. We say that e is 
connected to f if there exists a € G such that a has left identity e and right identity f. 
The relation of being connected defines an equivalence relation on the set of identity 
elements in G, and the resulting equivalence classes are called connected components 
of G. We say G is connected if all identity elements e, f € G are connected; connected 
components of a groupoid are connected. 

Viewing the groupoid G as a small category as in 19.3.5, we say two objects are 
connected if there exists a morphism between them, and the category is connected if 
every two objects are connected. 


If e € Gis an identity element in a groupoid G, then the set of elements a € G 
with left and right identity equal to e has the structure of a group; for the associated 
category, this is the automorphism group of the object. More generally, the following 
structural result holds. 


Proposition 19.3.9. Let G be a connected groupoid, and let e, f be identity elements 
in G. Let 
G(e, f) := {a € G: ex aanda x f are defined}. (19.3.10) 


Then the following statements hold. 


(a) The set G(e, e) is a group under =. 
(b) There is a (noncanonical) isomorphism G(e, e) = G(f, f). 
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(c) The set G(e, f) is a principal homogeneous space for G(e, e) ~ G( f, f). 


Proof. The set G(e, e) is nonempty, has the identity element e € G, and if a € G(e, e) 
then a x a7! =a! «a =e. If e, f are identity elements, since G is connected there 
exists a € G(e, f), so ale G(f, e) and the map G(e, e) > G(f, f) by x WP a* xx a! 
is an isomophism of groups. Similarly, the set G(e, f) has a right, simply transitive 
action of G(e, e) under right multiplication by x. o 


19.3.11. The moral of Proposition 19.3.9 is that the only two interesting invariants of a 
connected groupoid are the number of identity elements (objects in the category) and 
the group of elements with a common left and right identity (the automorphism group 
of every one of the objects). A connected groupoid is determined up to isomorphism 
of groupoids by these two properties. 


Remark 19.3.12. After seeing its relevance in the context of composition of quaternary 
forms, Brandt set out general axioms for his notion of a groupoid [Bra27, Bra40]. 
(Brandt’s original definition of groupoid is now called a connected groupoid.) This 
notion has blossomed into an important structure in mathematics that sees quite general 
use, especially in homotopy theory and category theory. It is believed that the groupoid 
axioms influenced the work of Eilenberg-Mac Lane [EM45] in the first definition of 
a category: see e.g., Brown [Bro87] for a survey, Bruck [Bruc71] for context in the 
theory of binary structures, as well as the article by Weinstein [Wein96]. 

Groupoids exhibit many facets of mathematics, arising naturally in functional anal- 
ysis (C*-algebras) and group representations, as Figure 19.3.13 indicates (appearing 
in Williams [Will2001, p. 21], and attributed to Arlan Ramsay). 


r 


group 


actions 
groups 
sets 


equivalence 
groupoids relations 


XL Jt 


Figure 19.3.13: Groupoids, as they relate to other mathematical objects 


(In this diagram, for example, a set X is a groupoid with only the multiplications 
x * x = x for x € X. The corner between sets and groups can be explained by a set 
with one element which can be made into a group in a unique way.) 
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19.4 Brandt groupoid 


Let R be a Dedekind domain with field of fractions F and let B be a quaternion algebra 
over F. 


Proposition 19.4.1. The set of invertible R-lattices in B is a groupoid under inverse 
and compatible product; the R-orders in B are the identity elements in this groupoid. 


Proof. For the associative law, suppose 7, J, K are invertible R-lattices with JJ and 
(IJ)K compatible products. Then Or(J) = OL(J) = OL(JK) and OR(IJ) = OR(J) = 
OL(K) by Lemma 16.5.11, so the products JK and I(J K) are compatible. Multiplication 
is associative in B, and it follows that (JK) = (IJ)K. Inverses exist exactly because 
we restrict to the invertible lattices. 

The law of identity holds as follows: if I, J are invertible R-lattices such that JJ is 
a compatible product, then (IJ)J lisa compatible product since Og(IJ) = Or(J) = 
O.(J~!), and by associativity 


(DJ! = WII) = IOL(J) = TOR) = 1, 


with a similar argument on the left. If J is an invertible R-lattice, then ZI -I = OL(D is 
an R-order in B, and every R-order O arises by taking J = O itself, so the R-orders are 
the identity elements in the groupoid. o 


Lemma 19.4.2. The connected components of the groupoid of invertible R-lattices in 
B are identified by the genus of the (left or) right order, and the group defined on such 
a component corresponding to an order O is Idl(O), the group of invertible two-sided 
O-ideals. 


Proof. By Proposition 19.4.1, the identity elements correspond to orders, and two 
orders are connected if and only if there is a (invertible, equivalently locally principal) 
connecting ideal if and only if they are in the same genus, as in section 17.4. The 
second statement follows immediately. o 


As a consequence of Lemma 19.4.2, the subset of R-lattices whose (left or) right 
order belong to a specified genus of orders is a connected subgroupoid. 


Definition 19.4.3. Let O C B be an R-order. The Brandt groupoid of (the genus of) 
Ois 


Brt(O) = {I : I C B invertible R-lattice and O, (J), OR(7) € Gen O}. 


In the next section, we consider a variant that considers classes of lattices, giving 
rise to a finite groupoid. 
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19.5 Brandt class groupoid 


We now organize lattices up to isomorphism as bimodules for their left and right orders. 


Lemma 19.5.1. Let I,J C B be lattices with O.J) = O (J) = O and Op() = 
Or(J) = O’. Then I is isomorphic to J as O, O'-bimodules if and only if there exists 
a € F* such that J = al. 


Proof. We have F = Z(B). If J = al with a € F*, then multiplication by a gives an 
R-module isomorphism J — J that commutes with the left and right actions and so 
defines a O, O’-bimodule isomorphism. 

Conversely, suppose that 6: I — J is a O,O’-bimodule isomorphism. Then 
o(uav) = ud(a)v for all a € J and u,v € O. Extending scalars to B, we obtain a 
B-bimodule isomorphism ¢: IF = B > JF = B. Let ¢(1) = $. Then for all a € B, 
we have ¢(@) = (l)a = Ba; but by the same token, é(@) = «œp for all a € B, so 
BEZB=F. oO 


Definition 19.5.2. Let J, J C B be R-lattices. We say that J is homothetic to J if there 
exists a € F* such that J = al. 


Homothety defines an equivalence relation, and we let [Z] denote the homothety 
class of an R-lattice 7. The left and right order of a homothety class is well-defined. 


19.5.3. The set of homothety classes of invertible R-lattices J C B has the structure of a 
groupoid under compatible product, since the compatible product [ZJ] is well-defined: 
if I’ = al and J’ = bJ with a,b € F*, then [I’J’] = [abIJ] = [IJ] since a,b are 
central. 

The map which takes an invertible lattice to its homothety class yields a surjective 
homomorphism of groupoids. Taking connected components we obtain a connected 
groupoid associated to a (genus of an) R-order O. Recalling 19.3.11, we note that the 
group at an R-order O is Picr(O), but there are still infinitely many orders (objects in 
the category). 


In order to whittle down to a finite groupoid, we fix representatives of the type set, 
and make the following definitions. 


19.5.4. For R-orders O, O’ C B, let 
Picr(O, O’) = {[1] : I c B invertible and O,(J) = O, OR(D = O'} 


be the set of homothety classes of R-lattices in B with left order O and right order 
O’; equivalently, by Lemma 19.5.1, Pic(O, O’) is the set of isomorphism classes of 
invertible O, O’-bimodules over R. In particular, Picr(O) = Picr(O, O). 

We have Picr(O, O’) + 0 if and only if O is connected to O’. 


Let O c B be an order and let O; be representative orders for the type set Typ O. 
We define 
BrtCl O := || Picr(O;, O;). 


ij 
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Theorem 19.5.5. Let R be a Dedekind domain with field of fractions F, and let B be 
a quaternion algebra over F. Let O C B be an order. Then the set BrtCl O has the 
structure of a finite groupoid that, up to isomorphism, is independent of the choice of 
the orders Oi. 


In particular, by Theorem 19.5.5 BrtClO depends only on the genus of O up to 
groupoid isomorphism. We call the set BrtCl O the Brandt class groupoid of (the 
genus of) O. 


Proof. The groupoid structure is compatible multiplication, with 
Picr(O;, O;) Picr(O;, Ox) Cc Picr(O;, Ox) 


for all i, j, k, in other words, BrtCl O is a connected subgroupoid of the groupoid of 
homothety classes of R-lattices 19.5.4. 


The groupoid is finite, by 19.3.11: the type set Typ O is finite by Main Theorem 
17.7.1 and Picg(O) is finite by Proposition 18.4.10. Explicitly, if [J;;] € Picr(O;, O;) 
then the map 


Picr(O) = Picr(O;) = Picr(O;j, O;) 
[Z] => UE; ] 
is a bijection of sets, just as in the proof of Proposition 19.3.9. Therefore 


#BrtCl O = #Picr(O)# Typ O. (19.5.6) 


Finally, this subgroupoid is independent of the choices of the orders O; as follows: 
all other choices correspond to O; = a;O;a;! with a; € B*, and the induced maps 


Picr(O;, O;) > Pic(O;, O%) 
[1] > [æila7'] = [7] 
together give an isomorphism of groupoids, since 
IJ] = [aila ajaz] = le; Jo%"] 


for all [7] € Picr(O;, O;) and [J] € Picr(O;, Ox). o 


Remark 19.5.7. Unfortunately, there is not in general a natural equivalence relation on 
Brt(O) giving rise to a quotient groupoid homomorphism Brt(O) — BrtC1 O. Rather, 
we find that BrtCl O is naturally a subgroupoid of Brt(O). 


Turning to the invariants 19.3.11, we see that the Brant class groupoid BrtC1 O 
encodes two things: the group Picr(O) and the type set Typ O. 


Remark 19.5.8. The modern theory of Brandt composition was investigated by Kaplan- 
sky [Kap69] and generalized to Azumaya quaternion algebras over commutative rings 
by Kneser—Knus—Ojanguren—Parimala—Sridharan [KKOPS86] for a generalization of 
the composition law to Azumaya algebras over rings. 
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19.6 Quadratic forms 


We now connect the Brandt class groupoid to quadratic forms. For simplicity, we 
suppose char F + 2 throughout this section. 


19.6.1. We begin by recalling Proposition 4.5.17: for the quaternary quadratic form 
nrd: B —> F, every oriented similarity of nrd is of the form 


B= B 
xme axp! 


with œ, € B (in particular, respecting the canonical orientation 5.6.7 of B); the 
similitude factor of such a map is u = nrd(a)/ nrd(£). 


Let J C B be a projective R-lattice. 


19.6.2. Generalizing Exercise 10.2, the reduced norm restricts to give a quadratic form 
on J. We are given that J is projective of rank 4 as an R-module. Therefore the map 


nrdz : I > L =nrd(/) 


is a quaternary quadratic module over R. 

If J c Bis another projective R-lattice, and f is an oriented similarity from nrd; to 
nrd,, then extending scalars by F we obtain a oriented self-similarity of nrd: B — B; 
by 19.6.1, we conclude that J = œ187! for some a, 8 € B*: 


nrdy 


L 
a i? (19.6.3) 
J = aI p + abt 


19.6.4. Suppose that nrd(/) = L = aR is principal. Then there is a similarity 


I= paR 


| le (19.6.5) 


In other words, if every value of the quadratic form is divisible by a, then up to 
similarity it is equivalent to consider the quadratic form a7! nrd, taking values in R. 


a`! nrdy 


— R 


Lemma 19.6.6. Suppose I is invertible. Then the quadratic form nrdy : I > L is 
locally oriented similar to ntrdg : O > R, where O = Or(J). 


Proof. By 19.6.3, if I = œO is principal, then nrd; is similar to nrdo. If / is invertible, 
then J is locally principal, so for all primes p of R the quadratic form nrd: I, > Ly 
is similar to nrd: Op — Rp where Oy is the left (or right) order of Ip. (A similar 
statement holds on the left.) oO 
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19.6.7. From Lemma 15.3.6, it follows from Lemma 19.6.6 that 
disc(nrd;) = disc(O) 


and in particular this discriminant is a square. 


The quadratic forms nrd; are all locally similar, respecting the canonical orientation 
5.6.7 on B. Therefore, the map 


Quaternary quadratic forms over Z 
BrtClO > locally similar to nrdo 
up to oriented similarity 


[J] } nrd; 
is (well-defined and) surjective. 
Remark 19.6.8. The Brandt groupoid is connected as a groupoid. This can also be 
viewed in the language of quadratic forms: a connected class of orders is equivalently 


a genus of integral ternary quadratic forms, and this is akin to a resolvent for the 
quaternary norm forms. We refer to Chapter 23 for further discussion. 


Exercises 


1. Verify the computational details in the example of section 19.2. 


2. Let B = (-1,-11 | Q) with disc B = 11 and O = ZG, (1 + j)/2) a maximal 
order. Compute BrtCl O, in a manner analogous to the example of section 19.2. 


3. Let G be a groupoid. 


(a) Show that if a, b,c € G and both a + b and a » c are defined, then b * b7! = 
c » c7! (and both are defined). 


(b) Show that for all a € G we have (a7!)~!. 


4. Let G be a group acting on a nonempty set X. Let 
A(G, X) = {(g,x): 8 € Gx € X}. 


Show that A(G, X) has a natural groupoid structure with (g, x) * (h, y) = (gh, y) 
defined if and only if x = hy. What are the identity elements? 


5. Show that in a homomorphism ¢: G — G’ of groupoids, the set of identity 
elements of G maps to the set of identity elements of G’. 


6. Let C be a small category. Show that there is a unique maximal subcategory that 
is a groupoid. [Hint: Discard all nonisomorphisms.] 
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T: 


10. 


11. 


Let X be a set and let ~ be an equivalence relation on X, thought of as a 
subset S C X x X. Equip S with the partial binary operation + defined by 
(x, y) * (y,z) = (x, z) for (x, y), (y,z) € S (and (x, y) * (w, z) is not defined if 
y + w). Show that S is a groupoid. [This shows that “equivalence relations are 
groupoids”’, cf. (19.3.13).] 


. Let F be a field and let GL(F) = U*_, GL, (F). Show that GL(F) has a natural 


n=1 
structure of groupoid, sometimes called the general linear groupoid over F. 


. Show that the reduced norm is a homomorphism from the groupoid of invertible 


R-lattices in B to the group(oid) of fractional R-ideals in F. 

Let X be a nonempty topological space, and let x,y € X. Recall that a path 
from x to y is a continuous map vo: [0,1] —> X with v(O) = x and v(1) = y. 
We say that paths vo, vı: [0,1] — X from x to y are homotopic if there exists 
a continuous map H: [0,1] x [0,1] —> X such that H(0, s) = x and H(1, s) = y 
for all s € [0,1] and H(t, 0) = vo and A(t, 1) = vı(t) for all t € [0, 1]. [So each 
H(t, s) for fixed t € [0,1] is a path from x to y, and this set of paths varies 
continuously. ] 


(a) Check that being homotopic defines an equivalence relation on the set of 
continuous paths from x to y. 

(b) Check that paths can be composed (going at twice speed) and that compo- 
sition of paths is well-defined on homotopy classes. 

(c) Show that composition of homotopy classes of continuous paths is asso- 
ciative. 


Let II(X) be the category whose objects are the points of X and with mor- 
phisms to be the set of homotopy classes of continuous paths from x to y under 
composition. 


(d) Show that II(X) is a category. 

(e) Show that II(X) is a groupoid, called the fundamental groupoid of X. 

(£) Finally, for all x € X, show the set of all morphisms from x to x in II(X) is 
a group (the more familiar fundamental group 7) (X, x) with base point 


x). 


Continuing the previous exercise, show that if X is path-connected, then II(X) 
is equivalent as a category to a groupoid with one object. [Hint: choose a point 
x € X, look at the group(oid) m(X, x).] 
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Chapter 20 


Integral representation theory 


In this chapter, we consider a slightly more general framework on the preceding 
chapters: we consider lattices as projective modules, and relate this to invertibility and 
representation theory in an integral sense. 


20.1 > Projectivity, invertibility, and representation theory 


Let R be a Dedekind domain with field of fractions F = Frac R. Finitely generated, 
projective R-modules have played an important role throughout this text, and we now 
seek to understand them in the context of orders. 

To this end, let B be a finite-dimensional F-algebra and let O C B be an R-order. A 
left O-lattice M is an R-lattice that is a left O-module, i.e., M is a finitely generated, 
projective (locally free) R-module that has the structure of a left O-module. We make 
a similar definition on the right. 

We say that a left (or right) O-lattice M is projective if it is a direct summand of 
a free left (or right) O-module. Projectivity for lattices in B is related to invertibility 
as follows (Theorem 20.3.3). 


Theorem 20.1.1. Let I C B be an R-lattice. Then I is invertible if and only if I is 
projective as a left O (I)-module and as a right Op(1)-module. 


One can also tease apart left and right invertibility if desired; in the quaternion 
context, these are equivalent anyway because of the standard involution (Main Theorem 
20.3.9). 

Given our efforts to understand invertible lattices, one may think that Theorem 
20.1.1 is all there is to say. However, two issues remain. First, there may be finitely 
generated (projective) O-modules that are not lattices, and they play a structurally 
important role for the order O. Second, and this point is subtle: there may be lattices 
I C B that are projective as a left O-module, but with OL(/) 2 O; in other words, such 
lattices are invertible over a larger order, even though they still have good properties 
as modules over the smaller order. 
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Example 20.1.2. Let 


Z Z 
O:= (iz a C B := M(Q) 


be the order consisting of integral matrices that are upper triangular modulo a prime 
p. We will exhibit both of the issues above. First, we consider O as a left O-module: it 


decomposes as 
1 0 0 0 Z 0 0 Z 
0-05 Jools i}e(r2 oele z) 


=~ ia D (2) =: h 6h. 


The two left O-modules 7/1, J are visibly projective, and they are not isomorphic: 
intuitively, an isomorphism would have to be multiplication on the left by a 2x2-matrix 
that commutes with multiplication O, and so it must be scalar. More precisely, suppose 
ġ € Homo(h, I2) is an isomorphism of left O-modules. Extending scalars, we have 


(20.1.3) 


Qh = Qk = (3) =: V, 
and the extension of ¢ gives an element in Autg(V) where B = M2(Q) = Endg(V), so 
commutes with the action of B and is therefore central: which is to say ¢ is a scalar 
matrix, and that is absurd. 

The lattice J = M>(Z) is invertible as lattice, since it is an order (!); and it is a 
two-sided fractional O-ideal, but it is not sated. We claim that J is also a projective 
O-module: this follows from the fact that M2(Z) = i as a left O-module, so M2(Z) 
is isomorphic to a direct summand of O°. 


In this chapter, we establish some basic vocabulary of modules in the language of 
the representation theory of an order. In the case of algebras over a field, we defined 
a Jacobson radical as a way to measure the failure of the algebra to be semisimple. 
Similarly, for every ring A, we define the Jacobson radical rad A to be the intersection 
of all maximal left ideals of A: it again measures the failure of left indecomposable 
modules to be simple. There is a left-right symmetry to rad A, and in fact rad A C A is 
a two-sided A-ideal. 

Locally, the Jacobson radical plays a key role. Suppose R is a complete DVR with 
unique maximal ideal p. Then p = rad O since it is the maximal ideal. Moreover, we 
will see that pO C rad O, so O/ rad O is a finite-dimensional semisimple k-algebra. 
Much of the structure of O-modules is reflected in the structure of modules over the 
quotient O/ rad O (see Lemma 20.6.8). 


Remark 20.1.4. In representation theory, generally speaking, to study the action of a 
group on some kind of object (vector space, simplicial complex, etc.) one introduces 
some kind of group ring and studies modules over this ring. The major task becomes 
to classify such modules. For example, let R be a Dedekind domain with F = Frac R, 
and let O be an R-order in a finite-dimensional F-algebra B. A finitely generated 

integral representation of O is a finitely generated O-module that is projective as 
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an R-module (in particular, is R-torsion free). The integral representations of O are 
quite complicated! Nevertheless, integral representation theory is a beautiful blend of 
number theory, commutative algebra, and linear algebra. In section 20.6, we will see 
some of the basic ingredients when R is a DVR, and in section 21.4 in the next chapter 
we will show that hereditary orders have a tidy integral representation theory. For 
more on the subject, see the surveys by Reiner [Rei70, Rei76] as well as the massive 
treatises by Curtis—Reiner [CR81, CR87]. 


20.2 Projective modules 


As we will need the notion over several different rings, we start more generally: let A 
be a ring (not necessarily commutative, but with 1). For an introduction to the theory 
of projective modules and related subjects, see Lam [Lam99, §2] and Curtis—Reiner 
[CR81, §2], and Berrick—Keating [BK2000, $2]. 


Definition 20.2.1. Let P be a finitely generated left A-module. Then P is projective 
as a left A-module if it is a direct summand of a free left A-module. 


A finitely generated free module is projective. The notion of projectivity is quite 
fundamental, as the following proposition indicates. 


Proposition 20.2.2. Let P be a finitely generated left A-module. Then the following 
are equivalent: 
(i) P is projective; 


(ii) There exists a finitely generated left A-module Q such that P ® Q is free as a left 

A-module. 
(iii) Every surjective homomorphism f: M — P (of left A-modules) has a splitting 

g:P>Mi(ie, fog =idp); 
(iv) Every diagram 

Z 
E # ? 
& 


M — > N —>0 
of left A-modules with exact bottom row can be extended as indicated, with 
p= f oq; and 
(v) Homa(P, —) is a (right) exact functor. 
Proof. See Lam [Lam99, Chapter 2]. In statement (v), given a short exact sequence 
0>Q0>M>N->0 
then Hom4(P, —) is always left exact, so 


0 — Hom,(P, Q) — Hom,(P, M) — Hom,(P, N) (20.2.3) 
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is exact; the condition for P to be projective is that Hom,(P, —) is right exact, so the 
full sequence 


0 — Hom,(P, Q) — Hom,(P, M) — Homa (P, N) > 0 (20.2.4) 
is short exact. o 
20.2.5. A finite direct sum P = Q,, P; of finitely generated A-modules is projective if 


and only if each summand P; is projective: indeed, the functor Homg(P, —) is naturally 
isomorphic to [[; Homg(P;, —), so we apply condition (v) of Proposition 20.2.2. 


20.2.6. Localizing Proposition 20.2.2(v), and using the fact that a sequence is exact if 
and only if it is exact locally (Exercise 20.1(a)), we see that P is projective as a left 
O-module if and only if Py») is projective as a left O(p)-module for all primes p C R 


Definition 20.2.7. A left O-lattice is an R-lattice M that is a left O-module. 
We make a similar definition on the right. 


20.2.8. A left O-lattice M is locally free of rank r > 1 if Mp ~ OF as left O-modules 
for all primes p C R. If follows from 20.2.5 and 20.2.6 that a locally free O-lattice is 
projective. 


20.3 Projective modules and invertible lattices 


Now let R be a noetherian domain with F := Frac R, let B be a finite-dimensional 
F-algebra, and let O C B be an R-order. 

One can extend the base ring of the module while preserving projectivity, as 
follows. 


Lemma 20.3.1. Let O C O’ be R-orders in B and let M be a left O'-lattice. If M is 
projective as a left O-module, then M is projective as a left O’-module. 


Proof. Suppose M is projective as a left O-module; then M $ N ~ O" for somer > 0. 
Tensor with O’ to get 


(O’ 80 M) ®(O’ 80 N) = (O'Y. (20.3.2) 


Since multiplication gives an isomorphism of left O’-modules O’@9 M æ> O'M = M, 
the result follows. (More generally, see Harada [Har63a, Lemma 1.3].) oO 


In the commutative case, an R-lattice a G F is invertible as an R-module if and 
only if a is projective as a (left and right) R-module. Something is true in this more 
general context. 


Theorem 20.3.3. Let I C B be an R-lattice. 


(a) ITH = Og() if and only if I is projective as a left O.(1)-module, and IT! = 
OL) if and only if I is projective as a right Opn(1)-module. 
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(b) I is projective as a left OL (D)-module and a right Og()-module if and only if I 
is invertible (as an R-lattice). 


The difference between (a) and (b) in Theorem 20.3.3 is the compatibility of the 
two products. 


Proof. We begin with (a). To prove the implication (=), suppose J~'7 = Op(J); then 

there exist a; € J and a} € I -l such that i @;a; = 1. We may extend the set a; 

to generate J as a left O_(/)-module by taking a; = 0 if necessary. We define the 
surjective map 

f:M= Doe: >I 

i (20.3.4) 


ei > Qi. 
Consider the map 
g:loM 
pre Y, bajer 
i 


the map g is defined because for all 6 € I, we have Ba} € IT}, and as always ZIT! cI 
so ZI! C O(I). The map g is a splitting of f since 


(f © 8X8) = > Baai = 22 aai = B 


Therefore / is a direct summand of M, so T is projective as a left O.(/)-module. 

Next we prove (<=). There exists a nonzero r € Z N R (Exercise 9.2), so to show 
that J is left invertible, we may replace 7 with r~'J and therefore suppose that 1 € 7. 
Following similar lines as above, let a; generate J as a left O_(/)-module, and consider 
the surjective map f: M = @; OL(De; — I by e; + aj. Then since J is projective as 
a left O_(/)-module, this map splits by a map g : I — M; suppose that g(1) = (a7); 
with a; € O_(/); then 


(fog) =1= )) afay. (20.3.5) 
For all $ € I, we have g(6) = (Ba;); € M, so Ba; € O(I) for all i; therefore for 


all a, p € I we have Ba;a@ € OL(I)I C I, whence a? € I~! by definition. Thus from 
(20.3.5) we have 1 € I!I, whence 


ORD) € THORD =H C ORD 


and thus equality holds. 
For part (b), the implication (4) follows from (a), and the implication (=) for 
compatibility follows from Proposition 16.5.8. o 


Remark 20.3.6. The proof of Theorem 20.3.3 follows what is sometimes called the 
dual basis lemma for a projective module: see Lam [Lam99, (2.9)], Curtis-Reiner 
[CR81, (3.46)], or Faddeev [Fad65, Proposition 18.2]. 
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20.3.7. Let O, O’ C B be R-orders. A O, O’-bimodule over R is an abelian group M 
with a left O-module and a right O’-module structure with the same action by R on 
the left and right (i.e., acting centrally, so rm = mr for all r € R and m € M). The 
R-lattice J C B is an O_(J), Op(J)-bimodule over R. 

When the equivalent conditions of Theorem 20.3.3(b) hold, we say that J is pro- 
jective as a OL (7), Or(/)-bimodule over R. 


Remark 20.3.8. In Theorem 20.3.3, we only considered an R-lattice J as a module over 
its left and right orders (i.e., we considered only sated fractional O, O’-ideals), for the 
reasons explained in 16.5.18. 

Although invertible requires working in this way, it is possible for an R-lattice I to 
be projective as a left O-module but still O Ç O_(): for example, if O is a hereditary 
order (see Chapter 21) contained properly in a maximal order O Ç O’, then O’ is 
projective as a left O-module. 


Although this may seem a bit complicated, it is refreshing that for quaternion 


algebras, all of the sided notions coincide. We recall the equivalences in Main Theorem 
16.7.7), building upon them. 


Main Theorem 20.3.9. Suppose R is a Dedekind domain and B is a quaternion algebra 
over F = Frac R, and let I C B be an R-lattice. Then the following are equivalent: 


(ii) Z is invertible; 

(iii) I is left invertible; 
Gii’) I is right invertible; 

(v) I is projective as a left OL (I)-module; and 
(v’) I is projective as a right Op(1)-module. 


Proof. The equivalences (ii) © (ii’) © (iii) are from Main Theorem 16.7.7 (proven in 
Lemma 16.7.5). Theorem 20.3.3(a) gives (v) => (iii) and (v’) > (iii’), and Theorem 
20.3.3(b) gives (ii) > (v), (v^). o 


Example 20.3.10. Consider again Example 16.5.12. The lattice J has OL (I) = Og() = 
O (so has the structure of a sated O, O-bimodule) but 7 is not invertible; from Main 
Theorem 20.3.9, it follows that J is not projective as a left or right O-module. 


20.4 Jacobson radical 


Before proceeding further in our analysis of orders, we pause to extend some notions 
in sections 7.2 and 7.4 from algebras to rings. We follow Reiner [Rei2003, §6a]; see 
also Curtis—Reiner [CR81, §5]. 

Throughout, let A be a ring (not necessarily commutative, but with 1). 


Definition 20.4.1. Let M be a left A-module. We say M is irreducible or simple if M + 
{0} and M contains no A-submodules except {0} and M. We say M is indecomposable 
if whenever M = Mı ® Mz with Mı, M2 left A-modules, then either Mı = {0} or 
Mp = {0}. 
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20.4.2. We generalize Lemma 7.2.7. If J is a maximal left ideal of A, then A/I is a 
simple A-module. Conversely, if M is a simple A-module, then for any x € M nonzero 
we have Ax = M; therefore M = A/I where 


I = ann(x) := {a E€ A: ax = 0}. 


Definition 20.4.3. The Jacobson radical rad A is the intersection of all maximal left 
ideals of A. The ring A is Jacobson semisimple if rad A = {0}. 


Lemma 20.4.4. The Jacobson radical rad A is the intersection of all annihilators of 
simple left A-modules; rad A C A is a two-sided A-ideal. 


Proof. The same proof as in Lemma 7.4.5 and Corollary 7.4.6 applies, mutatis mutan- 
dis. oO 


Example 20.4.5. If A is a commutative local ring, then rad A is the unique maximal 
ideal of A. 


Example 20.4.6. Let R be a complete DVR with maximal ideal p = rad R. Let F = 
Frac R and let D be a division algebra over F. Let O C D be the valuation ring, the 
unique maximal R-order (Proposition 13.3.4). Then O has a unique two-sided ideal P 
by 13.3.10, and so rad O = P. 


Lemma 20.4.7. A/ rad A is Jacobson semisimple. 


Proof. Let J = rad A. Since JM = {0} for each simple left A-module M, we may view 
each such M as a simple left A/J-module. Now let a € A be such thata+J € rad(A/J); 
then (a+J)M = {0},soa@M = {0} anda € J; thus rad(A/J) = {0}, and A/J is Jacobson 
semisimple. o 


Lemma 20.4.8. We have 
rad A = {£ € A: 1 — œ Baz € AŽ for all œj, œ € A}. 
Proof. See Exercise 20.6. o 
Corollary 20.4.9. rad A is the intersection of all maximal right ideals of A. 
Proof. Lemma 20.4.8 gives a left-right symmetric characterization of rad A. o 


Corollary 20.4.10. If 6: A — A’ is a surjective ring homomorphism, then ġ(rad A) S 
rad A’ and we have an induced surjective homomorphism A/rad A > A'/ rad A’. 


Proof. Let B € radA, let œj, œ, € A’; since ¢ is surjective, there exist preimages 
@, @2 € A. By Lemma 20.4.8, 1 — œı Gaz € A* and 


$ — a1 Biaz) = 1 — a} d(B)as € A’, 
so by the same lemma, (8) € rad A’. o 


Corollary 20.4.11. Let I C A be a two-sided A-ideal. 


332 CHAPTER 20. INTEGRAL REPRESENTATION THEORY 


(a) If A/I is Jacobson semisimple, then rad A C I. 
(b) If I C rad A, then (rad A)/I = rad(A/T). 


Proof. We have a surjection ø: A — A/I. For (a), we get (rad A) € rad(A/I) = {0} 
from Corollary 20.4.10, so rad A C J. For (b), we get rad(A)/J © rad(A/J) from the 
surjection, and applying (a) to (A/T)/(rad(A)/7) we get rad(A/J) © rad(A)/T. Oo 


Lemma 20.4.12. (Nakayama’s lemma). Let M be a finitely generated left A-module 
such that (rad A)M = M. Then M = {0}. 


Proof. If M + {0}, let x1,...,x, be a minimal set of generators for M as a left 
A-module. Since x; € M = (rad A)M, we may write 


x1 = Bix, +++ + Buxn 


with £; € rad A. But then 1 — 6; € A*, so the generator x, is redundant, a contradiction. 
o 


Corollary 20.4.13. Let M be a finitely generated left A-module, and let N C M be a 
submodule such that N + (rad A)M = M. Then N = M. 


Proof. By hypothesis, M/N is finitely generated, and (rad A)(M/N) = M/N, so by 
Nakayama’s lemma, M/N = {0} and M = N. o 


Lemma 20.4.14. Let I be a maximal two-sided ideal of A. Then I contains rad A. 


Proof. If I does not contain rad A, then J + rad A is a two-sided ideal of A containing 
rad A and properly containing 7. Since J is maximal, we have J + rad A = A. By (the 
corollary to) Nakayama’s lemma, we get J = A, a contradiction. Oo 


20.5 Local Jacobson radical 


Suppose now that R is a complete DVR with fraction field F = Frac R, maximal ideal 
p = rad R, and residue field k = R/p. Let B be a finite-dimensional F’-algebra, and let 
O C B be an R-order. 

In this setting, we may identify the Jacobson radical via pullback as follows. 


Theorem 20.5.1. Let 6: O — O/pO be reduction modulo p. Then 
rad O =  !(rad O/pO) 2 pO, 
and (rad O)” € pO for somer > 0. 
Proof. See Reiner [Rei2003, Theorem 6.15]. oO 
Corollary 20.5.2. O/ rad O is a (finite-dimensional) semisimple k-algebra. 


Proof. Since radO 2 pO, we conclude that O/pO is a k-algebra; it is Jacobson 
semisimple by 20.4.7 and hence semisimple by Lemma 7.4.2. o 
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Definition 20.5.3. A two-sided ideal J C O is topologically nilpotent if J” € pO for 
somer > 0. 


Remark 20.5.4. The order O as a free R-module has a natural topology induced from 
the p-adic topology on R; J is topologically nilpotent if and only if J” — {0} in this 
topology. 


Corollary 20.5.5. Let I C O be a two-sided ideal. Then the following are equivalent: 


(a) I € rad O; 
(b) I” C rad O for some r > 0; and 
(c) I is topologically nilpotent. 


Proof. See Reiner [Rei2003, Exercise 39.1, Exercise 6.3]. oO 


20.6 Integral representation theory 


We continue our notation that R is a complete DVR. We now turn to some notions in 
integral representation theory. In this local case, there is a tight connection between 
the representation theory of O (viewed in terms of O-modules) and the representation 
theory of the quotient O/pO which is a k-algebra of finite dimension over k, since O 
is finitely generated as an R-module. 


20.6.1. Recall that a representation of B over F is the same as a left B-module. If M is 
a finitely-generated left O-module, then V := M Qpr F is a left B-module, and M CV 
is an R-lattice. A O-supermodule of M is a left O-module V 2 M’ 2 M. 


The following result is foundational. 


Theorem 20.6.2. (Krull-Schmidt). Every finitely generated left O-module M is 
expressible as a finite direct sum of indecomposable modules, uniquely determined 
by M up to O-module isomorphism and reordering. 


Proof. Since M is finitely generated over R it is itself noetherian, so the process of 
decomposing M into direct summands terminates. See Curtis—Reiner [CR81, (6.12)] or 
Reiner [Rei2003, §6, Exercise 6] for hints that lead to a proof of the second (uniqueness) 
part. oO 


Corollary 20.6.3. Let M = Mı ©---®M, be a decomposition into finitely generated 
indecomposable left O-modules, and let N C M be a direct summand. Then N ~ 
M;i, ®-::®M,, for some subset {ij,...,is} G {1,...,n}. 


Proof. By hypothesis, we can write Ð M;i = N N’, with N’ a finitely generated 
left O-module. By the Krull-Schmidt theorem (Theorem 20.6.2), if we write N, N’ as 
the direct sums of indecomposable modules, the conclusion follows. o 


334 CHAPTER 20. INTEGRAL REPRESENTATION THEORY 


20.6.4. We saw in 7.2.19 that idempotents govern the decomposition of the F-algebra B 
into indecomposable left B-modules. The same argument shows that a decomposition 


O=P,®:::@P, (20.6.5) 


into a direct sum of indecomposable left O-modules corresponds to an idempotent 
decomposition 1 = e; +--- + e,, with the e; a complete set of primitive orthogonal 
idempotents. Moreover, each P; = Oe; is a projective indecomposable left O-module. 

Conversely, if P is a projective indecomposable finitely generated left O-module, 
then P = P; for some i: taking a set of generators we have a surjective O-module 
homomorphism O” — P, and since P is projective we have P € O” a direct summand, 
so Corollary 20.6.3 applies. 

Consequently, if P is a projective left O-lattice, then P =~ Py)" ®--- @ Py” with 
ni >Ofori=1,...,r. 


20.6.6. The decomposition of an order into projective indecomposables is a nice way 
to keep track of other orders, as follows. We extend our notation slightly, and define 


OL(M) := {@eE B:aMcM} 


for every left O-submodule M C B. 
Take a decomposition of O in (20.6.5); since each P; is a left O-module, extending 
scalars it is a left B-module, so 


() O.(P;) = O. (20.6.7) 
i=1 


Now let J C B be an R-lattice with O C O_(/) that is projective as an O-module. By 
20.6.4, considering / as a left O-module, we have an isomorphism of left O-modules 


o: 1 > PP" -9 Pr 
with n; > 0. We claim that 
Ou) = { ) OP). 
ni>0 


Indeed, we have a/ C I if and only if ¢(@/) = ad(J) € (I), since ¢ is a O-module 
homomorphism so extends to a B-algebra homomorphism, and finally ad(J) € (1) if 
and only if @P; C P; for alli with n; > 0, as in (20.6.7). 


We now relate a decomposition of O into a decomposition of O/pO. 
Lemma 20.6.8. Let J = rad O. The association I œ I/JI gives a bijection between 
isomorphism classes of indecomposable finitely generated projective left O-modules 


and isomorphism classes of simple finite-dimensional left O/J-modules. 


Proof. The proof requires a bit of fiddling with idempotents, but is otherwise straight- 
forward—so it makes a good exercise (Exercise 20.7). oO 
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Corollary 20.6.9. If I is projective indecomposable, then JI C I is the unique maximal 
O-submodule of I. 


Proof. By Lemma 20.6.8, since J is indecomposable, J/JT is simple so JT is a unique 
maximal submodule. If J’ C I is another maximal O-submodule, then JJ + J’ = J, and 
by Nakayama’s lemma 7’ = J, a contradiction. o 


We finish our local study over R a complete DVR with composition series for 
modules over an order. 


Definition 20.6.10. Let M be an O-lattice. A composition series for M is a strictly 
decreasing sequence 
M =Mọo2 M2 M22... 


such that N2; M; = {0} and each composition factor M;/M;+ı is simple as a O- 
module. 

The length of a composition series is the largest integer r such that M, = {0} if r 
exists (in which case we call the series finite), and otherwise the length is co. 


20.6.11. If M has a finite composition series, then its length (M) is well-defined, 
independent of the series. For example, taking R = F and O = B, a finitely generated 
B-module is a finite-dimensional F-vector space, so every composition series is finite 
and every B-module V has a well-defined length £(V). 


20.6.12. Let N C M be a maximal O-submodule. We claim that JM C N. Otherwise, 
N +JM = M by maximality, so by Nakayama’s lemma (Corollary 20.4.13), N = M, 
a contradiction. 


20.7 =» Stable class group and cancellation 


To conclude this chapter, we apply the above results and consider a different way to 
form of a group of ideal classes; for further reference on the topics of this section, see 
Curtis—Reiner [CR87, §§49-51] or Reiner [Rei2003, §38]. 

Let R be a Dedekind domain with field of fractions F. 


20.7.1. Recall that the group Cl R records classes of fractional ideals, or what is more 
relevant here, isomorphism classes of projective modules of rank |. Here is another way 
to see the group law on Cl R: given two such fractional ideals a, 6 up to isomorphism, 
there is an isomorphism of R-modules 


a®b = Rab, 
and the class of ab is uniquely determined by this isomorphism by 9.3.10. 
We now consider an analogous construction to 20.7.1 in the noncommutative 


setting. Let B be a simple F-algebra and O C B an R-order. We begin with a technical 
lemma. 
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Lemma 20.7.2. (Weak approximation). Let I be a locally principal left fractional 
O-ideal and let a C R be an ideal. Then there exists B € B* such that IB € O and 


(IB) = Op forall p|a. (20.7.3) 


Proof. For each prime p, we have Ip = Op@p with ap € By. Because F is dense in 
Fp, there exists £ € (O : J) such that apf, = 1 (mod pO,) for all p | a. By norms, 
we have $ € B*. Letting Up := app, we have up — 1 € pOp 2 rad Op by Theorem 
20.5.1 so up € O% by Lemma 20.4.8. Therefore (J8)p = Op for all p | a. o 


Proposition 20.7.4. If I, I’ C B are locally principal left fractional O-ideals, then 
there exists a locally principal left fractional O-ideal J and an isomorphism 


Tel’=~JeO (20.7.5) 
of left O-modules. 


Proof. We may suppose without loss of generality that 7, /’ C O. Then we have exact 
sequences of left O-modules 


475050746 
iar 0507 S60 


The module O/I is R-torsion, annihilated by the (nonzero) R-ideal a = [O : J], 
and similarly with 7’, annihilated by a’ = [O : I’]r. By weak approximation (Lemma 
20.7.2), replacing 7’ with I’B we may suppose that 7, = Op for all p | a, and hence 
a, a’ are coprime. Then for all primes p of R, we have either (O/I)p = {0} so Øp is 
surjective, or correspondingly ¢,, is surjective. 

Now consider the left O-module homomorphism 


¢+¢':1el' 30 (20.7.6) 


obtained by summing the two natural inclusions. We just showed that (¢ + $’)p is 
surjective for all primes p, so it follows that ¢ + ¢’ is surjective: the cokernel M := 
coker(¢ + ¢’) has My = {0} for all p, and M = {0}. Moreover, since O is projective as a 
left O-module, the map ¢+ ¢’ splits (or note that the map splits locally for every prime 
p, so it splits globally, Exercise 20.1(b)). If we let J := ker(¢ + ¢’), we then obtain an 
isomorphism 

Tel’=Jeo. (20.7.7) 


To conclude, we show that J is locally principal. To this end, we localize at a prime 
p and note that 7, T’ are locally principal, so 


Ip © I, = OF? = Jp ® Op. (20.7.8) 


But by the Krull-Schmidt theorem (Theorem 20.6.2) and Exercise 20.8, we can cancel 
one copy of Op from both sides! We conclude that Jp ~ Oy as left O-modules and 
therefore by Lemma 17.3.3 that Jp is (right) principal. o 
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The candidate binary operation in Proposition 20.7.4 has a simple description in 
the “coprime” case. 


Lemma 20.7.9. Let I, I’ C O be locally principal integral left O-ideals, and suppose 
for every prime p © R either Iy = Oy or I, = Op. Then 


Ilel’~OeJ, whereJ =I’. 
Moreover, writing Ip = Ody and I, = Oa, we have 


By weak approximation (Lemma 20.7.2), the hypothesis of Lemma 20.7.9 can 
always be arranged to hold for J, /’, up to isomorphism (as left O-ideals). 


Proof. By hypothesis, if ¢, ¢: I, I’ — O are the inclusions, then the map ¢ + ¢’: 1® 
I’ > O as in (20.7.6) is surjective. We have 


ker( + ¢') ={(a,e’)€1@l’:at+a’ =0}~INI’ 


by projection onto either coordinate, since a = —a’ € INI’. This gives an exact 
sequence 
0-Inl >lel>O0>0 


and as above 7 @ Il’ ~ J@O with J = INI’. The final statement follows from the 
hypothesis that either 7p = Oy or J, = Op, since then ay € Of or ay, € OF. o 


In order to get a well-defined binary operation, we need an equivalence relation: 
we will need to identify J, J’ if J O =~ J’ ® O. But the copies of O needed for the 
axioms start to pile up, so we make the following more general definition. 


Definition 20.7.10. Let J, J’ C B be locally principal left O-ideals. We say that J is 
stably isomorphic to J’ if there exists an isomorphism of left O-modules 
Jo O” = J' o” 


for some r > 0. 
Let [J]s+ denote the stable isomorphism class of a left O-ideal J and let StCl O be 
the set of stable isomorphism classes of left O-ideals in B. 


Proposition 20.7.11. StCl O is an abelian group under the binary operation (20.7.5), 
written [I]se + WU’ lst = [J] st, with identity [O]s+- 


Accordingly, we call StCl O the stable class group of O; it is also referred to as 
the locally free class group of O. 
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Proof. The operation is well-defined: if [h ]s: = Lo]st via 7 @ O% ~ h 6 O®" and 


the same with LT) Ise = [5 |se. and we perform the binary operation [; ® D = J0 


and the same with the subscripts 2, then 
Je OC +r'+1) Š (h ay OF") e ee a oO?) 
~ (h @ 0") @ (L a O’) (20.7.12) 
= h e oer +r'+1) 


so [Ji ]st = [J2]s+. It is similarly straightforward to verify that the operation is associa- 
tive and commutative and that [O]s+ is the identity. 

To conclude, we show that StCl O has inverses. Let J C O be a locally principal 
O-ideal. For each prime p C R, we have Ip = Opay with œp € By, and ay = 1 for 
all but finitely many p. Let 7’ be the R-lattice with 7, = Opa, l for all p. Then J’ is 
a left fractional O-ideal, because this is true locally. By weak approximation (Lemma 
20.7.2), there exists 8 € B* such that (J’B)p = Op for all p such that Ip + Op, i.e., 
for all p such that ay + 1. But now we can perform the group operation as in Lemma 
20.7.9: we have [I]s: + U’]st = [J]si where J = Z N T£, and for all p we have 


Jp = Opapay By = Opbp 
so J = OB and J = O, so [I]se + H’]se = [O]s¢ and J’ is an inverse. o 


Remark 20.7.13. There is a related group to StClO, defined as follows. Let A be 
a ring, and let P(A) be the category of finitely generated projective left A-modules 
under isomorphisms. We define the group Ko(A) to be the free abelian group on the 
isomorphism classes [P] of objects P € P(A) modulo the subgroup of relations 


[P@P’])=[P]+[P’], for P,P’ € P(A); 
equivalently relations [P] + [P’] = [Q] for each exact sequence 
0> P>Q0>P—>0 


since such a sequence splits. The group Ko(A) is sometimes called the projective class 
group of A. (The group Ko(A) is the Grothendieck group of the category P(A).) 
Then for P,Q € P(O), we have [P] = [Q] € Ko(O) if and only if P, Q are stably 
isomorphic [CR87, Proposition 38.22]. Moreover, there is a natural map 
Ko(O) > Ko(B) 
[P] > [F 8r O], 
and we let SKo(O) be its kernel, called the reduced projective class group of O. The 
abelian group SKo(O) is generated by elements [P] — [Q] where P,Q € P(O) and 
F @pr P = F &pr Q. Finally, we have an isomorphism [CR87, Theorem 49.32] 
StClO > SKo(O) 
(20.7.14) 
Uls: > H] - [0]. 


In other words, after all of this work—at least for maximal orders—the reduced 
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projective class group and the stable class group coincide. (For a more general 
order, one instead compares to a maximal superorder via the natural extension maps 
StClO > StCl O’.) 

The stable class group was first introduced and studied by Swan [Swa60, Swa62] 
in this context in the special case where O = Z[G] is the group ring of a finite 
group G. 


Definition 20.7.15. We say that O has stable cancellation (or the simplification prop- 
erty) if stable isomorphism implies isomorphism, i.e., if whenever J, 1’ are locally 
principal left O-ideals with J @ O” = I’ @ O” for r > O, then in fact J = I’. 


If we had defined stable isomorphism and cancellation for locally free O-modules, 
we would arrive at the same groups and condition, so stable cancellation is also called 
the locally free cancellation. 

From now on, suppose that R is a global ring with F = Frac R, and O C Bisa 
maximal R-order in a quaternion algebra B over F. We recall section 17.8, and the 
class group CloR, where Q c Ram B is the set of real ramified places. 


Theorem 20.7.16 (Frohlich-Swan). Let R = R (s) be a global ring, let B be a quater- 
nion algebra over F, and let O C B be a maximal R-order. Then the reduced norm 


induces an isomorphism 
nrd : StClO æ ClgR (20.7.17) 


of finite abelian groups. 


Proof. See Frohlich [Fr675, Theorem 2, S X], Swan [Swa80, Theorem 9.4], or Curtis— 
Reiner [CR87, Theorem 49.32]; we will sketch a proof of a more general version of 
this theorem in section 28.10, when we have idelic methods at our disposal. oO 


20.7.18. Since B is a quaternion algebra, the notions of invertible and locally principal 
coincide. Then there is a surjective map of sets 


Cls, O > StClO 
M > Us. (20.7.19) 


Suppose further that F = Frac R is a number field and R is a global ring. Then Cls, O 
is a finite set, by Main Theorem 17.7.1; consequently, the stable class group StC1 O is 
a finite abelian group. However, the map (20.7.19) of sets need not be injective. 


The order O has stable cancellation if and only if the map (20.7.19) is injective 
(equivalently, bijective). 


20.7.20. Suppose that B satisfies the Eichler condition. Then by Eichler’s theorem 
(Theorem 17.8.3), the reduced norm also gives a bijection Cls O — CloR compatible 
with the surjective map Cls.O — StClO (20.7.19) which must therefore also be a 
bijection. 


What remains, then, is the case where B is definite. We restrict attention to the 
case where the base field F is a number field, hence a totally real field, and we work 
with R-orders O C B, where R = Zr is the ring of integers of F. Vignéras [Vig76b] 
initiated the classification of definite quaternion orders with stable cancellation, and 
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showed that there are only finitely many such orders. Hallouin—Maire [HM2006] and 
Smertnig [Sme2015] extended this classification to certain classes of orders, and the 
complete classification was obtained by Smertnig—Voight [SV2019]. 


Theorem 20.7.21. (Vignéras, Hallouin—Maire, Smertnig, Smertnig—Voight). Up to 
isomorphism, there are exactly 316 definite quaternion R-orders with stable cancella- 
tion. 


The isomorphisms in Theorem 20.7.21 are as R-orders; up to ring isomorphism 
(identifying Galois conjugates), there are exactly 247. 


Example 20.7.22. If O is a definite maximal quaternion Z-order, by Theorem 20.7.16 
we have StClO = #CI* Z = 1, so O has stable cancellation if and only if #Cls O = 1. 
These orders will be classified in section 25.4: they are the orders of discriminant 
D = 2,3,5,7,13. (More generally, if R = Zr has #Cl* Zr = 1, then a definite, 
maximal quaternion R-order has stable cancellation if and only if #Cls O = 1.) 


Remark 20.7.23. Jacobinski [Jaci68] was the first to consider the stable class group for 
general orders in the context of his work on genera of lattices; his cancellation theorem 
states more generally that if B is a central simple algebra over F and B is not a totally 
definite quaternion algebra, then every R-order O C B has stable cancellation. This 
result was reformulated by Frohlich [Fr675] in terms of ideles and further developed 
by Frohlich-Reiner-Ullom [FRU74]. Swan [Swa80] related cancellation to strong 
approximation in the context of K-groups. 

Brzezinski [Brz83b] also defines the spinor class group of an order, a quotient of 
its locally free class group; this group measures certain invariants phrased in terms of 
quadratic forms. 


Remark 20.7.24. More generally, a ring A in which every stably free right A-module 
is free is called a (right) Hermite ring by some authors: for further reference and 
comparison of terminology, see Lam [Lam2006, Section I.4]. If O has locally free 
cancellation, then O is Hermite; however, the converse does not hold in general— 
a counterexample is described in detail by Smertnig [Sme2015]. Smertnig—Voight 
[SV2019] show that there are exactly 375 definite quaternion R-orders with the Hermite 
property up to isomorphism. 


Exercises 


Throughout these exercises, let R be a noetherian domain with F = Frac R, let B be a 
finite-dimensional F-algebra, let O C B be an R-order, and let J = rad O. 


> 1. Let M, N be left O-lattices. 


(a) Show that a sequence 0 > M > N > M’ — 0 of left O-lattices is exact 
if and only if the sequences 0 > Mp > Ny > Mj, > 0 are exact for 
all primes p C R. [Hint: Consider the modules measuring the failure of 
exactness and show they are locally zero, hence zero. ] 
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> 2. 


> 6. 


> 7. 


> 8. 


. Let R be a DVR with maximal ideal p, and let O = 


(b) Let 6: M — N be a surjective O-module homomorphism. Show that 
@ splits (there exists Y: N — M such that ġy = idy) if and only if 
$p : Mp — Ny splits for all primes p C R. 


Suppose R is a DVR and B is a quaternion algebra. Let J = rad O. Show that 
J = J and O,(rad O) = Op(rad O). 


. Let R be a complete DVR with p = rad R. Show that the p-adic topology and the 


J-adic topology on O are the same. 


R R 

R 
I C B be a left fractional O-ideal. Show that either J is invertible as a O-ideal or 
I is conjugate to M2(R) by an element of B*. 


C B = M)(F). Let 


. Let O be a maximal R-order, and let M be a projective left O-lattice. Show 


that M is indecomposable if and only if FM is a simple left B-module. [Hint: 
Suppose W C FM is a left B-submodule of FM, and let N := MN W. Show 
that M/N is a projective O-lattice, so the sequence 0 > N > M > M/N > 0 
splits. ] 


Let A be a ring (not necessarily commutative, but with 1. In this exercise, we 
prove Lemma 20.4.8, that 


rad A = {£ € A: 1—a)Baz € A” for all a, a2 € A}. 


We first show the inclusion (C). 


(a) Since rad A is a two-sided ideal, it suffices to show that 1 — 6 € A*. Show 
that A(1 — £) = A. 

(b) Let a € A be such that a(1 — 8) = 1. Repeating the argument, show that 
A(1 - (1 - œ)) = Aa = A. 

(c) Show that a is also a right inverse of 1 — 6, so 1 — B € A”. 


Next we show the inclusion (2). 


(d) Let 8 € A be such that 1 - a@By € A” for all a, y € A. Let M be a simple 
left A-module. Show that aM = {0}. Conclude that œ € rad A. 


Suppose R is a complete DVR. Prove Lemma 20.6.8: the association J => 
I/JI gives a bijection between isomorphism classes of indecomposable finitely 
generated projective left O-modules and isomorphism classes of simple finite- 
dimensional left O/J-modules. 


Let R be a complete DVR, and let J, J’, J be finitely generated left O-modules 
such that 
leJ~l oes 


as left O-modules. Prove that J = I’ as left O-modules. 
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9. Let A = M,(O) with n > 1. Show that StCl A = StClO. 
10. Show that the Lipschitz order has stable cancellation. 
11. Let B = (-1,-3 | Q) and let O = Z + Zi) + Z(-1 + j)/2 + Zi + ij)/2. 


(a) Show that O is an order with discrd O = 9. 
(b) Show that O has stable cancellation. 
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Chapter 21 


Hereditary and extremal orders 


In this chapter, we consider hereditary orders, those with the simplest kind of module 
theory; we characterize these orders in several ways, including showing they have an 
extremal property with respect to their Jacobson radical. 


21.1 > Hereditary and extremal orders 


Let R be a Dedekind domain. Then R is hereditary: every submodule of a projective 
module is again projective. (Hence the name: projectivity is inherited by a submodule.) 
A noetherian domain is hereditary if and only if every ideal of R is projective, or 
equivalently, that every submodule of a free R-module is a direct sum of ideals of 
R. This property is used in the proof of unique factorization of ideals and makes the 
structure theory of modules over a Dedekind quite nice. (Note, however, that every 
order in a number field which is not maximal is not hereditary.) 

It is important to identify those orders for which projective modules abound. Let 
B be a simple finite-dimensional F-algebra and let O C B be an R-order. 


Definition 21.1.1. We say O is left hereditary if every left O-ideal J C O is projective 
as a left O-module. 


We could define also right hereditary, but left hereditary and right hereditary are 
equivalent for an R-order O, and so we simply say hereditary. We have O hereditary 
if and only if every O-submodule of a projective finitely generated O-module is 
projective—that is to say, projectivity is inherited by submodules. Moreover, being 
hereditary is a local property. 

Maximal orders are hereditary (Theorem 18.1.2), and one motivation for hereditary 
orders is that many of the results from chapter 18 on the structure of two-sided ideals 
extend from maximal orders to hereditary orders (Theorem 21.4.9). 


Proposition 21.1.2. Suppose O is hereditary. Then the set of two-sided invertible 
fractional O-ideals of B forms an abelian group under multiplication, generated by 
the prime O-ideals. 
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Hereditary orders are an incredibly rich class of objects, and they may be charac- 
terized in a number of equivalent ways (Theorem 21.5.1). We restrict to the complete 
local case, and suppose now that R is a complete DVR with unique maximal ideal p 
and residue field k = R/p. 

Just as maximal orders are defined in terms of containment, we say O is extremal 
if whenever O’ > O and rad O’ 2 rad O, then O’ = O. If O is not extremal, then 


O' := O.(radO) 2 O (21.1.3) 


is a superorder. We then have the following main theorem (Theorem 21.5.1). 


Main Theorem 21.1.4. Let R be a complete DVR and let O © B be an R-order in a 
simple F-algebra B. Let J := rad O. Then the following are equivalent: 


(i) O is hereditary; 
(ii) J is projective as a left O-module; 
Gi’) J is projective as a right O-module; 
(iii) OL(J) = O; 
(iii’) ORr(J) = O; 
(iv) J is invertible as a (sated) two-sided O-ideal; and 


(v) O is extremal. 


The fact that hereditary orders are the same as extremal orders is quite remarkable, 
and gives tight control over the structure of hereditary orders: extremal orders are 
equivalently characterized as endomorphism algebras of flags in a suitable sense, and 
so we have the following important corollary for quaternion algebras. 


Corollary 21.1.5. Suppose further that B is a quaternion algebra. Then an R-order 
O C B is hereditary if and only if either O is maximal or 


R R 
gsf p| EMP) = B. 


Itis no surprise that we meet again the order from Example 20.1.2! The reader who 
is willing to accept Corollary 21.1.5 can profitably move on from this chapter, as the 
ring of upper triangular matrices is explicit enough to work with in many cases. That 
being said, the methods we encounter here will be useful in framing investigations of 
orders beyond the hereditary ones. 


21.2 Extremal orders 


In this section, we will see how to extend an order to a superorder using the Jacobson 
radical, and we will characterize those orders that are extremal with respect to this 
process. 

We work locally throughout this section; let R be a complete DVR with maximal 
ideal p = rad(R) and residue field k = R/p, and let F = Frac R. Let B be a finite- 
dimensional separable F-algebra and let O C B be an R-order. 
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21.2.1. Our motivation comes from the following: we canonically associate a super- 
order as follows. Let J := rad O and O’ := OL (J). Then O’ 2 O. By Corollary 20.5.5, 
J” C pO C pO’ for some r > 0, and then J S rad O’. 


Definition 21.2.2. An R-order O’ C B radically covers O if O’ 2 O and rad O’ 2 
rad O. We say O is extremal if whenever O’ radically covers O then O’ = O. 


We can think of extremal orders as like maximal orders, but under certain inclu- 
sions. 


Proposition 21.2.3. An R-order O is extremal if and only if O, (rad O) = O if and only 
if Or(rad O) = O. 


Proof. The argument is due to Jacobinski [Jaci71, Proposition 1]. 

We first prove (=). Suppose O is extremal, and let J = rad O and O’ = O_(J). 
By Corollary 20.5.5, J is topologically nilpotent as a O-ideal, so the same is true as a 
O’ ideal, and J € rad O’ and O’ radically covers O. Since O is extremal, we conclude 
O’ = O. The same argument works on the right. 

Next we prove (<=). Let J = rad O, suppose O = O,(J); let O’ radically cover O, 
and let J’ = rad O’. As lattices, we have p*O’ C J for some s > 0; by Theorem 20.5.1, 
(J’)” C pO’ for somer > 0, so putting these together we have (J’)’ c J for some t > 0. 
Suppose t > 1. Since O’ radically covers, we have J € J’; thus J(J’)"! c (J CJ 
and (J)! c Og(J) = O. But then since ((J’)'~!)? c (J) c J, by Corollary 20.5.5, 
(J’)'"! c J. Continuing in this way, we obtain t = 1 and J’ C J. Therefore J = J’ and 
O = OL(J) = OL(J’) = O’, thus O is extremal. o 


Lemma 21.2.4. Let O be an R-order and let O’ C B be an R-order containing O. Let 
J := radO’. Then O + J’ is an R-order that radically covers O. If further J’ C O, 
then J’ CJ. 


Proof. See Exercise 21.6. o 


21.2.5. In view of Lemma 21.2.4, an extremal order is determined by its homomorphic 
image in a nice k-algebra as follows. 

Let O be an extremal R-order and let O’ € B be a maximal R-order containing O. 
Let J’ := rad O’. By Lemma 21.2.4, O + J’ is an R-order that radically covers O, so 
O + J’ = O. Therefore J’ C O. By the second part of Lemma 21.2.4, we immediately 
conclude J’ C J := rad O. In sum, 


J’ =radO’ C J = radO C O. (21.2.6) 


Consider now the reduction map p: O’ — O’/J’. Since J’ € O, if A = p(O) then 
O = p~! (A). But since pO’ c J’ and O’ is a maximal R-order, the codomain is a nice, 
finite dimensional k-algebra, something we will get our hands on in the next section. 


Paragraph 21.2.5 has the following consequence. 


Lemma 21.2.7. Suppose that B is a division algebra over F and let O C B be extremal. 
Then O is maximal. 


346 CHAPTER 21. HEREDITARY AND EXTREMAL ORDERS 


Proof. Recall 13.3.7. The valuation ring O’ 2 O has the unique maximal two-sided 
ideal J’ = rad O’ = O’ \ (O’)*, so O’/J’ is a field. We have (21.2.6) J’ C J, but then 
J/J’ = {0} € O’/J’ thus J = J’. Thus O’ radically covers O, and since O is extremal, 
O=. (m 


Remark 21.2.8. We stop short in our explicit description of local extremal orders in 
section 21.2: we gave a construction in 21.3.1 only for B ~ M, (F). The results extend 
to B ~ M,(D) where D is a division algebra over F by considering lattices in a free 
left D-module: see Reiner [Rei2003, Theorem 39.14]. 


21.3 = Explicit description of extremal orders 


We now turn to an explicit description of extremal orders. In Lemma 10.5.4, we saw 
that maximal orders in a matrix algebra B = Endr (V) are endomorphism algebras of 
lattices. In this section, we extend this to encompass orders that arise from endomor- 
phism algebras of a chain of lattices: these orders are “block upper triangular”, and 
can be characterized in a number of ways. 


21.3.1. Let V be a finite-dimensional F-vector space and let B = Endr (V); then V is 
a simple B-module. Let M C V be an R-lattice. By Lemma 10.5.4, A := Endg(M) is 
a maximal R-order; we have rad A = pA. 
Choosing a basis for M, we get A ~ M, (R) € M, (F) = B, and rad A = M, (p). 
Now let Z := M 8r k = M/pM. Then Z is a finite-dimensional vector space over 
k. Let 


&: {0} = Z) CZ, C++ OC Zs-1 CZ, =Z 


be a (partial) flag, a strictly increasing sequence of k-vector spaces. We define 
OL(E) := {a E A:aZ, CZ, :i=0,..., s}. 


Equivalently, let M; be the inverse image of Z; under the projection M — Z; then we 
have a chain 


pM = Mo G Mı Ç+- G Ms-1ı G Ms =M (21.3.2) 
and 
OL(E) ={@a E A: aM; CM; :i =0,..., 5}. 
Lemma 21.3.3. O.(&) € A is an R-order with 
rad OL(E) = {a € A: a@Z; C Zi-1} = {a € A: aM; C Mi-1}. 
Proof. That O, (8) is an order follows in the same way as the proof of Lemma 10.2.7. 
For the statement on the radical: let J = {a € A : @Z; C Zi-1}. Then J C O,(6) 


is a two-sided ideal. We have J* C pA pushing along the flag, so J C rad OL(8) by 
Corollary 20.5.5. Conversely, 


OL(8)/J = PD Endk(Zi/Zi-1); 


i=1 


each factor is simple, so the sum is (Jacobson) semisimple; therefore J ¢ rad OL(E) 
and equality holds. o 
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Example 21.3.4. If we take the trivial flag O.(E) : {0} = Zo Ç Z = Z, then OL(6) = 
A, so this recovers the construction of maximal orders. 


Example 21.3.5. Let & be the complete flag of length s = n+ 1 = dimp V, where each 


quotient has dim; (Z;+1/Z:) = 1. Then there exists a basis z1, . . ., Zn of Z so that Z; has 
basis Z1,..., Zn-i, We lift this to basis to x1, . . ., Xn of M (by Nakayama’s lemma), and 
in this basis, we have 
R R R R 
p R R R 
Geya PR k 
p pp... R 
consisting of matrices which are upper triangular modulo p, and 
p R R.. R 
p pR... R O 1. ... 0 
radO(6)=|P P P- RJ=O]: i 7: (21.3.6) 
` 4 a ae a 0 0 see 1 
a ae a z0 
p p p.. p 


where the latter is taken to be a block matrix with lower left entry z and top right entry 
the (n — 1) x (n — 1) identity matrix. 

Other choices of flag give an order which lie between O, (6) and A: we might think 
of them as being block upper triangular orders. 


Now for the punch line of this section. 


Proposition 21.3.7. Let O C B be an R-order. Then O is extremal if and only if 
O = OL(8) for a flag 6. 


Proof. Let O = OL(8). Let J = radO; we seek to apply Proposition 21.2.3, so we 
show that O = O_(J). By Lemma 21.3.3, we have JM; = Mj-; so OL(J)Mi-1 = 
OLJ)JM; = JM; = Mi- fori = 1,...,s5. Since Mo = pM = M, we conclude 
OL(J) = OL(&) = O by definition. 

Conversely, suppose O is extremal with J = radO. Let s be minimal so that 
J5 = pO. We may embed O € A for some A, and we take the flag 


8: 0} = FZS PF (2x C IZ CF. 


Then O € O(&) and rad O(&) 2 J by construction, so since O is extremal, we have 
O =O€). oO 
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21.4 Hereditary orders 


We now link the orders in the previous two sections to another important type of order. 
The theory of extremal and hereditary orders was developed by Brumer [Brum63a, 
Brum63b], Drozd—Kirichenko [DK68], Harada [Har63a, Har63b, Har63c], Jacobinski 
[Jaci71], and Hijikata—Nishida [HN94]. An overview of the local and global theory of 
hereditary orders is given by Reiner [Rei2003, §§2f, 39-40], and Drozd—Kirichenko— 
Roiter [DKR67] and Hijikata—Nishida [HN94] extend some results from hereditary 
orders to Bass orders. 

Let R be a noetherian domain with F = Frac R, and let B be a separable F-algebra, 
and let O C B be an R-order. 


Definition 21.4.1. We say O is left hereditary if every left O-ideal J C O is projective 
as a left O-module. 


21.4.2. We could similarly define right hereditary, but since an order O is left and right 
noetherian, it follows that O is left hereditary if and only if O is right hereditary: see 
Exercise 21.8. When B is a quaternion algebra, the standard involution interchanges and 
left and right, so the two notions are immediately seen to be equivalent. Accordingly, 
we say hereditary for either sided notion. 


Example 21.4.3. In the generic case F = R and O = B, we note that every semisimple 
algebra B over a field F is hereditary: by Lemma 7.3.5, every B-module is semisimple 
hence the direct sum of simple B-modules equivalently maximal left ideals, by Lemma 
TAT, 


21.4.4. By 20.2.6, being hereditary is a local property. 


The following lemma motivates the name hereditary: projectivity is inherited 
by submodules. (Note that since R is noetherian, a finitely generated O-module is 
noetherian, so every submodule is finitely generated.) 


Lemma 21.4.5. Let O be hereditary, and let P be a finitely generated projective left 
O-module. Then every submodule M C P is isomorphic as a left O-module to a finite 
direct sum of finitely generated left O-ideals; in particular, M is projective. 


Proof. We may suppose without loss of generality that P ~ O”. We proceed by 
induction on r; the case r = 1 holds by definition. Decompose O” = E @ O where 
E ~ O'~!. From the exact sequence 


0—-ker@é-~M—-MNE-O0O 
and projectivity, we find that M ~ (M N E) @ ker ø where ker ¢ C O is a left ideal of 


O. By induction, M N E is projective, so the same is true of M. o 


Corollary 21.4.6. O is hereditary if and only if every submodule of a projective O- 
module is projective. 
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Proof. The implication (=) is Lemma 21.4.5; for the implication (<=), O is projective 
(free!) as a left O-module and every left ideal is aO-submodule J € O, so by hypothesis 
I is projective. oO 


Remark 21.4.7. O is hereditary if and only if every R-lattice Z C B with O € O(I) is 
projective as a left O-module, after rescaling. 

However, a bit of a warning is due. If J C B is an R-lattice that is projective as a 
left O-module, then we have shown that J is projective as a left O.(/)-module (Lemma 
20.3.1), whence left invertible (Theorem 20.3.3) as a lattice. But the converse need 
not be true; so it is important that in the definition of hereditary we do not require 
that every left O-fractional ideal 7 is invertible as a left fractional O-ideal (Definition 
16.5.17): the latter carries the extra assumption that J is sated. See also Remark 20.3.8. 


Lemma 21.4.8. Let O C B be a hereditary R-order and let O’ 2 O be an R-superorder. 
Then O’ is hereditary. 


Proof. Let I’ C O’ be a left O’-ideal. Scaling we may take 7’ © O, and it is a left 
O-ideal. Since O is hereditary, /’ is projective as a left O-module; by Lemma 20.3.1, 
I’ is projective as a left O’-module. o 


One of the desirable aspects of hereditary orders is that many of the results from 
chapter 18 on the structure of two-sided ideals extend from maximal orders to hereditary 
orders. Indeed, section 18.2 made no maximality hypothesis (we held out as long as 
we could!). 


Theorem 21.4.9. Let R be a Dedekind domain and let O be a hereditary R-order in a 
simple F-algebra B. Then the set of two-sided invertible fractional O-ideals of B forms 
an abelian group under multiplication, generated by the invertible prime O-ideals. 


Proof. Proven in the same manner as in Theorem 18.3.4; a self-contained proof is 
requested in Exercise 21.4. o 


Remark 21.4.10. Theorem 21.4.9 is proven by Vignéras [Vig80a, Théorème 1.4.5], but 
there is a glitch in the proof. Let R be a Dedekind domain, let B be a quaternion algebra 
over F = Frac R, and let O C B be an R-order. Vignéras claims that the two-sided 
ideals of O form a group that is freely generated by the prime ideals, and the proof 
uses that if J is a two-sided ideal then / is invertible. This is false for a general order 
O (see Example 16.5.12). 

If one restricts to the group of invertible two-sided ideals, the logic of the proof 
is still flawed. The proof does not use anything about quaternion algebras, and works 
verbatim for the case where R = Z C F = Q and B is replaced by K = Q(Vdx) and 
O is replaced by an order of discriminant d = dx f? that is not maximal, of conductor 
f € Zs), as in section 16.1. Then the ideal f = fZ + VdZ is not invertible, but f D (f) 
and (f) is invertible but not maximal, so the group of invertible ideals is not generated 
by primes. 

However, if one supposes that every two-sided ideal is invertible (as a lattice), 
then the argument can proceed: this is the class of hereditary orders, and is treated in 
Theorem 21.4.9. 
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Remark 21.4.11. The module theory for hereditary noetherian prime rings, generaliz- 
ing hereditary orders, has been worked out by Levy—Robson [LR2011]. 


21.5 = Classification of local hereditary orders 


We now come to the main theorem of this chapter, relating extremal orders, hereditary 
orders, their modules and composition series in the local setting. 


Theorem 21.5.1. Let R be a complete DVR and let F := Frac R. Let B be a finite- 
dimensional F-algebra, and let O C B be an R-order. Let J := radO. Then the 
following are equivalent, along with the conditions’ where ‘left’ is replaced by ‘right’: 


(i) O is extremal; 
(ii) Every projective indecomposable left O-submodule P © B is the minimum O- 
supermodule of JP; 
(iii) Every projective indecomposable left O-module P has a unique composition 
series; 
(iv) Every projective indecomposable left O-module P has a unique composition 
series consisting of projectives; 
(v) O is hereditary; 
(vi) J is projective as a left O-module; 
(vii) If P is a projective indecomposable left O-module, then JP is also projective 
indecomposable; and 
(viii) J is invertible as a (sated) two-sided O-ideal. 


Proof. See Hijikata—Nishida [HN94, $1]. o 
Corollary 21.5.2. A maximal order is hereditary. 


Proof. We proved this in Theorem 18.1.2, but here is another proof using Theorem 
21.5.1: the property of being maximal is local, and a maximal order is extremal. O 


To conclude, we classify the lattices of a local hereditary order. 


21.5.3. Suppose R is a complete DVR and that B ~ M,(F). Suppose O € B is a 
hereditary R-order; then by Theorem 21.5.1, O = O_(&) is extremal, arising from a 
chain 21.3.2 which by Lemma 21.3.3 is of the form 


pM = Mo =J M Ç JT'M Ç ++- G Ms-ı = JM G Ms =M, 


with each quotient M;/M;i+ı ~ M/JM simple. 

We claim that the set M,JM,...,J°~'M form a complete set of isomorphism 
classes of indecomposable left O-modules. Indeed, these modules are all mutually 
nonisomorphic, because an isomorphism ¢: JM —> J/M of left O-modules extends 
to an isomorphism ¢ € Endg(B) = F so is given by (right) multiplication by a power 
of z, impossible unless i = j (mod s). And if N is an indecomposable left O-module, 
then FN = F” is ‘the’ simple B-module, so N is isomorphic to a lattice in V. Since J 
is invertible, we may replace N by J’N withr € Z, to suppose that M 2 N > JM. But 
M/JM = O/J is simple as a left O-module, so N = M. (See also Reiner [Rei2003, 
Theorem 39.23].) 
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Exercises 


l. 
2. 


> 4. 


> 6. 


> 8. 


10. 


Show that a Dedekind domain is hereditary (cf. Exercise 9.5). 
Let R = Z, let B = K = Q(Vd) with d the discriminant of K, and let S C Zg be 
an order. Show that S is hereditary if and only if S is maximal. 


. Let R be a DVR with maximal ideal p = xR and F = Frac R with char F # 2. 


1 
Let B = zz) and O = Rii, j) the standard order. Show directly that rad O = 


Oj = jO, and conclude that O is hereditary (but not a maximal order). 


Give a self-contained proof of Theorem 21.4.9 following Theorem 18.3.4. 
(Where does the issue with invertibility arise?) 


. Let R be a complete DVR and let O be a hereditary R-order. Show that O is 


hereditary if and only if rad O is an invertible (sated) two-sided O-ideal. 

In this exercise, we prove Lemma 21.2.4 following Reiner [Rei2003, Exercise 
39.2]. We adopt the notation from that section, so in particular R be a complete 
DVR with maximal ideal p = rad(R). Let O be an R-order and let O’ C B be an 
R-order containing O. Let J’ = rad O’. 


(a) Show that O + J’ is an R-order. 

(b) Show that O + J’ radically covers O. [Hint: let J = rad O, and claim that 
J+J’ Crad(O+J’). For r large, show J” C pO so (J+J’)" € pO’ +J’ and 
(J'Y C pO’, and then making r even larger show (J + vy Cp(O+ J’). 
Conclude using Corollary 20.5.5.] 

(c) If further J’ C O, show that J’ € J. 


. Let R be a Dedekind domain with F = Frac(R), let B be finite-dimensional 


F-algebra, and let O C B be a hereditary order. Let P be a finitely generated 
projective O-module. Show that P is indecomposable if and only if V := P®pr F 
is simple as a B-module. 

Let R be a Dedekind domain, and let O C B be an R-order in a finite-dimensional 
F-algebra. Show that O is left hereditary (every left O-ideal is projective) if and 
only if it is right hereditary (every right O-ideal is projective). [See Reiner 
[Rei2003, Theorem 40.1].] 


A:= ((; () ied bee Qh. 


Show that every submodule of a projective left A-module is projective, but the 
same is not true on the right. 

Let R be a Dedekind domain. Let B be a separable F-algebra, and let B = 
Bı xX -++ X B, be its decomposition into simple components, with B; = Be; for 
central idempotents e;. Let K; be the center of B;, and let S; be the integral 
closure of R in K;. 


. Consider the ring 


(a) Let O C B bea hereditary R-order. Show that O = O; X --- x O, where 
O; = Oe;, and each O; is a hereditary R-order in B;. 
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(b) Conversely, if O; C B; is a hereditary R-order, then O; x --- x O, is a 
hereditary R-order in B. 


[Hint: use the fact that hereditary orders are extremal. ] 


11. For the following exercise, we consider integral group rings. Let G be a finite 
group of order n = #G and let R be a Dedekind domain with F = Frac R. 
Suppose that char F { n. Then B := F[G] is a separable F-algebra by Exercise 
7.16. Let O = R[G]. 


(a) Let O’ 2 O be an R-superorder of O in B. Show that 
OcO'cn'!O. 
[Hint: for ell a = dig agg € O' with ag € F, show that 
Trgjr(@g) = nag E€ R. 


Conclude that O' C n!O.] 

(b) Show that O is maximal if and only if O is hereditary if and only if 
n € R*. [Hint: if O is hereditary, then O contains the central idempotent 
n! DigeG & by Exercise 21.10.] 

(c) We define the left conductor of O’ into O to be the colon ideal 


(O’: O), ={a€ B: a0’ CO}. 


(and similarly on right). Prove that 


t 
CoO. >| ~ codiff(O;). 
i=l! 


12. Give an explicit description like Example 21.3.5 for O, (€) when dim V = 3,4. 


13. Let R be a Dedekind domain, and let O C B be an R-order in a finite-dimensional 
simple F-algebra. Show that O is maximal if and only if O is hereditary and 
rad O C O is a maximal two-sided ideal. 
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Chapter 22 


Quaternion orders and ternary 
quadratic forms 


In this chapter, we classify orders over a Dedekind domain in terms of ternary quadratic 
forms; this is the integral analogue to what we did over fields in Chapter 5. 


22.1 > Quaternion orders and ternary quadratic forms 


We begin our project by returning to the classification over fields: in Chapters 5 and 
6 (see Main Theorem 5.2.5 and Theorem 6.4.7), we saw that quaternion algebras over 
a field F are classified by similarity classes of nondegenerate ternary quadratic forms 
over F. We will soon see that, suitably interpreted, quaternion orders are classified by 
similarity classes of integral ternary quadratic forms. 

Let R be a PID with field of fractions F := Frac R. We recall that the similarity 
class of a ternary quadratic form Q: R? — R is determined by the natural change of 
variable by GL3(R) on the domain and by rescaling by R* on the codomain, and that 
Q is nondegenerate if and only if disc(Q) # 0. 


Main Theorem 22.1.1. Let R be a PID. Then there is a (reduced) discriminant- 
preserving bijection 


Nondegenerate ternary quadratic ‘es Quaternion orders over R 
forms Q over R up to similarity up to isomorphism 


One beautiful feature of the bijection in Main Theorem 22.1.1 is that it can be given 
explicitly. Let Q: R? — R be a ternary quadratic form with nonzero discriminant, and 
let e1, e2, e3 be the standard basis for R>. Then the extension to F given by Or: F 34 F 
is a ternary quadratic space whose even Clifford algebra (section 5.3) is a quaternion 
algebra B. Moreover, the R-lattice O with basis 


1, i:= €2€3, J := €3e€], k= €1€2 
is closed under multiplication and so defines an R-order in B. Explicitly, if the quadratic 
form Q is given by 
Q(x, y, z) = ax? + by? + cz? + uyz + vxz + wxy € R[x, y, z] 
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with (half-)discriminant 
N := 4abc + uvw — au? — bv? — cw’ £ 0, 


then we associate the quaternion R-order O C B with basis 1,i, j, k and multiplication 
laws 


im = ui — be jk = ai = a(u—-i) 

j =vj-ac ki = bj = b(v- j) (22.1.2) 
k? = wk - ab ij = ck = c(w — k). 

The other multiplication rules are determined by the skew commutativity relations 

(4.2.16) coming from the standard involution; one beautiful consequence is the equality 


ijk = jki = kij = abc. 


The R-order O defined by (22.1.2) is called the even Clifford algebra Clf°(Q) of 
Q—its algebra structure is obtained by restriction from the even Clifford algebra of 
Qr—and the reduced discriminant of O is discrd(O) = (N). At least one of the minors 


y= 4bc, v= 4ac, w° — 4ab 


of the Gram matrix of Q in the standard basis is nonzero since Q is nondegenerate, so 
for example if w? — 4ab + 0 and char F + 2, completing the square we find 


w? — 4ab, =) 


ocs] 
F 


It is straightforward to show that the isomorphism class of O is determined by 
the similarity class of Q (using the even Clifford algebra construction). Therefore, the 
proof of Main Theorem 22.1.1 amounts to verifying that every quaternion order arises 
this way up to isomorphism, and that isomorphic quaternion algebras yield similar 
ternary quadratic forms. 

To this end, we define an inverse to the even Clifford algebra construction. Let 
O C B be aquaternion order over R with reduced discriminant discrd(O) generated by 
N € R nonzero. Recalling 15.6, let 


(OË)? = {a € OF : trda) = 0} 


be the trace zero elements in the dual of O with respect to the reduced trace pairing. 
Then we associate the ternary quadratic form 


Nnrd*(Q): (O#)° > R (22.1.3) 
a+ Nnrd(a); E 
explicitly, we have 


Ni# = jk- kj, Nj* = ki — ik, NkË = ij - ji 
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where 1,i, j, k is an R-basis of O, so 
N nrdË(x, y, z) = nrd(x(jk — kj) + y(ki — ik) + z(ij - ji). (22.1.4) 


It is then a bit of beautiful algebra to verify that N nrd? has discriminant N and that 
(22.1.3) furnishes an inverse to the even Clifford map. 

Just as in the case of fields, the translation from quaternion orders to ternary 
quadratic forms makes the classification problem easier: we replace the potentially 
complicated notion of finding a lattice closed under multiplication in a quaternion 
algebra with the simpler notion of choosing coefficients of a quadratic form. 

To conclude this introduction, we state a more general bijective result stated in 
terms of lattices. Let R be a Dedekind domain with F = Frac R, let QF : V —> F bea 
nondegenerate ternary quadratic form. If M C V is an R-lattice, and [ C F is a fractional 
ideal of R such that Q(M) C I, then we have an induced quadratic form Q: M —> [; we 
call such a form a quadratic module in V. Given a fractional ideal a C F, the twist 
by a of the quadratic module Q: M = (in V is the quadratic module aM —> aI. A 
twisted similarity between quadratic modules Q, Q’ in V is a similarity between Q 
and a twist of Q’. From these notions in hand, we have the following theorem (a special 
case of Main Theorem 22.5.7). 


Theorem 22.1.5. Let R be a Dedekind domain, and let Qp : V — F be a nondegen- 
erate ternary quadratic form. Let B := CIf? V. Then the even Clifford map yields a 
discriminant-preserving bijection 


Quadratic modules in V = Quaternion orders in B 
up to twisted similarity up to isomorphism 


that is functorial with respect to R. 


By functorial with respect to R, we mean the same thing as in Corollary 5.2.6, 
but with respect to any homomorphism R — S of Dedekind domains. In particu- 
lar, the bijection in Theorem 22.1.5 is compatible with the bijections obtained over 
localizations of R, including the bijection over F between quaternion algebras and non- 
degenerate ternary quadratic forms previously obtained. In the language of quadratic 
forms (Definition 9.7.13), after some additional work (nailing down the difference 
between similarity and isometry), we conclude: if the ternary quadratic module Q 
corresponds to the quaternion order O, then there is a bijection 


CIQ e TypO, (22.1.6) 


i.e. the type number of a quaternion order is the same as the class number of the 
corresponding ternary quadratic form. 


Remark 22.1.7. If we restrict the correspondence to primitive modules Q: M —> I (i.e., 
Q(M) = 1), then we need only remember the underlying lattice M, and on the right- 
hand side we obtain precisely the Gorenstein orders; these orders will be introduced in 
24.1.1 and this correspondence is proven in section 24.2. 
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22.2 Even Clifford algebras 


In this section, we construct the even Clifford algebra associated to a quadratic module: 
see Remark 22.2.14 below for further references. The reader who wants to skip over 
technicalities at first is encouraged to skip this section and accept 22.3.2 as a definition. 

Let R be a noetherian domain with F = Frac R. Let Q: M —> L be a quadratic 
module over R (see section 9.7), so that M is a projective R-module of finite rank and L 
is an invertible R-module (rank 1). Write LY := Homp(L, R) and M® = R. (For further 
reference on tensor algebra, see Matsumura [Mat89, Appendix C] or Curtis—Reiner 
[CR81, §12].) 


22.2.1. Let 


Ten?(M; L) := Pu 3 M @LY)®, 
d=0 


Now Ten?(M; L) has a natural tensor multiplication law (rearranging tensors), so 
Ten? (M ; L) is a graded R-algebra. Let 1°(Q) be the two-sided ideal of Ten°(M ;L) 
defined by 

ICQ) := (x8 x 8 g — 8(Q(x)) : x € M, g € LY) c Ten°(M; L); (22.2.2) 


note that Q(x) € L so g(Q(x)) € R. We define the even Clifford algebra of Q to be 
the quotient 
Clf°(Q) = Ten°(M; L)/I°(Q). (22.2.3) 


Remark 22.2.4. We might try to define 


Ten(M; L) := EP me o (LY ela?! =ROMO(MEMBEL’)®...; 
d=0 


unfortunately, Ten(M; L) does not have a natural tensor multiplication law, because 

there is no natural map M & M > M & M @LY. But see 22.2.16 below for the odd 

part. 

Example 22.2.5. Under the inclusion R —> F, we have a natural identification 

Clf°(O) 8r F = Clf°(Or). (22.2.6) 

We conclude that the R-lattice in clf°(O F) defined by the image of R? is closed under 

multiplication—something that may also be verified directly—and so Clf°(Q) is an 

R-order in CIF? (Q F). 

22.2.7. As in 5.3.7, for all x, y € M and g € LY, the calculation 
XOyVOSgty@x@g=g(T(x, y) ER (22.2.8) 

holds in Clf°(Q), where T(x, y) = Q(x + y) - Q(x) - Q(y) € L. 
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22.2.9. If M = R” is free with basis e),...,e, and L = Rg is free, then clf°(Q) isa 
free R-module with basis 


-@d/2 
2 


eiD @ei, OB l<e <+ < eig <n, deven, 


as a consequence of 22.2.8, just as in the case over fields 5.3.9. In particular, by 
localizing, if M has rank n as an R-module, then Clf°(Q) is projective of rank 2”~! as 
an R-module. We write elements of Clf°(Q) without tensors, for brevity. 


22.2.10. The reversal map defined on simple tensors 


rev : Clf°(Q) > Clf°(Q) 
x1 Q- 8 Xa @(81 +++ 8a/2) > Xa @ +++ @ xX @(g1-++ Ba2) 


for x; € M and g; € LY, and extended R-linearly, is an R-linear involution. 


Theorem 22.2.11. The association Q > Clf°(Q) is a functor from the category of 
quadratic R-modules under similarities to the category of projective R-algebras with 
involution under isomorphism. Moreover, this association is functorial with respect 
to R. 


We call the association Q > Clf°(Q) in Theorem 22.2.11 the even Clifford 
functor. 


22.2.12. The statement “functorial with respect to R” means the following: given 
a ring homomorphism R — S, there is a natural transformation between the even 
Clifford functors over R and S. Explicitly, given a ring homomorphism R — S and a 
quadratic module Q: M — L, we have a quadratic module Qs: M 8r S > L Qr S, 
and Clf°(Q) 8r S = CIf? (Qs) in a way compatible with morphisms in each category. 
In particular, this recovers the identification in Example 22.2.5 arising from R > F. 


Remark 22.2.13. The association Q +> Clf(Q) of the full Clifford algebra is a functor 
from the category 
of quadratic R-modules under isometries to the category of R-algebras with invo- 
lution under isomorphism that is also functorial with respect to R. See Bischel-Knus 
[BK94]. 


Proof of Theorem 22.2.11. The construction in 22.2.1 yields an R-algebra that is pro- 
jective as an R-module; we need to define an association on the level of morphisms. 
Let Q’: M’ > L’ be a quadratic module and (f, h) be a similarity with f: M — M’ 
and h : L — L’ satisfying Q’(f(x)) = h(Q(x)). We mimic the proof of Lemma 5.3.21. 
We define a map via 


Ten°(M; L) > Ten(M’; L^) 
x8y8g > f8 f(y OH"')*(g) 
for x, y € M and g € LY and extending multiplicatively, where 


(h')*(g):=goh!: L' oR 
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is the pullback under h~!. Then 
x@x@g-g(O(x)) > f(x) ® f(x) 8 (haD E) - gO) 


and since 
EOOD) = UEA = (AEA EE), 


we conclude that 7°(Q) is mapped to 1°(Q’). Repeating with the inverse similarity 

(f1, h!), and composing to get the identity, we conclude that the induced map 

cif? (Q) > clf°(Q’) is an R-algebra isomorphism. 
Functoriality in the sense of 22.2.12 then follows directly. 


Remark 22.2.14. In his thesis, Bichsel [Bic85] constructed an even Clifford algebra of 
a line bundle-valued quadratic form on an affine scheme using faithfully flat descent. 
A related and more general construction was given by Bischel—Knus [BK94]. Several 
other constructions are available: see Auel [Auel2011, §1.8] and the references therein. 

The direct tensorial construction given above is given for ternary quadratic modules 
by Voight [Voi201 1a, (1.10)] and in general by Auel [Auel2011, $1.8] and with further 
detail in Auel [Auel2015, $1.2]; for a comparison of this direct construction with 
others, see Auel—Bernardara—Bolognesi [ABB2014, §1.5, Appendix A]. 


Remark 22.2.15. Allowing the quadratic forms to take values in a invertible module 
is essential for what follows and for many other purposes: for an overview, see the 
introduction to Auel [Auel2011]. 


22.2.16. Let 
Ten! (M; L) := B M® g (L) = Me(M@Me@Me@L)e.... 
d=1d odd 


Then Ten!(M; L) is a graded Ten?(M ; L)-bimodule under the natural tensor multi- 
plication. Let /'(Q) be the R-submodule of Ten! (M; L) generated by the image of 
multiplication of IPQ) by M on the left and right: then 7 1(Q) is the Ten? (M ; L)- 
bisubmodule generated by the set of elements of the form 


X@x@y@g-—g(Q(x))y and y@x@x@g- g(Qx))y 


with x, y € M and g € LY. 
We define the odd Clifford bimodule as 


clf!(Q) := Ten! (M; L)/I'(Q). 
Visibly, Clf (Q) is a bimodule for the even Clifford algebra Clf?(Q). 


22.2.17. When L = R, we can combine the construction of the even Clifford algebra 
and its odd Clifford bimodule to construct a full Clifford algebra, just as in section 
5.3 over a field: see Exercise 22.7. This direct tensorial construction does not extend 
in an obvious way when L # R, as we would need to define a multiplication map 
MEeM>MeMEeLY. 
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22.2.18. We will employ exterior calculus in what follows: this is a convenient method 
for keeping track of our module maps in a general setting. Let M be an R-module and 
let r > 1. The rth exterior power of M (over R) is 


N'M := M” /E, 
where E, is the R-module 
E, := (x1 88 Xr | X1, ..., Xy E M and x; = x; for some i # j}. 


We let \°M = R (and A!M = M). The image of x; ®--- @ x, € M® in AM is 
written xı A ++- A xp. If M is projective of rank n over R, then A” M is projective of 
rank ("). 


22.3 Even Clifford algebra of a ternary quadratic module 


Now suppose that Q: M — L is a ternary quadratic module, which is to say M has 
rank 3; in this section, we examine its even Clifford algebra clf°(Q). Recall that an 
R-order is projective if it is projective as an R-module. The main result of this section 
is as follows. 


Theorem 22.3.1. Let R be a noetherian domain. Then the association Q +» Clf°(Q) 
gives a functor from the category of 


nondegenerateternary quadratic modules overR, 


under similarities 
to the category of 
projective quaternion orders over R , under isomorphisms. 


In the previous section, we defined the even Clifford functor, whose codomain was 
the category of projective R-algebras; in this section, we show that the restriction to 
nondegenerate ternary quadratic modules lands in projective quaternion orders. 

We begin with some explicit descriptions. 


22.3.2. By 22.2.9, the even Clifford algebra Clf°(Q) is an R-algebra that is projective 
of rank 4 as an R-module. Explicitly, as an R-module we have 


R®(M@MPeELY) 
IQ) 


where 7°(Q) is the R-submodule generated by elements of the form 


Clf°(Q) = (22.3.3) 


x@x@g-—1@g(Q)) 


for x€ M and g € LY. 


We now explicitly give the even Clifford algebra of a ternary quadratic module in 
the free case; this could also be taken as the definition when R is a PID and M = R?. 
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22.3.4. Let M = R? with standard basis e1, e2, e3 be equipped with the quadratic form 
Q: M —> R defined by 


Q(x, y, Z) = O(xe + yez + ze3) = ax? + by? + cz? + uyz + vxz + wxy, (22.3.5) 
with a, b, c, u, v, w € R. Then 
N := disc(Q) = 4abc + uvw — au? — by? — cw? € R/R”. (22.3.6) 


By 22.2.9, we have 
Clf°(Q) = R® Ri ® Rj @ Rk 


where 
i= €2€3, J [= 3€], k:= e1é2. 


The reversal involution acts by 
i = €3€) = T(ep,€3) -i =u-i, 


and similarly j = v — j and k = w — k by (22.2.8). 
We then compute directly the multiplication table: 


2? = ui — be jk=ai 
jf =vj-ac ki = bj (22.3.7) 


k? = wk -ab ij =ck 


i 


For example, 
i? = (e2e3)(e2e3) = er(e3e2)e3 = en(u — e2e3)ez = ueze3 — e5e3 = ui — bc 
and 
jk = (e3e1)(e1e2) = ae3e2 = ai. 


The remaining multiplication laws can be computed in the same way, or by using the 
reversal involution and (22.3.7): we compute 


ai=jk=kj=(w-kXv-j)=vw-wj-vk+kj 
so kj = -vw +ai + wj + vk. By symmetry, we find: 
kj =-vw+aitwj+vk 
ik = —uw + wi + bj + uk (22.3.8) 


ji =-—uv +vi+uj+ ck 


We note also the formulas 
ijk = jki = kij = abc. (22.3.9) 
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Example 22.3.10. It is clarifying to work out the diagonal case. Let B = (a,b | F) 
with a, b € R, and let 


O=Ri,j)=R+Ri+Rj+RjcB 
be the R-order generated by the standard generators, and let k = ij. Then: 
P=a jk = bi = —bi 
j’ =b ki =aj = -aj (22.3.11) 
k? = -ab ij=-k=k. 
Example 22.3.12. Consider 


Q(x, y, z) = xy- 2" 


so (a, b, c, u, v, w) = (0, 0, —1, 0,0, 1); then disc(Q) = —cw? = 1. Then the even Clifford 
algebra Clf°(Q) = R + Ri + Rj + Rk has multiplication table 


Z=0 jk=0 
j’ =0 ki=0 (22.3.13) 
k =k ij=-k=k-l1. 
We find an isomorphism of R-algebras 
Clf°(O) > M(R) 


ka 1 0 O\ fo 0 (22.3.14) 
h> 0 opl- oplo al 


22.3.15. Returning to the free quadratic form 22.3.4, the group GL3(R) acts naturally on 
M by change of basis, and this induces an action on Clf°(Q) by R-algebra automorphism 
by functoriality. Explicitly, for o € GL3(R), the action on the basis i, j,k is by the 
adjugate adj(p) of p, the 3 x 3 matrix whose entries are the 2 x 2 minors of p. The 
verification is requested in Exercise 22.2. 


22.3.16. Let F = Frac R. By base extension, we have a quadratic form Qf: V > F 
where V = M @p F, and by functoriality 22.2.12 with respect to the inclusion R > F, 
we have an inclusion clf°(Q) > Clf°(Or) realizing clf°(Q) as an R-order in the 
F-algebra CIF? (QF). 


Lemma 22.3.17. The reversal involution is a standard involution on Clf°(Q p). 


Proof. To check that the involution is standard, we could appeal to Exercise 3.19, but 
we find it more illustrative to exhibit the involution on a universal element, yielding a 
rather beautiful formula. We choose a basis for V and work with the presentation for 
Clf°(Qr) as in 22.3.4. 
Leta = t + xi + yj + zij with t, x, y,z € F. Then a@ = 2t + ux + vy +wz-—a, and 
we find that 
a —(a+ @)a + a@ = a — t(a)a + vla) =0 


362 CHAPTER 22. QUADRATIC FORMS 


where 
T(@) = 2t+uxt+vy+wz 
v(@) = t + utx + vty + wtz 
r (22.3.18) 
+ bex* + (uv — cw)xy + (uw — bv)xz 
+ acy? + (vw — au)yz + abz? 
so that the reversal map œ + t(«œ) — «æ defines a standard involution. o 


Lemma 22.3.19. We have 

discrd(Clf°(Q)) = disc(Q)R. 
Proof. The construction of the even Clifford algebra is functorial with respect to 
localization, and the statement itself is local, so we may suppose that M = R°>,L = R 


are free with the presentation for O = clf°(Q) as in 22.3.4. 
We refer to section 15.4 and Lemma 15.4.7: we compute 


mii, j, k) = trd((ij - jk) 
= trd(—2abe + au? + cw? — aui + (bv — uw)j — cwk) (22.3.20) 


= —4abc + au? + cw? — uvw + bv? = — disc(Q) 


and discrd(O) = m(i, j, k)R as claimed. 
Alternatively, we compute directly that 


2 u v w 
2 
foe u u — 2be cw bv 
ALG) = v cw v? — 2ac au (22.3.21) 
w bv au w? — 2ab 
= —(4abc + uvw — au? — by? — cw’)? = disc(Q)? 


so disc(O) = disc(Q)*R, and the result follows by taking square roots (as ideals). O 


Corollary 22.3.22. If Q is nondegenerate, then Clf°(Q) is an R-order in the quaternion 
algebra B = Clf°(Qr). 


Proof. The standard involution has discriminant disc(nrd) = disc(O)* = disc(Or) + 
0; the result then follows from the characterization of algebras with nondegenerate 
standard involution (Main Theorem 4.4.1 and Theorem 6.4.1). o 


Remark 22.3.23. Corollary 22.3.22 gives a characteristic independent proof of the 
fact that the even Clifford algebra of a nondegenerate ternary quadratic form over F 
is a quaternion algebra over F: we proved this in 5.3.23 and Exercise 6.10 (when 
char F = 2). 


Intermediate between the general abstract definition and the explicit description in 
the free case is the situation where the modules are completely decomposable, and we 
can work with a pseudobasis. 
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Example 22.3.24. Let R be a Dedekind domain. Then we can write 
M = ae; Ẹ® be2 cez and L=l 


for fractional ideals a, b, c, l. Let V = M@rF ~ F? with basis e4, e2, e3, so then M => V 
is a ternary R-lattice. Then we may take Qr: V —> F to have the form 22.3.5, and 
CIf’ (QF) = B is a quaternion algebra with O := cif? (Q) C B an R-order. 

Extending the description in 22.3.4, we find that 


O=Rebclieact'j eabl'k (22.3.25) 


where i, j, k satisfy the multiplication table (22.3.7). We can verify directly that O is 
closed under multiplication: for example, if œ € bel! so ai € O, then 


(ai? = uai — a*bc € O 
since O(be2) = b?’ Q(e2) C lso b = Q(e2) € 1b7? and therefore 
a? be € (be h PN’) = R. 


Example 22.3.26. Let F = Q(V10) and R = Zr = Q[V10] be the ring of integers. 
Then p = (3,4 + V10) is a prime ideal over 3 that is not principal. 
Let Q: M = R? => p be the quadratic module 


Q(x, y, z) = 3x? + 3y? + (4+ V10)z?. 
We have p = Q(R?). The even Clifford algebra is then 
O=CIPQ)=R+plitp yep lk 
with the multiplication law 
i =-3(4+ VIO) 9 jk =3i 
jP=-344+V10)  ki=3j (22.3.27) 
kK? =-9 ij = (4 + VIO)k. 
We have 
diserd(O) = 4(9)(4 + V10)p™ = (2, V10)°(3, 2 + V10)? 
and in particular p { discrd(O), and 


—3(4 + V10), -3(4 + V10) 


OcB= 
F 


with disc B = (2 + w)R, so Ram B = {(2, VIO), p, œ01, 009} where 001, co are the two 
real places of F. 
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22.4 Over a PID 


In the previous two sections, we observed that the construction of the even Clifford 
algebra gives a functorial association from nondegenerate ternary quadratic modules 
to quaternion orders. In this section, we show that this functor gives a bijection on 
classes over a PID, following Gross—Lucianovic [GrLu2009, $4]. 


Main Theorem 22.4.1. Suppose that R is a PID. Then the association Q +» Clf°(Q) 
induces a discriminant-preserving bijection 


| Nondegenerate ternary quadratic m orders over R 


forms over R up to similarity up to isomorphism | (24) 


that is functorial with respect to R. 


Remark 22.4.3. The bijection can also be rephrased in terms of the orbits of a group 
(following Gross—Lucianovic [GrLu2009]). The group GL3(R) has a natural twisted 
action on quadratic forms by (gQ)(x, y, z) = (det g(O(g7!(x, y, z)*)), i.e., the usual 
action with an extra scaling factor of det g € R*. This is the natural action on the R- 
module Sym?((R?)”)@ A3 R3, or equivalently on the set of quadratic modules Q : R? > 
/\3R>. Main Theorem 22.1.1 states that the nondegenerate orbits of this action are in 
functorial bijection with the set of isomorphism classes of quaternion orders over R. 


We prove this theorem in a few steps. Throughout this section, let R be a PID. 

First, we prove that the map (22.4.2) is surjective, or equivalently that the even 
Clifford functor is essentially surjective from the category of nondegenerate ternary 
quadratic forms to the category of quaternion orders. 


Proposition 22.4.4. Every quaternion R-order is isomorphic to the even Clifford alge- 
bra of a nondegenerate ternary quadratic form. 


Proof. We work explicitly with the multiplication table, hoping to make it look like 
(22:3-7). 

Let O be a quaternion R-order. Since R is a PID, O is free as an R-module. We 
need a slight upgrade from this, a technical result supplied by Exercise 22.1: in fact, O 
has an R-basis containing 1. 

So let 1,i, j,k be an R-basis for O. Since every element of O is integral over R, 
satisfying its reduced characteristic polynomial of degree 2 over R, we have 


“2 


i =uitl 
j =vj+m 
k =wk+n 


for some /, m,n, u,v, w € R. The product jk = r — ai + qj + ak can be written as an 
R-linear combination of 1, i, j, k, with g,r,a,a € R. Letting k’ := k — q, we have 


jk’ = j(k-q)=r-ai+ ak = (r +aq)-ai+ ak’. 
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So changing the basis, we may suppose jk is an R-linear combination of 1, i,k (no j 
term). By symmetry, in the product ki we may suppose that the coefficient of k is zero 
and in ij the coefficient of i is zero. Therefore: 


jk =r-aitak 
ki=s—bj+ fi 
ij=t-—ck+yj 
As before, the other products can be calculated using the standard involution: for 
example, we have 
ik + ki = —trd(ki) + trd(k)i + trd(i)k 
= —trd(k(u — i)) + wi + uk 
= (—uw + 2s — bv + Bu) + wi + uk 
so 
ik = (s + pu — bv — uw) + (w — p)i + bj + uk. (22.4.5) 


But now from these multiplication laws, we compute that the trace of left multiplication 
iis Tr(i) = 0+u+y +u = 2u +y. But in a quaternion algebra, we have Tr(i) = 2 trd(i) = 
2u, so we must have y = 0. By symmetry, we find that a = 6 = 0. Finally, associativity 
implies relations on the structure constants in the multiplication table: we have 


J(kk) = (jk 
—nj = (r — ai)\(w — k) = rw — awi -rk + aik (22.4.6) 
—nj = (rw + as — abv — auw) + abj + (au — r)k 
using (22.4.5) with 6 = 0; so equality of coefficients of j,k implies r = au and 
n = —ab. By symmetry, we find s = by,t = cw and m = —ac,n = —ab, so we have the 
following multiplication table: 
i? = ui — bc jk=ai 
pf =vj-ac ki = bj 
k? = wk - ab ij =ck 
This matches precisely the multiplication table (22.3.7) for the even Clifford algebra 


of the quadratic form Q(x, y, Z) = ax? + by? + cz? + uyz + vxz + wxy. o 


22.4.7. More generally, if R is a domain and O is a quaternion R-order such that O 
is free as an R-module with basis 1,7, j, k, then the proof of Proposition 22.4.4 shows 
O has a basis 1,i, j,k satisfying the multiplication laws (22.3.7) of an even Clifford 
algebra; we call such a basis a good basis for O. Moreover, we have seen that given a 
basis 1, i, j, k, there exist unique 7(i), 7(j), 7(k) € R (in fact, certain coefficients of the 
multiplication table) such that 


1, i= nò, j- nG), k- n(k) 


is a good basis. 
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To conclude, we need to show that if two quaternion R-orders are isomorphic, then 
they correspond to similar ternary quadratic forms. To this end, we define an inverse. 


22.4.8. Let O C B be a quaternion R-order with R-basis 1, i, j,k. Let N € R be such 
that (N) = discrd(O); then N + 0 and is well-defined up to multiplication by R*. Let 
14 i, j#, k# be the dual basis (see 15.6.3); then trd(i#) = trd(1 - if) = 0 and similarly 
for j E kË, so 

(O*)° = {a € OË : trd(a) = 0} = Rit + Rj? + Rk}. 


We define a candidate quadratic form 

Nnrd#(O)(x, y, z) = N nrd(xi* + yj? + zk”), (22.4.9) 
well-defined up to similarity (along the way, we chose a basis and a generator for 
discrd(O)). 
Example 22.4.10. We return to Example 22.3.10. The R-order O has reduced discrim- 
inant N = 4ab. The (rescaled) dual basis is 

Ni? =2bi, Nj*=2aj, Nk# =-2k 

and if jË, kř is a basis for (08°; thus 


1 
N nrd(xi* + yj + zk#) = wo abex? — 4a’ by? + 4abz”) = —bx* — ay? +2”. 


Example 22.4.11. We return to Example 22.3.12. We have 


# ot p# [0 O\) (0 -1) /-1 0 
Pode -( se oplo 1 
and N = 1 so 


N nrd(xi* + yjf + zk*) = det E 7) =xy-7z. 


Proposition 22.4.12. If Q: R? — R is a nondegenerate ternary quadratic form with 
disc Q = N, then N nrd? (C1f(0)) is similar to Q. If O is a quaternion R-order with 
discrd(O) = (N), then N nrd#(O): R? > R has Clf°(N nrd? (O)) = O. 


Proof. Proposition 22.4.4 shows that the even Clifford functor induces a surjective 
map from similarity classes of nondegenerate ternary quadratic forms over R to iso- 
morphism classes of quaternion R-orders. If we prove the first statement, then the 
second follows from set theory (and can be verified in a similar way). 

We start with the quadratic form (22.3.5) with O satisfying the multiplication laws 
(22.3.7). Let N := disc(Q). We claim that 


Ni} = jk — kj = (au + vw) — 2ai — wj — vk 
Nj = ki — ik = (bv + uw) — wi — 2bj — uk (22.4.13) 
Nk’ =ij — ji = (cw + uv) — vi — uj — 2ck. 


22.4. OVER A PID 367 


We see that trd(Ni#) = 0 and the same with J # kË. We recall the alternating trilinear 
form m (defined in 15.4.2). By (22.3.20) we have 


mi, j,k) = -N = trd(i(jk —kj)) = - trd(i(jk — kj)) = trd(i(Ni*)) 
and 
m(j, j,k) = 0 = trd(j(jk — kj)) = — trd(j(Ni*)) 


and similarly trd(k(N i#)) = 0. The other equalities follow similarly, and this verifies 
the dual basis (22.4.13). In particular, we have trd(Ni#) = trd(Nj*) = trd(NK*) = 0. 
We then compute the quadratic form on this basis and claim that 


nrd(N(xi® + yj? + zk#)) = N(ax? + by? + cz* + uyz + vxz + wxy) (224.14) 
= NQ(x, y, z). 
Indeed, 
2nrd(Ni”) = tra((Ni®(NI®) = — trd((Ni#)2) = -(-2a)N = 2aN 
since only the term trd(Ni t; ) = N is nonzero; and 
trd(NiËN j#) = — trd(NiË# N jË) = -wN. 

The other equalities follow by symmetry. Then the claim (22.4.14) implies that 
N nrd(xi# + yj + zk#) = Q(x, y, z), as desired. o 
Corollary 22.4.15. Let O be a quaternion R-order. Then 


O = R + discrd(O)(O*)°(0*)° = R + discrd(O)O#O?. 


Proof. If we take the identifications in the proof of Proposition 22.4.12 working within 
B 2 O, we see that CIE? (N nrd#(O)) is spanned over R by the elements 
1, Ni? jt NjËkË, KEE 


where discrd(O) = (N). In order to see that the other factors belong to this ring, we 
compute 


(Ni*)? = -aN 
(Nj*)? = -bN (22.4.16) 
(Nk#)? = -cN. 


and E 
(Nj*)(Ni#) = -Nk 


(NK*)(N jË) = -Ni (22.4.17) 
(Ni*)(Nk*) = —N7j. 
If we want to throw in the factors with 1# as well, then we check: 
N1# =2N -iit — jj? — ket 
= N — 2(abc + uvw) + (au + vw)i + (bv + uw)j + (cw + uv)k. 


368 CHAPTER 22. QUADRATIC FORMS 


satisfies 
(n1#y? — N(N1#) + N(abc + uvw) = 0 (22.4.18) 


and 
(N1*)(Ni#) = —N(au + vw — ai — vk) = (Ni*)(N1*) + N(wj — vk) 
(N1*)(N j*) = —N(bv + wu — wi — bj) = (Nj*)(N 14) + N(-wi + uk) 
(NIËNK®) = —N(cw + uv — uj — ck) = (Nk?®X(N1#) + N(vi - uj). 


The result follows. Oo 
Finally, we officially combine our work to prove the main theorem of this section. 
Proof of Main Theorem 22.4.1. Combine Propositions 22.4.4 and 22.4.12. o 


Remark 22.4.19. Just as in section 5.5, we may ask about embeddings of a quadratic 
ring in an order. However, moving from the rational to the integral is a bit tricky, and 
the issue of embeddings is a theme that will return with gusto in Chapter 30. In that 
context, it will be more natural to look at a different ternary quadratic form to measure 
embeddings; just as in the case of trace zero, it is related to but not the same as the one 
obtained in the above bijection. 


22.5 Twisting and final bijection 


In this final section, we conclude with the final bijection. We must keep track of the 
extra data of an ideal class, and along the way allow coefficient ideals. Throughout, let 
R be a Dedekind domain. 

We first need the following slightly revised notion of similarity (one that ‘glues 
together’ local similarities) allowing scaling by fractional ideals. 


Definition 22.5.1. Let Q: M — I be a quadratic module with [ a fractional R-ideal. 
The twist of Q by a fractional R-ideal u is the quadratic form u®Q: ug M —> wi 
defined by (u® Q)(u ® x) = u?Q(x). 

A twisted similarity between quadratic modules Q and Q’ is a similarity between 
Q and a twist u & Q’ for some fractional R-ideal u. 


Example 22.5.2. If u = uR is a principal fractional ideal, then twisted similarities 
between Q and aQ’ = uQ’ are precisely those obtained from a similarity between Q 
and Q’, multiplied by u. In particular, if R is a PID, then the notions of similarity and 
twisted similarity coincide. 


Example 22.5.3. Two quadratic modules Q, Q’: M, M’ — I with the same codomain 
are twisted similar if and only if they are similar. Indeed, if uĉl = I, then u = R. 


22.5.4. Second, we extend the definition of the inverse in 22.4.8 using the reduced 
norm to the noetherian domain R as follows. Let O C B be a quaternion R-order. Since 
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R is a Dedekind domain, the reduced discriminant discrd(O) € R of O is an invertible 
R-ideal. Define the map 
nrd#(O): (O#)° = F 
ae) (22.5.5) 
ab nrd(a). 
Lemma 22.5.6. For a quaternion order O, the map nrd” (O) defines a ternary quadratic 
module with values in discrd(O)"!. 


Proof. The reduced norm defines a quadratic map, so we only need to verify that the 
codomain is valid. To this end, we may check locally since reduced norm and reduced 
discriminant commute with localization. Reducing to the local case, suppose R is now 
a local Dedekind domain hence a PID. Choosing a basis, we verified in (22.4.14) that 
N nrd#(O) G R, where discrd(O) = (N); the result follows. o 


Main Theorem 22.5.7. Let R be a Dedekind domain. Then the associations 
Nondegenerate ternary quadratic Quaternion orders 
| modules over R | -> | over R up to | 
up to twisted similarity isomorphism 
Q > CIf(Q) 
nrd#(O) = O 


(22.5.8) 


are mutually inverse, discriminant-preserving bijections that are also functorial with 
respect to R. 


Proof. We proved a version of this statement when R is a PID in Theorem 22.4.1. 
More generally, we work now with a pseudobasis instead of a basis, explaining the 
presence of the twisted similarity. 

The surjectivity of the even Clifford map follows by generalizing the argument in 
Proposition 22.4.4 and 22.4.7 to show that O has a good pseudobasis: see Exercise 
Pence 

Let Q: M > [be a quadratic module with O := Clf°(Q). Returning to Example 
22.3.24, we may write 

M = ae; © ber È ce3 


for fractional ideals a, b, c, and with Q r(x, y, z) := Q(xe1 + yez + ze3) in the usual form 
provided by (22.3.5). Let N := disc(Q r). Then 


O=Rebclieacdt'joabl'k. 
Consider now nrd#(O): (O#)° — discrd(O)~!: as in 22.4.8 we have 
(OP = ((6c)~!i# © (ac)! j*# @ (aby !kF. (22.5.9) 
To prepare our twisted similarity, let 


d:= abel"! =~ AM BL”. (22.5.10) 
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Then 

0(0*)° = ait @ bjf @ ck’. (22.5.11) 
We claim that the reduced norm on 08)? has values in N~! { and is similar to Q. The 
claim follows from the same calculation in the proof of Proposition 22.4.12, namely 
that nrd(xi# + yj +zk#) = N! QF(x, y, z)! We conclude that nrd#(O) is twisted similar 
to Q. Oo 


We conclude with the following application to quadratic forms. 


Corollary 22.5.12. Let Q: M > I be a ternary quadratic module and O := clf°(Q). 
Then the even Clifford map induces a bijection C1Q © TypO. 


Proof. We first claim that the even Clifford map induces an injection Gen Q — Gen O, 
giving an injection CIQ — Typ O. Indeed, let Q’ € Gen Q, so Q’: M’ —> l is locally 
isometric to Q. Let O’ := Clf°(Q’). Since Q’, Q are locally isometric, they are locally 
similar, so O’, O are locally isomorphic by Main Theorem 22.4.1, so O’ € Gen O. And 
if Q’ ~ Q are isometric, again they are (twisted) similar, so by Main Theorem 22.5.7 
we have O’ = O. 

To finish, we need to show that the even Clifford map is surjective. We pass from 
similarity classes to isometry classes in the same way as in the proof of Corollary 
5.2.6. To this end, let O’ € Gen(Q). Let Q’ := dnrd#(O’) as in (22.5.11). By the 
same rescaling argument given in Corollary 5.2.6, applying a similarity to Q’ we may 
further suppose that disc Q, = disc Qr € F*/F x2. By surjectivity in Main Theorem 
22.5.7, for every prime p of R, there exists a twisted similarity from Qin) to Qp) over 
R(py—and since each Rip) is a PID, by Example 22.5.2, these are in fact similarities. 
Taking such a similarity and considering it as a similarity over F, again repeating the 
same argument as at the end of Corollary 5.2.6, we conclude that Q’ and Q are locally 
isometric, so Q’ € Gen Q. Finally, if O’ ~ O, repeating these arguments one more time 
over R (first to go from twisted similar to similar, then to note the similarity gives rise 
to an isometry) we conclude that Q’ = Q. oO 


Remark 22.5.13. The correspondence between ternary quadratic forms and quaternion 
orders has a particularly rich history. Perhaps the earliest prototype is due to Hermite 
[Herm1854], who examined the product of automorphs of ternary quadratic forms. 
Early versions of the correspondence were given by Latimer [Lat37, Theorem 3], Pall 
[Pall46, Theorems 4-5], and Brandt [Bra43, §3ff] over Z by use of explicit formulas. 

Various attempts were made to generalize the correspondence to Dedekind domains, 
with the thorny issue being how to deal with a nontrivial class group. Eichler [Eic53, 
§14, p. 96] gave such an extension. Peters [Pet69, §4] noted that Eichler’s correspon- 
dence was not onto due to class group issues, and he gave a rescaled version that gives 
a bijection for Gorenstein orders. Eichler’s correspondence was further tweaked by 
Nipp [Nip74, §3], who opted for a different scaling factor that is not restricted to a 
class of orders, but his correspondence fails to be onto [Nip74, p.536]. 

These correspondences were developed further by Brzezinski [Brz80, $3], [Brz85, 
§3], where he connected the structure of orders to relatively minimal models of the 
corresponding integral conic; see also Remark 24.3.1 1. He revisited the correspondence 
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again in the context of Gorenstein orders [Brz82, §3] and Bass orders [Brz83b, $2]. 
Lemurell [Lem2011, Theorem 4.3] gives a concise account of the correspondence of 
Brzezinski over a PID (the guts of which are contained in [Brz82, (3.2)]). 

More recently, Gross—Lucianovic [GrLu2009, §4] revisited the correspondence 
over a PID or local ring, and they extended it to include quadratic forms of nonzero 
discriminant and without restricting to Gorenstein orders; this extension is important for 
automorphic reasons, connected to Fourier coefficients of modular forms on PGSp(6), 
as developed by Lucianovic in his thesis [Luc2003]. Balaji [Bal2007, Theorem 3.1] 
studied degenerations of ternary quadratic modules in the context of orthogonal groups 
and Witt invariants and showed that the even Clifford functor is bijective over a general 
scheme. Finally, Voight [Voi2011la, Theorem B] gave a general and functorial corre- 
spondence without any of the above restrictive hypotheses, including the functorial 
inverse to the even Clifford functor provided above. 


Remark 22.5.14. In the most general formulation of the correspondence, allowing arbi- 
trary ternary quadratic modules discriminant over all sorts of rings, Voight [Voi201 la, 
Theorem A] characterizes the image of the even Clifford functor, as follows. Let B 
be an R-algebra that is (faithfully) projective of rank 4 as an R-module. Then B is 
a quaternion ring if B ~ Clf°(Q) for a ternary quadratic module Q. Then B is a 
quaternion ring if and only if B has a standard involution and for all x € B, the trace 
of left (or right) multiplication by x on B is equal to 2 trd(x). 

For example, if we take the quadratic form Q: R? — R defined by Q(x, y, z) = 0 
identically, the multiplication table on Clf°(Q) gives the commutative ring 


Clf°(Q) = RIi, j, KI/G, j, KY. 


One can see this as a kind of deformation of a quaternion algebra (in an algebro- 
geometric sense), letting a,b > 0. 


Exercises 


> 1. Let R be a PID or local noetherian domain. Let A be an R-algebra that is free 
of finite rank as an R-module. Show that A has an R-basis including 1. [Hint: 
show that the quotient A/R is torsion-free, hence free; since free modules are 
projective, the sequence 0 —> R —> A —> A/R > 0 splits, giving A ~ R ® A/R.] 
> 2. For a free quadratic ternary form (as in 22.3.4), show that a change of basis 
p € GL3(R) acts on i, j,k € clf°(Q) by the adjugate matrix adj(p) € GL3(R) 
(where the entries of adj(p) are the 2 x 2-cofactors of p and padj(p) = det(p)). 
3. Let R be a domain and let Pic R be the group of isomorphism classes of invertible 
R-modules (equivalently, classes of fractional R-ideals in F). Show that up to 
twisted similarity, the target of a quadratic module only depends on its class in 
Pic R/2 Pic R. [See Example 9.7.5.] 
4. Finish the direct verification in Example 22.3.24 that O is closed under multi- 
plication. 
> 5. Let R be a Dedekind domain, and let O be a quaternion R-order. Show that there 
existi, j,k € Oanda, b, c C F fractional R-ideals such that O = R+ai+bj+ck and 
such that 1, i, j, k satisfy the multiplication rules (22.3.7) for some a, b, c, u, v, w € 
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F—called a good pseudobasis for O. [Hint: revisit what goes into 22.4.7; a 
simple observation will suffice! ] 


. Let Q: R? — R bea ternary quadratic form and let O := Clf°(Q). Show that 


disc nrd |g = (disc Q}. 
[Hint: see (22.3.18).] 


. Let Q: M —> R be a quadratic form over R. Construct a Clifford algebra with a 


universal property analogous to Proposition 5.3.1, and recover the even Clifford 
algebra and odd Clifford bimodule. 


. Let Q: M — R be a quadratic form such that there exists x € M such that 


Q(x) € RY. 


(a) Show that the odd Clifford bimodule Cif! (Q) is free of rank 1 as a cif? (Q). 
(b) Generalize this result to case where Q: M — L is a quadratic module. 


. Let Q: M —> R be a quadratic form over R with M of odd rank as an R-module 


and let F = Frac R. Let S := Z(Clf Q) — K := Z(Clf Qr) be the center of the 
Clifford algebra of Q. Show that S is an R-order in K. 


Show that nrd(O#) = nrd((O#)°). [Hint: use (22.4.18).] 


. Let R be a Dedekind domain with F = Frac R, let B be a quaternion algebra 


over F, and let O C B be an R-order. Let S C O be an R-order. 


(a) Suppose S C O is integrally closed. Prove that O is projective of rank 2 as 
a left S-module. 

(b) If S is not integrally closed, then show that (a) need not hold by the 
following example. Let R = Z and F = Q, let B = (-1,-1 | Q), let 


O = Z + Zpi + Zj + Zij 


for an odd prime p (that is pi, not m!). Let S = Z[pi] € O. Show that O is 
not projective as a left S-module. 

(c) Show that the property that O is projective as an S-module is a local 
property (over primes of R). 

(d) In light of (c), suppose that R is a PID, and write O in a good basis (22.3.7). 
Suppose that § = R[i] with i? = ui — bc. Show that O is projective as an 
S-module if and only if the quadratic form bx? + uxy + cy? represents a 
unit. 

(e) Using (d), conclude in general that if S has conductor coprime to discrd O, 
show that O is projective as an S-module. 


Let R be a global ring with F := Frac R, let B, B’ be quaternion algebras over 
F, and let O C B and O’ C B’ be R-orders. Consider the quaternary quadratic 
forms Q := nrd |o: O > R and similarly Q’ on O’. 


(a) Show that Q is isometric to Q’, then there is an isomorphism of F-algebras 
B= B’. 
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(b) In light of (a), suppose O,O’ © B = B’. Show that O, O’ are isomorphic 
as R-orders if and only if Q,Q’ are isometric as (quaternary) quadratic 
modules. [Hint: if Q is isometric to Q’, then show that there is a similarity 
on the trace zero elements of the duals, thereby giving an isomorphism 


O>O' j. 


13. Let Q: M — L be a quadratic module. Show that the even Clifford algebra 
clf°(Q) with its map ı : M®M@LY > clf°(Q) has the following universal 
property: if A is an R-algebra and 14 : M@M®@L — A is an R-module 
homomorphism such that 


(i) tca(x @ x 8 f) = f(O(x)) for all x € M and f € LY, and 
(ii) tax @ y8 fay ® Z@g) = f(QV))ta(x 8z 8 g) for all x, y,z € M and 
f.g E LY, 


then there exists a unique R-algebra homomorphism ¢ : Clf°(Q) — A such that 
the diagram 
M&M & LY ——+ClIf(Q) 
| 
1$ 


Y 
A 


LA 


commutes. Conclude that the pair (Clf°(Q), v) is unique up to unique isomor- 
phism. 
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Chapter 23 


Quaternion orders 


In the previous chapter, we gave a rather general classification of quaternion orders in 
terms of ternary quadratic modules. In this chapter, we take a guided tour of the most 
important animals in the zoo of quaternion orders, identifying those with good local 
properties. We continue in the next chapter with a second visit to the zoo. 


23.1 > Highlights of quaternion orders 


We begin in this section by providing some highlights of this tour. Let B be a quaternion 
algebra over Q of discriminant D := disc B and let O C B be an order with reduced 
discriminant N := discrd(O). Then N = DM with M € Zs}, and O is maximal if and 
only if N = D. 


23.1.1 (Maximal orders). The nicest orders are undoubtedly the maximal orders, those 
not properly contained in another order. An order is maximal if and only if it is locally 
maximal (Lemma 10.4.3), i.e. p-maximal for all primes p; globally, an order O is 
maximal if and only if N = D (i.e., M = 1). 

We have either B ~ M2(Q,) or B is a division algebra over Q, (unique up to 
isomorphism). If B is split, then a maximal order is isomorphic (conjugate) to M2(Z,), 
and the corresponding ternary quadratic form is the determinant xy — z? (see Example 
22.3.12). If instead B is division, then the unique maximal order is the valuation ring, 
with corresponding anisotropic form x?—ey* + pz” for p + 2, where e € Zis a quadratic 
nonresidue modulo p (and for p = 2, the associated form is x? + xy + y? +227). 

Maximal orders have modules with good structural properties: all lattices Z c B 
with left or right order equal to a maximal order O are invertible (Theorem 18.1.2). 

There is a combinatorial structure, called the Bruhat-Tits tree, that classifies 
maximal orders in M2(Q,) (as endomorphism rings of lattices, up to scaling): the 
Bruhat-Tits tree is a p + 1-regular tree (see section 23.5). 


Examining orders beyond maximal orders is important for the development of 
the theory: already the Lipschitz order—an order which arises when considering if a 
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positive integer is the sum of four squares—is properly contained inside the Hurwitz 
order (Chapter 11). 


23.1.2 (Hereditary orders). More generally, we say that O is hereditary if every left 
or right fractional O-ideal (i.e., lattice 7 C B with left or right order containing O) is 
invertible. Maximal orders are hereditary, and being hereditary is a local property. A 
hereditary Zp-order Op C Bp is either maximal or 


o=% B)={(¢ Ararat ets) oman 6, 
with associated ternary quadratic form xy — pz”. Thus O C B is hereditary if and only 
if discrd(O) = DM is squarefree, so in particular gcd(D, M) = 1. 

Hereditary orders share the nice structural property of maximal orders: all lattices 
I C B with hereditary left or right order are invertible. The different ideal diff Op is 
generated by any element u € Op such that 4? € pZp. 


23.1.3 (Eichler orders). More generally, we can consider orders that are “upper tri- 
angular modulo M” with gcd(D, M) = 1 (i.e., avoiding primes that ramify in B). The 
order 


p°Zp Zp 


is called the standard Eichler order of level p° in M2 (Qp). A Zp-order Op E M2(Qp) 
is an Eichler order if O, is isomorphic to a standard Eichler order. The ternary 
quadratic form associated to an Eichler order of level p° is xy — p°z?. 

Globally, we say O C B is a Eichler order of level M if discrd(O) = N = DM 
with gcd(D, M) = 1 and O, is an Eichler order of level p° for all p° || M. In particular, 
Op is maximal at all primes p | D. Every hereditary order is Eichler, and an Eichler 
order is hereditary if and only if its level M (or reduced discriminant N) is squarefree. 
A maximal Zp-order Op C M2(Zp) is an Eichler order of level 1 = p°. Eichler orders 
play a crucial role in the context of modular forms, as we will see in the final part of 
this monograph. 

This local description of Eichler orders also admits a global description. The 
standard Eichler order O, can be written 


| Zp A Č M2(Zp) 


Zp p °Zp 


Z Z 
Op = | p r) = M2(Zp) N a, Zp 


= =i 
p°Zp Zp ) = Mx(Zp) Nw M2(Zp)o 


as the intersection of a (unique) pair of maximal orders, with 


0 1 
T = al E NarxQp)(Op) (23.1.4) 
a generator of the group Noi,@,)(Op)/Q,O07 = Z/2Z, and w? = p°. The different 
diff O, is the two-sided ideal generated by w. 

From the local-global dictionary, it follows that O C B is Eichler if and only if O 
is the intersection of two (not necessarily distinct) maximal orders. 
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23.2 Maximal orders 


Throughout this chapter, we impose the following notation: let R be a Dedekind domain 
with field of fractions F = Frac R, let B be a quaternion algebra over F, and let O € B 
an R-order. 


23.2.1. We make the following convention. When we say “R is local”, we mean that 
R is a complete DVR, and in this setting we let p = zR be its maximal ideal, and 
k = R/p the residue field. When we want to return to the general context, we will say 
“R is Dedekind”. 


Recall that an R-order is maximal if it is not properly contained in another order. We 
begin in this section by summarizing the properties of maximal orders for convenience. 


23.2.2. Being maximal is a local property (Lemma 10.4.3), so the following are equiv- 
alent: 


(i) O is a maximal R-order; 
(ii) Oc) is a maximal R(p)-order for all p C R; and 
(iii) Op is a maximal R,-order for all p G R. 


We recall (Theorem 15.5.5) that an order over a global ring R is maximal if and 
only if discrd(O) = discr(B). Local maximal R-orders have the nice local description. 


23.2.3. Suppose R is local and that B ~ M>(F) is split. Then by Corollary 10.5.5, 
every maximal R-order in M2(F) is conjugate to M2(R) by an element of GL2(R), 
i.e. O = Mo(R). We have discrd(O) = R. All two-sided ideals of O are powers of 
rad(O) = pO, and 

O/rad(O) = Mo(k). 


The associated ternary quadratic form is similar to Q(x, y,z) = xy — z?, by Example 
22.3.12 and the classification theorem (Main Theorem 22.1.1). Finally, 


Npx(O) = Nonr\(M2(R)) = F*O*. (23.2.4) 


23.2.5. Suppose R is local but now that B is a division algebra. Then the valuation ring 
O c Bis the unique maximal R-order by Proposition 13.3.4. 

Suppose further that the residue field k is finite (equivalently, that F is a local 
field). Then Theorem 13.3.11 applies, and we have 


K, 7 
= i C | — 
O sesje ($5) 


where K 2 F is the unique quadratic unramified extension of F and S its valuation 
ring. We computed in 15.2.12 that discrd(O) = p. All two-sided ideals of O are powers 
of the unique maximal ideal rad(O) = P = OjO, and £ := O/rad(O) is a quadratic 
field extension of k. By Exercise 13.7, we have P = [O, O] equal to the commutator. 
We also have P = diff O; this can be computed directly, or it follows from the condition 
that nrd(diff O) = discrdO = p. 
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Write S = R[i] with i? = ui — b, and u,b € R. Then l,i, j,k where k = —ij is an 
R-basis for O. We have 


K? = i(ji)j = iéj)j = nrd(i)n, 


so trd(k) = 0, and k = —k = ij. This gives multiplication table 


P=ui-b jk=-ni 
pon ki = bj (23.2.6) 
k? = br ij=k 


realizing the basis as a good basis in the sense of 22.4.7; the associated ternary quadratic 
form is 


nrd#(O)(x, y, Z) = -nx + by? +uyz+ 2 = -nx + Nmx F(z + yi). (23.2.7) 


Finally, since the valuation ring is the unique maximal order and conjugation 
respects integrality, we have 
Npx(O) = B*. (23.2.8) 


Finally, lattices over maximal orders are necessarily invertibility, as follows. 


23.2.9. All lattices 7 C B with left or right order equal to a maximal order O are 
invertible, by Theorem 18.1.2, proven in Proposition 18.3.2. (We also gave a different 
proof of this fact in Proposition 16.6.15(b).) 


The classification of two-sided ideals and their classes follows from that of hered- 
itary orders: see 23.3.19. 


23.3 Hereditary orders 


Hereditary orders were investigated in section 21.4 in general; here, we provide a quick 
development specific to quaternion algebras. As mentioned before, a good general 
reference for (maximal and) hereditary orders is Reiner [Rei2003, Chapters 3-6, 9]. 

We recall that O is hereditary if every left (or right) ideal J € O is projective 
as a left (or right) O-module. Being hereditary is a local property 21.4.4 because 
projectivity is. 


23.3.1. Suppose R is local. By Main Theorem 21.1.4 and Corollary 21.1.5, the follow- 
ing are equivalent: 


(i) O is hereditary; 
(ii) rad O is projective as a left (or right) O-module; 
(iii) OL (rad O) = Or(rad O) = O; 
(iv) rad O is invertible as a (sated) two-sided O-ideal; and 


(v) either O is maximal or 
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R R 
T R| EMP = B. 


We now spend some time investigating ‘the’ local hereditary order that is not 
maximal. So until further notice, suppose R is local and let 


Oo(p) := a C M2(R). 


To avoid clutter, we will just write O = Oo(p). We have 
discrd(O) = [M2(R) : O]r disc(M2(R)) = p. (23.3.2) 


23.3.3. A multiplication table for O is obtained from the one for Mz(R) in Example 
22.3.12, scaling j by m in (22.3.12), which gives the same multiplication laws as 
(22.3.13) except now ij = -rk and c is scaled by x. Therefore, the similarity class of 
ternary quadratic forms associated to O is represented by 


Q(x, y, Z) = xy = 12". (23.3.4) 
23.3.5. Let J := rad(O). Then 
je $ l (23.3.6) 
p p 
by (21.3.6), and we find 
O/J ~ ic 7 xkxk (23.3.7) 
as k-algebras. Now let 
pe k i (23.3.8) 
Then a direct calculation yields 
J = Ow = wO (23.3.9) 


in agreement with 23.3.1(ii)—23.3.1(iii), and J is an invertible O-ideal. Since w =n, 
we have 
J? = pO. (23.3.10) 


In particular, J7! = #7!J, and the powers of J give a filtration 


ODSDpO IP Ds... (23.3.11) 


23.3.12. We compute that 


-l 
w M(R)o™! = f p H 


and hence 
O = M2(R) N w M2(Ryæ `! 


is the intersection of two maximal orders. 
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Lemma 23.3.13. The group Idl(O) = PIdI(O) is generated by J = radO, with J? = 
pO. 


Proof. We have Idl(O) = PIdl(O) since R is local: invertible is equivalent to principal 
(Main Theorem 16.6.1). 

Let J C O be an invertible two-sided O-ideal. Then by (23.3.11), we can replace 7 
by a power of J~! and suppose that O Ç J C J. Invertible means locally principal, so 
I = Oa = aO and 


[O : Ir = nrd(I)? = nrd(a)’R | [O : J]R =p. 


Thus [O : I]R = p?, so [Z : J]r = R and Z = J. (This also follows directly from 
Proposition 16.4.3.) 

Here is a second computational proof. The image I/J € O/J = k x k (23.3.7) is a 
two-sided ideal, therefore 


R ai iel ele aE 
p p/\0 0 0 p p p? 
we get a contradiction with the other possibility by multiplying instead on the left. A 


third “pure matrix multiplication” proof is also requested in Exercise 23.1. 
The second statement was already proven in (23.3.10). o 


Corollary 23.3.14. We have Ngx(O)/(F*O*) = Z/2Z generated by w, and 


F”? RX, if eis even; 


nrd(Ngx(O)) = 2 if eis odd. 


Proof. By (18.5.4), we have an isomorphism 
Npx(O)/(F*O*) = PIdl(O)/ PIdl(R); 


by Lemma 23.3.13, the latter is generated by J = wO with J? = nO. The computation 
of reduced norms is immediate. o 


23.3.15. Lemma 23.3.13 also implies the description 
J = [0,0] (23.3.16) 


as the commutator. Since O/J is commutative, we know [O, O] € J; but [O,O] Ç 
J? = pO since O/pO is noncommutative. We also have J = diff O for the same 
reason, since nrd(diff O) = discrd O = p. (A matrix proof of these facts are requested 
in Exercise 23.8.) 
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23.3.17. We now classify the left O-lattices, up to isomorphism. Each such O-lattice 
is projective, since O is hereditary. 

By the Krull-Schmidt theorem (Theorem 20.6.2), every O-lattice can be written as 
the direct sum of indecomposables, so it is enough to classify the indecomposables; 


= F? the simple B = 


and we did just this in 21.5.3. Explicitly, we have V = p 


M>(F)-module, and we take the O-lattice M = (k) c V. We have JM = a and 


R p 
p 


IPM = 4 = nM, and M, JM give a complete set of indecomposable left O-modules. 


As expected, O = JM @ M is a decomposition of O into projective indecomposable 
left O-modules. 

The preceding local results combine to determine global structure. Now let R be a 
global ring. 


Lemma 23.3.18. O is hereditary if and only if discrd(O) is squarefree. 


Proof. We argue locally; and then we use the characterization (iv), the computation 
of the reduced discriminant (23.3.2), and the same argument as in Theorem 15.5.5 to 
finish. o 


23.3.19. Let O be a hereditary (possibly maximal) R-order. By Theorem 21.4.9, we 
know that the group Idl(ỌO) is an abelian group generated by the prime (equivalently, 
maximal) invertible two-sided ideals. We claim that the map 


{Prime two-sided invertible O — ideals} < {Prime ideals of R } 


(23.3.20) 
PPAR 


is a bijection, generalizing Theorem 18.3.6. If p { N then we have P = pO; and if p | D 
then we have a prime two-sided ideal P = O N rad(Oy) with P? = pO. Otherwise, 
p | N but p t D, so Op is hereditary but not maximal; from the local description in 
Lemma 23.3.13, we get a prime ideal P = ONrad(O,) with P? = pO as in the ramified 
case. This proves (23.3.20), and that the sequence 


0 > Idl(R) > Idl(O) > I] Z/2Z > 0 (23.3.21) 
plot 
is exact. 
Taking the quotient by PIdl(R), we obtain the exact sequence 
0 — CIR > Picr(O) > I] Z/2Z > 0. (23.3.22) 
plot 


In particular, if D = (1), then Picr(O) = CI R. Finally, the group of two-sided ideals 
modulo principal two-sided ideals is related to the Picard group by the exact sequence 
(18.5.5): 


0 — Npgx(O)/(F*O*) = Picr(O) > Idl(O)/ PIdl(O) > 0 
aF*O* & [a0] = [Oa] 


(This exact sequence is sensitive to O even within its genus: see Remark 18.5.9.) 
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23.4 Eichler orders 


We now consider a more general class of orders inspired by the hereditary orders. 


Definition 23.4.1. An Eichler order O C B is the intersection of two (not necessarily 
distinct) maximal orders. 


23.4.2. By the local-global dictionary for lattices (and orders), the property of being 
an Eichler order is local. Moreover, from 23.3.12, it follows that a hereditary order is 
Eichler. 


Proposition 23.4.3. Suppose R is local and O © B = M)(F). Then the following are 
equivalent: 


(i) O is Eichler; 
= R R 
(ii) O = e 4 


, 


Gii) O contains an R-subalgebra that is B*-conjugate to (5 d ; and 


(iv) O is the intersection of a uniquely determined pair of maximal orders (not 
necessarily distinct). 


Proof. We follow Hijikata [Hij74, 2.2(i)]. Apologies in advance for all of the explicit 
matrix multiplication! 

We prove (i) > (ii) © (iii) and then (ii) > (iv) > (i). The implications (ii) > (iii) 
and (iv) > (i) are immediate. 

So first (i) > (ii). Suppose O = O; N O2. All maximal orders in B are B*- 
conjugate to M>(R), so there exist œ, @ € B* such that O; = a M>(R)a; fori = 1,2. 
Conjugating by a1, we may suppose a = 1 and we write a = a; ' for convenience, 
so O =~ Mo(R) N a M2(R)a!. Scaling by z, we may suppose œ € M2(R)\7 M2(R). 
By row and column operations (Smith normal form, proven as part of the structure 
theorem for finitely generated modules over a PID), there exist 6, y € GL2(R) such that 


Bay = ( a 


is in standard invariant form with e > 0. Then 


O = BOB! = M2(R) N Ba M2(R)a!B7! (23.4.4) 
since 6 € GL2(R), and 
4-4. f1 O\(R RV O\_[R p“ 
Ba M2(R)a ot =(; J é a b re} = (ye a (23.4.5) 


so 


O 
R 


R R\ [R p*\_ [R R 
Pae a 
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To show (iii) > (ii), we may suppose O 2 0 


1 0 0 0 F 0 SIN 
i o) en = b i! Then Oe; © ie al Let 7;; be the projection onto the 


= Re), + Ren with ej; = 


ij-coordinate. Then 


R 0 
m1(Oe11) = trd(Oe11)e11 C R= (0 o) 


and so equality holds. Therefore 


O 2 mı (0) = k 0 


for some a € Z. Arguing again with the other matrix unit e22 we conclude that 


b 
O= | A (23.4.7) 


with a, b € Z. Multiplying 


R p? R p? B Rage p? 
pt R pI R = pl Roepe? 


we conclude that e = a + b > 0. Such an order is maximal if and only if a + b = 0: if 


b -a 
a > 0, then Ap (e p P | and similarly if a < 0. The element a = 0 A 
p R p R m’ 0 


aimn _ (O 1\/R p?\(0 m\_ [RR 
a Ou = (Ke alee at a) (pe "| (23.4.8) 


(and normalizes the given subalgebra) so the result is proven. 


R R 
pe a be a maximal R- 


has 


To conclude, we show (ii) > (iv). Let O’ 2 O = | 


order. Since G c O’, the argument of the previous paragraph applies, and 


O’ = f p ) with c € Z satisfying 0 < c < e. The intersection of another such 
p* 


maximal orders with the parameter d is the order (i R 


) where a = max(c, d) 


and b = —min(c,d) so is equal to r d if and only if e = a = max(c, d) and 


0 = b = min(c, d), which uniquely determine c, d up to swapping. o 


Remark 23.4.9. There is a further important equivalent characterization of Eichler 
orders as being maximal or residually split: see Lemma 24.3.6. 


Corollary 23.4.10. Every superorder of an Eichler order is Eichler. 
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Proof. The corollary is local, so we may apply Proposition 23.4.3(iii) to every super- 
order. g 


Definition 23.4.11. Suppose R is local. The standard Eichler order of level p° in 
M2(F) is the order 
e _ [R R 
Oo(p ) = e a : 
By Proposition 23.4.3, if R is local then an order O € M2(F) is Eichler if and only 
if O is conjugate to a standard Eichler order. 


Suppose until further notice that R is local, and let O = Oo(p°) be the standard 
Eichler order of level p? with e > 0. 


23.4.12. First two basic facts about the Eichler order O of level pf: We have 
discrd(O) = [M2(R) : O]r = pf 


and its associated ternary quadratic form Q(x, y, z) = xy — n° z? as in 23.3.3. 


0 1 
o=% o) €O 


O = M2(R)N w! Mo(R)o 


23.4.13. Let 


Then 


as in (23.4.6); by Proposition 23.4.3 these two orders are the uniquely determined pair 
of maximal orders containing O. We have w? = n°, and so w € Ngx(O). It follows 
(and can be checked directly) that J = Ow = wO is a two-sided O-ideal. If e = 0, we 
have I = O. 


Proposition 23.4.14. Suppose that e => 1. Then we have Ngx(O)/(F*O*) = (w) = 
Z/2Z. Moreover, the group Id\(O) = PIdl(O) is abelian, generated by I and pO with 
the single relation I? = p°O. 


Proof. Leta € Ngx(O). Then by uniqueness of the intersection in 23.4.13, conjugation 
by @ permutes these two orders, so we have a homomorphism Ngx(O) to a cyclic group 
of order 2. This homomorphism is surjective, since w transposes the orders. If œ is in 
the kernel, then œ € Ngx(M2(R)) = F* GL2(R) and unconjugating the second factor 
we similarly get waw”! € F* GL2(R), so 


æ € FX(GL2(R) Nw | GL2(R)@w) = F*O*. 


Again since R is local, we have Idl(O) = PIdl(O), and by (18.5.4), we have an 
isomorphism 
Npx(O)/(F*O%) = PIdl(O)/ PIdl(R) 


so Idl(O) is generated by / and the generator p for PIdI(R). o 
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23.4.15. It is helpful to consider the Jacobson radical of an Eichler order, to compare 
to the hereditary case. 


J= | y 7 
pe p 
We claim that J = rad O. We verify directly that J C O is a two-sided ideal and 
f 
Ps te i) c pO, (23.4.16) 


where f = min(e, 2), so by Corollary 20.5.5, J C rad O; on the other hand, the quotient 
O/J~kxk (23.4.17) 


is semisimple, so rad O € J by Corollary 20.4.1 1(a). 
However, the radical is not an invertible (sated) two-sided O-ideal unless O is 
hereditary (e = 1), by 23.3.1. Indeed, we verify that 


R 
a 


(Exercise 23.6); this recovers O if and only if e = 1, and if e > 2 then it is an Eichler 
order of level p°~? (conjugating as in (23.4.8)). By (23.4.16), if e > 2 then J? = pJ, 
and so we certainly could not have J invertible! 


= 
OLJ) = [ A ) = OR(J) (23.4.18) 


We now repackage these local efforts into a global characterization. 


23.4.19. Suppose that R is a global ring. Let discr B = D and let O be an Eichler 
order with discrdO = St. If p | D, then By has a unique maximal order, so (as an 
‘intersection’ ) Oy is necessarily the maximal order. If p { D, and ordy N = e > 0, then 
Oy is isomorphic to the standard Eichler order of level p°. 

We have N = DIM with M C R and we just showed that Mt is coprime to D. We 
call M the level of the Eichler order O. The pair D, M (or D, N) determines a unique 
genus of Eichler R-orders, i.e., this data uniquely determines the isomorphism class of 
Oy for each p. 

Putting together Proposition 23.4.14 together with 23.3.19 for the remaining primes 
where the order is maximal, we have an exact sequence 


0 > Idl(R) > Idl(O) > I] Z[2Z => 0 (23.4.20) 
plot 
and we may take the quotient by PIdl R to get 


0 > CIR > Picr(O) > | [zz >0 (23.4.21) 
pit 


Remark 23.4.22. Eichler [Eic56a] developed his orders in detail for prime level, and 
employed them in the study of modular correspondences [Eic56c] and the trace formula 
[Eic73] in the case of squarefree level. (As mentioned in Remark 24.1.5, Eichler’s 
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earlier work [Eic36, §6] over Q included some more general investigations, but his 
later work seemed mostly confined to the hereditary orders.) Hijikata [Hij74, $2.2] later 
studied these orders in the attempt to generalize Eichler’s result beyond the squarefree 
case; calling the orders split (like our name, residually split). Pizer [Piz73, p. 77] may 
be the first who explicitly called them Eichler orders. 


Remark 23.4.23. An R-order O C M,,(F) that contains a GL,(F)-conjugate of the 
diagonal matrices diag(R, ..., R) (equivalently, containing n orthogonal idempotents) 
is said to be tiled. By 23.4.3, an order O € Mo(F) is tiled if and only if itis Eichler. Tiled 
orders also go by other names (including graduated orders) and they arise naturally in 
many contexts, including representation theory [P183] and modular forms. 


23.5 Bruhat-—Tits tree 


In the previous section, we examined Eichler orders as the intersection of two maximal 
orders. There is a beautiful and useful combinatorial construction—a tree—which 
keeps track of the containments among maximal orders in aggregate, as follows. For 
further reading, see Serre [Ser2003, $II.1]. 

Let F be a nonarchimedean local field with valuation ring R, maximal ideal p = 7R, 
and residue field k := R/p, and let q := #k. Let B := Mo(F), and let V := F? as column 
vectors, so that B := Endp (V) acts on the left. 

Recalling again section 10.5, every maximal order O C B has O = Endr(M) where 
M c V is an R-lattice. So to understand maximal orders, it is equivalent to understand 
lattices (and their containments) and the positioning of one lattice inside another. 


Lemma 23.5.1. Let L,M C V be R-lattices. Then there exists an R-basis x), x2 of L 
such that nî x,,t?.x2 is an R-basis of M with fi, fp € Zand fi < fa. 


Proof. Exercise 23.7. o 


Lemma 23.5.2. We have Endg(L) = Endg(M) if and only if there exists a € F* such 
that M = aL. 


Proof. If M = aL for a € F*, then Endr(L) = Endr(M). Conversely, suppose 
that Endg(L) = Endr(M). Replacing M by aM with a € F*, we may suppose 
without loss of generality that L C M. By Lemma 23.5.1, we may identify L = R? 
with the standard basis and M = wie, @ we, with fi, fp € Zso; rescaling again, 
and interchanging the basis elements if necessary, we may suppose e; = 0. Then 
Endr(M) = Endr(L) ~ M2(R) implies fp = 0 and L = M. o 


With this lemma in mind, we make the following definition (recalling this definition 
made earlier in the context of algebras). 


Definition 23.5.3. Two R-lattices L, L’ C V are homothetic if there exists a € F* such 
that L’ = aL. 
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The relation of homothety is an equivalence relation on the set of R-lattices in V, 
and we write [L] for the homothety class of L. 


23.5.4. Let L C V be an R-lattice. In a homothety class of lattices, there is a unique 
lattice L’ C L in this homothety class satisfying any of the following equivalent 
conditions: 


(i) L’ C L is maximal; 
(ii) L’ É mL; and 
(iii) L/L’ is cyclic as an R-module (has one generator). 


These equivalences follow from Lemma 23.5.1: they are equivalent to fı = 0, and 
correspond to a maximal scaling of L’ by a power of z within L. For such an L’, we 
have L/L’ ~ R/x/ R for a unique f > 0. 


Definition 23.5.5. Let 7 be the graph whose vertices are homothety classes of R- 
lattices in V and where an undirected edge joins two vertices (exactly) when there exist 
representative lattices L, L’ for these vertices such that 


mL CL’ CL. (23.5.6) 


Equivalently, by Lemma 23.5.2, the vertices of J are in bijection with maximal 
orders in B = M2 (F) by [L] © Endg(L) for every choice of L € [L]. 


23.5.7. The adjacency relation (23.5.6) implies L’ Ç mL € aL’, so it is sensible to 
have undirected edges. 
A class [L’] has an edge to L if and only if the representative L’ in 23.5.4 has 


feHl. 


Proposition 23.5.8. The graph T is a connected tree such that each vertex has degree 
qt. 


Proof. We have L/xL = k?, and so the lattices L’ satisfying (23.5.6) are in bijection 
with k-subspaces of dimension 1 in L/L; such a subspace is given by a choice of 
generator up to scaling, so there are exactly (q? — 1)/(q — 1) = q + 1 such, and each 
vertex has g + 1 adjacent vertices. The graph is connected: given two vertices, we may 
choose representative lattices L, L’ such that L’ C L as in 23.5.4. The quotient L/L’ is 
cyclic, so by induction the lattices L; = 2'L + L’ for i = 0,..., f have L; adjacent to 
Li+1, and Lo = L and Ly = L’, giving a path from [L] to [L’]. 

The following argument comes from Dasgupta—Teitelbaum [DT2008, Proposition 
1.3.2]. Suppose there is a nontrivial cycle in 7 


nm’L=L, Ç Ls-1 CCL, CLo=L (23.5.9) 


so that v > 1. We may suppose this cycle is minimal, meaning that no intermediate 
lattices are equivalent. The quotient L/L, = L/a’L = (R/p”)* is not cyclic; let i be 
the largest index such that L/L; is cyclic but L/Li+1 is not. Thus L/L; = R/p! ®R/p, 
and so a’ annihilates L/L,;,, and a’'L C Lj41. Since L/L; is cyclic, just as in the 
previous paragraph, we conclude L;_, = 2’'L + Li. Putting these together, we find 
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that mLi-1 = a'L + aL; C Lj41, the latter by definition of the adjacency between L; 
and Lj+1. By adjacency, Li G mL © Li+ı. And again by adjacency, Li+ı is maximal 
inside L;, so mL;-; = Li+1. This contradicts the minimality of the cycle; we conclude 
that T has no cycles. o 


We call 7 the Bruhat-Tits tree for GL2(F). The Bruhat-Tits tree for F = Q% is 
sketched in Figure 23.5.10. We write Ver(7 ) and Edg(7 ) for the set of vertices and 
edges of T. 


Figure 23.5.10: Bruhat-Tits tree for GL2(Q2) 


23.5.11. We define a transitive action of GL2(F) on T as follows. 

Let L C V be a lattice. Choose a R-basis for L and put the vectors in the columns 
of a matrix 8 € M2(F). Since these columns span V over F, we have 8 € GL2(F), 
and the matrix 8 is well-defined up to a change of basis over R; therefore the coset 
BGL2(R) € GL2(F)/ GL2(R) is well-defined. (Check that the action of change of basis 
on columns is given by matrix multiplication the right.) Therefore a homothety class 
[L] gives a well-defined element of GL2(F)/(F*% GL2(R)). Conversely, given such a 
class we can consider the R-lattice spanned by its columns, and its homothety class is 
well-defined. We have shown there is a bijection 


Ver(T) e GLo(F)/(F* GL2(R)). (23.5.12) 


The group GL2(F) acts transitively on the left on the cosets GL2(F)/(F* GL2(R)) and 
we transport via the bijection (23.5.12) to an action on Ver(7 ). 

We claim this action preserves the adjacency relation on T: if L 2 L’ are adjacent, 
then by invariant factors we can choose a basis x1, x2 for L such that x1, 1x2 is a basis 
for L’, i.e., 


P =B ( (23.5.13) 


23.5. BRUHAT-TITS TREE 389 


If a € GL2(F), then multiplying (23.5.13) on the left by œ shows that aL, aL’ are 
adjacent. 


23.5.14. The tree 7 has a natural notion of distance d between two vertices, given 
by the length of the shortest path between them, giving each edge of F length 1. 
Consequently, we have a notion of distance d(O, O’) between every two maximal 
orders O, O’ C B. 


Lemma 23.5.15. Let L, L’ be lattices with bases x1, x2 and nf x,,n°*F x2, respectively. 
Then in the basis xı, x2, we have O = Endr(L) = M2(R) and O’ = Endg(L’) = 
R yp h 

f R ) and d(O, O’) = e. 

Proof. The statement on endomorphism rings comes from Example 10.5.2; we may 

suppose up to homothety that L’ has basis x1, 7° x2; the maximal lattices as in 23.5.4 

are given by L; = Rx, + p'Rx2 with i = 0,...,e, so the distance is d([L], [L']) = e. 
o 


23.5.16. Importantly, now, we turn to Eichler orders: they are the intersection of two 
unique maximal orders, and so correspond to a pair of vertices in J, or equivalently a 
path. By Lemma 23.5.15, the standard Eichler order of level p? corresponds to a path 
of length e, and by transitivity the same is true of every Eichler order. The normalizer 
w of an Eichler order 23.4.13 acts by swapping the two vertices. Each vertex of the 
path corresponds to the e + 1 possible maximal superorders. 

In this way, the Bruhat-Tits tree provides a visual way to keep track of many 
calculations with Eichler orders. 


Remark 23.5.17. The theory of Bruhat-Tits trees beautifully generalizes to become 
the theory of buildings, pioneered by Tits; see the survey by Tits [Tit79], as well as 
introductions by Abramenko—Brown [AB2008]. 


Exercises 


Unless otherwise specified, let R be a Dedekind domain with F = Frac Rand let O € B 
be an R-order in a quaternion algebra B. 


1. Let R be a DVR with maximal ideal p = (7), and let 
R R 
aki 


€ Ngx(O). After scaling, we may suppose that 
x, y, z, w € R. By determinants, show that if x, w € R*, then a € O*. 


(b) Compute 


x 
(a) Suppose that a = k 


n(wy — xz) x? — ny? ) 


guo we -= 12") —m™(wy — xz) 


Show that z | x, and then z | w, whence a € wO. 
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> 2. 


> 4. 


> 6. 


> 7. 


10. 
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(c) Conclude that Ngx(O)/(F*O%) is generated by w = L a 


[See also the matrix proof by Eichler [Eic56a, Satz 5], which uses a normal form 
for one-sided ideals simplifying the above computation. ] 

Let K 2 F be a field extension, let S be the integral closure of R in K, and let 
a, b C F be fractional ideals of R. Show that a = b if and only if aS = bS. 


. Extend Lemma 13.4.9, and show that if R is a DVR and O is an Eichler R-order 


then nrd(O*) = R*. 
Suppose R is local and O is a hereditary order. Show that if u € O has x? = z, 
then u generates Ngx(O)/(F*O%). 


. Suppose R is local and O C B = M)(F) is the intersection of two maximal 


orders. Give another (independent) proof that O is isomorphic to a standard 
Eichler order which replaces matrix calculations in Proposition 23.4.3 with 
some representation theory as follows. 


(a) Write O ~ M2(R) N O’. Let e;, be the top-left matrix unit and let J = 
M>(R)e11. Show J’ = O'e}; is an R-lattice in V = M>(F)e1, = F°. 

(b) Use elementary divisors to show that there exists an R-basis x1, x2 of I 
such that x1, 7f x2 is an R-basis for I’. 


(c) Show that the corresponding change of basis matrix œ = 0 y has 
I’ = al, and use this to identify O with the standard Eichler order of 


level p°. 


[See Brzezinski [Brz83a, Proposition 2.1].] 


Let R be local and let O be the standard Eichler order of level p° for e > 1. Let 
J = rad O. Show that 


R p! 


OL(J) = (a R 


) = ORk(J). 


Prove Lemma 23.5.1. [Hint: use direct matrix methods or the theory of invariant 
factors. ] 


. Let R be local and let O be a hereditary quaternion R-order. Show that rad O = 


[O, O] is the commutator (cf. Exercise 13.7) and that diff O = rad O. 


. Let R be local. Let O, O’ C B be maximal R-orders. Recall that O, O’ are vertices 


in the Bruhat—Tits tree. Define the distance dist(O, O’) to be the distance in the 
Bruhat-Tits tree between the respective vertices. Show that 


[O : ONO’ = dist(O, O’) = [O’: ONO’). 


Let R be local, and let O be an Eichler order of level p°. Consider the graph 
whose vertices are R-superorders O’ 2 O in B and witha directed edge whenever 
the containment O’ > O is proper and minimal. What does this graph look like? 
[Hint: use the Bruhat-Tits tree; it helps to draw the Eichler orders of the same 
level at the same height. ] 
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Chapter 24 


Quaternion orders: second meeting 


In this chapter, we continue our tour of quaternion orders with some more advanced 
species. 


24.1 » Advanced quaternion orders 


Let B be a quaternion algebra over Q of discriminant D = disc B and let O C B be an 
order with reduced discriminant N = discrd(O). Then N = DM with M € Zs}. 


24.1.1 (Gorenstein and Bass orders). Although Eichler orders may lose the property 
that all of its ideals are invertible, we may still insist on the invertibility of its dual. 
Recall (Definition 15.6.15) that the codifferent of an order is the lattice codiff(O) = of 
obtained as the dual of the trace pairing over R. We say O is Gorenstein if codiff(O) 
is invertible, or equivalently (Corollary 16.8.7) every sated left or right fractional O- 
ideal (lattice J C B with left or right order equal to O) is invertible. Hereditary orders 
are Gorenstein, since for a hereditary order every left or right fractional O-ideal (not 
necessarily sated) is invertible. 

Being Gorenstein is a local property because invertibility is so. An Eichler order is 
Gorenstein, but there are Gorenstein orders that are not Eichler. An order is Gorenstein 
if and only if its associated ternary quadratic form is primitive, i.e. the greatest common 
divisor of its coefficients is 1, or equivalently its values generate Z. 

We say O is Bass if every superorder O’ 2 O (including O’ = O) is Gorenstein. A 
Bass order is Gorenstein, but not always conversely. The fact that every superorder is 
Gorenstein reflects into good structural properties of a Bass order. Most importantly, 
a Zp-order O is Bass if and only if it contains either Zp X Zp or the ring of integers 
in a quadratic extension K 2 Q, (these order are sometimes called primitive; we call 
them basic). This embedded subalgebra makes it possible to calculate explicitly with 
the order, with important applications to the arithmetic of modular forms, a topic we 
pursue in the final part of this book. 


In summary, there is a chain of proper implications 
maximal => hereditary = Eichler > Bass = Gorenstein (24.1.2) 
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for orders O C B, and each of these implications is strict (<« in general). 
Given an order O, we construct its radical idealizer as 


OË = O, (rad O) = Op(rad O). 


We have O c OÏ, and equality holds if and only if O is hereditary. Iterating, we obtain 
a canonically attached sequence of superorders: 


O=0)¢€0,=0'¢---€O, (24.1.3) 


where O, is hereditary. A more refined classification of orders involves dissecting the 
chain (24.1.3) explicitly. 

There is one final way of classifying orders that extends nicely to the noncom- 
mutative context, due to Brzezinski. By way of analogy, we recall that orders in a 
quadratic field are characterized by conductor. Let K = Q(Vdx), let Zg be the ring 
of integers of K, and let dx € Z be the discriminant of Zg. An order S in K is of the 
form S = Z + fZx, where f € Zs, is the conductor of S (in its maximal order), and 
the discriminant of Sis d = f 2dx. Even in classical considerations, these orders arise 
naturally when considering binary quadratic forms of nonfundamental discriminant. 


Proposition 24.1.4. Let B be a quaternion algebra over Q and let O © B be an order. 
Then there exists a unique integer f(O) = 1 and Gorenstein order Gor(O) such that 


O = Z + f(O)Gor(O). 
Two orders O, O' are isomorphic if and only if f(O) = f(O’) and Gor(O) ~ Gor(O’). 


The order Gor(O) is called the Gorenstein saturation of O, and we call f(O) the 
Gorenstein conductor of the order (also sometimes called the Brandt invariant): an 
order is Gorenstein if and only if f(O) = 1, so this gives a ready supply of orders that 
are not Gorenstein. In particular, to classify all orders, via the operation of Gorenstein 
saturation, it is enough to classify the Gorenstein orders. 


Remark 24.1.5. The first attempt to tame the zoo of quaternion orders was Eichler 
[Eic36, Satz 12], who classified what he called primitive (our basic) orders. Later this 
was generalized by Brzezinski [Brz83a, §5], who also clarified certain aspects [Brz90, 
§1]. A nice summary of facts about quaternion orders is given by Lemurell [Lem201 1]. 


24.2 Gorenstein orders 


In this section we define the well-behaved Gorenstein orders. See Remark 24.2.24 for 
more context on the class of Gorenstein rings. 
Recall the definition of the codifferent (Definition 15.6.15): 


codiff(O) = O? := {a € B : trd(@O) C R} CB. 


We have codiff(O) a two-sided sated O-ideal with O C codiff(O) (Lemma 15.6.16), 
and disc(O) = [codiff(O) : O]r. We already saw in section 16.8 the importance of the 
following class of orders. 
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Definition 24.2.1. O is Gorenstein if codiff(O) is invertible. 


24.2.2. The property of being a Gorenstein order is local (O is Gorenstein if and only 
if O, is Gorenstein for all primes p), since invertibility is a local property. 


Proposition 24.2.3. The following are equivalent: 


(i) O is Gorenstein; 
(ii) codiff(O) is projective as a O-bimodule; 
(iti) OY = Homa(O, R) is projective as an O-bimodule; and 
(iv) All sated left or right fractional O-ideals are invertible as O-ideals. 


Proof. For the equivalence (i) © (ii), because codiff(O) is sated it follows from 
Theorem 20.3.3 that O is Gorenstein if and only if codiff(O) is projective as an O- 
bimodule. For (ii) © (iii), by Proposition 15.6.7, we have an isomorphism codiff(O) = 
Hom, (O, R) of O-bimodules over R. Finally, (i) © (iv) follows from Corollary 16.8.7. 

o 


24.2.4. We call in for relief as well Main Theorem 20.3.9: the equivalent sided notions 
(on the left and right) in Proposition 24.2.3 are also all equivalent. In particular, a 
suitably defined notion of left Gorenstein or right Gorenstein would also be equivalent. 


The Gorenstein condition can be detected on the level of norms as follows. 
Lemma 24.2.5. We have 
nrd(codiff(O)) discrd(O) € R, 
and O is Gorenstein if and only if equality holds. 
Proof. We refer to Proposition 16.4.3, and 16.4.5: we have 
[O : Olr 2 Nmgir(O*) = nrd(O*)?, 


with equality if and only if OF is locally principal. But by Lemma 15.6.17, we have 
[OË : Ole = disc(O) = discrd(O)?, and combining these gives the result. o 


Our next main result connects the Gorenstein condition to a property of the cor- 
responding ternary quadratic module. Let Q: M — L be a ternary quadratic module. 
We follow Gross—Lucianovic [GrLu2009, Propositions 6.1—6.2], and consider the odd 
Clifford bimodule. 


Proposition 24.2.6. Left multiplication gives a pairing 
clf°(Q) x Cf! (Q) > Clf!(Q)/M = NM @LY 
that induces an isomorphism 
Home(Clf°(Q), N°M 8 LY) > Cif! (Q) (24.2.7) 
of left Clf°(Q)-modules. 
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We have a similar argument on the right. 


Remark 24.2.8. If R is a Dedekind domain, then choosing a pseudobasis for M we may 
write M = ae, + bez + ce3 for fractional ideals a, b, c, and without loss of generality 
may take L = [a fractional ideal. Then NM = abe(e; A e2 A e3) = abc is just the 
Steinitz class of M, and 3M @ LY ~ abc“! as in (22.5.10). Restricting to this case 
is still quite illustrative. 


Proof. By construction of the Clifford algebra, we have as R-modules that 
CIf'(Q) ~= M @ ( M 8 LY), 


so Clf 'Q) /M~ \@M@LY. Multiplication in Clf(Q) induces a pairing that induces a 
homomorphism of Clf°(Q)-bimodules by associativity of multiplication in Clf(Q). To 
conclude that the pairing induces an isomorphism, we can argue locally and suppose 
that M ~ R? with basis e1, e2, e3 and L ~ R is the quadratic form 


Q(x, y, z) = ax? + by? + cz? + uyz + vxz + wxy. 
Then 1, e2€3, €3€1, €1€2 is an R-basis for clf°(Q) and 
Clf!(Q) = (Re; + Rez + Re3) + Ree2e3 


with Clf!(Q)/M = Rejere3. 
We then compute the dual basis of Clf 'Q) to that of clf°(Q) as 


€1€2€3 — Ue, — Ven — WE3, €], €2, €33 (24.2.9) 
for example, 
e2€3(€1e2€3) = e2(—e1e3 + V)ere3 = —e2e1e3(e2€3) (mod M) 
so 
= 2 
e2€3(e€1e2€3 — Ue, — Ver — We3) = e€1(€2€3)° — Ue e263 
= e; (ue2€3) — uejerxe3 =O (mod M). 
The other products can be computed in a similarly direct fashion. o 


Recall (Definition 9.7.14) that Q: M —> L is primitive if Q(M) generates L as an 
R-module. 


Theorem 24.2.10. O is Gorenstein if and only if its associated ternary quadratic 
module Q = Wo is primitive. 


Proof. The statement is local, so we may suppose that R is a local domain with maximal 
ideal p, and that M ~ R? and L = R, so AM 8 L“ = R. Let J = Clf!(Q) be the odd 
Clifford bimodule, thought of as a left O-module. By Proposition 24.2.6 (specifically, 
(24.2.7)), we have cif! (Q) ~ Homg(O, R) as left O-modules. By Proposition 24.2.3 
and 24.2.4 (or repeating the argument on the right), we want to show that J is principal. 
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Suppose that Q is primitive. Let a = xe, + yez + zez. We then compute that 


1 xX 0 cz t+ vx —(by + uz) ei 
e283 | |y —=cz vy (ax +vz+wy) ez 
€3e| z (by+uz) -(ax+wy) 0 e3 
€12 0 x y z €1€263 


and the determinant of the matrix in the middle is precisely Q(x, y, z)?. So O(a) € R* 
if and only if «0 = J. 
Conversely, suppose that Q is not primitive. Then Q = 0 (mod p). If @ = xe; + 
yez + zez + tejeze3 with x, y, z,t € R, then 
æere3 = xe1e2e3 (mod p) 


and symmetrically with the other products, so œO has rank < 2 over R/p, and it follows 
that «O # J for all a. m 


Corollary 24.2.11. Let R be a Dedekind domain. Then the associations 
Nondegenerate primitive ternary Gorenstein quaternion 
| quadratic modules over R | > | orders over R | 
up to twisted similarity up to isomorphism 
Q = CIO) 
nrd#(O) HO 


(24.2.12) 


are mutually inverse discriminant-preserving bijections that are also functorial with 
respect to R. 


Proof. We restrict the bijection in Main Theorem 22.5.7 and apply Theorem 24.2.10. 
o 


Remark 24.2.13. In view of Corollary 24.2.11, the issues in the correspondence with 
ternary quadratic forms for non-Gorenstein orders amounted to the failure to account 
for the codomain of the quadratic module: non-Gorenstein orders are obtained from 
quadratic modules Q: M — L where Q(M) Ç L. 


24.2.14. From 23.4.12 and Theorem 24.2.10, we conclude that every Eichler order is 
Gorenstein; a direct proof is given in Exercise 24.1. 


Therefore, non-Gorenstein orders abound: indeed, any order corresponding to 
an imprimitive form will do. More generally, we construct a canonically associated 
Gorenstein order containing a given order as follows. 


Proposition 24.2.15. There exists a unique ideal f{(O) © R and unique Gorenstein 
order Gor(O) € B such that 


O = R + f(O) Gor(O). (24.2.16) 
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In fact, we have 
f(O) = discrd(O) nrd(O#) 
Gor(O) = R + nrd(O*)“!OF OF. 


Before we begin the proof, we make a definition and then consider the key ingredient 
of the proof: how rescaling the module affects the even Clifford algebra. 


Definition 24.2.17. In (24.2.16), we call f(O) € R the conductor and Gor(O) the 
Gorenstein saturation of O. 


Remark 24.2.18. Brzezinski [Brz83a] writes 6(O) instead of f(O) and calls it the 
Brandt invariant. Other authors call Gor(O) the Gorenstein closure of O, but this 
terminology may be confusing: Gor(O) is not necessarily the smallest Gorenstein order 
containing O (see Exercise 24.11). 


24.2.19. Suppose that the ternary qudratic module Q: M — L corresponds to a 
quaternion R-order O. For a nonzero ideal a C R, we define Q(a) : M > aT! L to be 
just Q but with values taken in a~! 2 L. We claim that for all R-ideals a, we have under 
the correspondence 


O(a): M>a'L Ola) :=R + a0. (24.2.20) 


The fact that Clf°(O[a]) = R + aO is visible from the construction of the even Clifford 
algebra (22.2.3); it is also visible from the description (22.3.25) in Example 22.3.24. 
In the other direction, we have 


nrd#(O(a)): (O(a)*)° = atO? > discrd(O(a))"! = a7? discrd(O)! 
and now we twist by a to get 
a @ nrd(O(a)*): (O08)? > at! discrd(O)!. (24.2.21) 


Proof of Proposition 24.2.15. We argue using ternary quadratic modules: our proof 
amounts to replacing a potentially imprimitive form with a primitive form, following 
Theorem 24.2.10. 

Let Q = yo : M —> L be the ternary quadratic module associated to O, well- 
defined up to twisted similarity. We may take L = [ C L @p F ~ F and we do so for 
concreteness; accordingly, suppose [is a fractional ideal of R. Then Q(M) = nC Lisa 
finitely generated nonzero R-submodule; since R is a Dedekind domain, n is invertible. 
Let 

f=f(O):=nl'cR. 


Let Gor(Q) = Q(f) : M — n be the primitive ternary quadratic module obtained 
by restricting the codomain. Then Clf°(Gor(Q)) is a Gorenstein order by Theorem 
24.2.10, and 

Clf°(Q) = R + f(O) Clf°(Gor(Q)); 


by (24.2.20), so we let Gor(O) := Clf°(Gor(Q)). Uniqueness follows directly from 
(24.2.20): if O = R + aO’ and O’ is Gorenstein, then Q(a~') : M — al is primitive 
and Q(M) = n = al, thus a = f(O) and O’ = Clf°(Q(a)) = Gor(O). 
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To prove the remaining statements, we recall Corollary 22.4.15 to get 
O=R+ discrd(O)O*O# 
in all cases, and the primitivity of 
nrd(O(f-!)*) @ ft: (0%)° > fdiscrd(O)! 
as in (24.2.21) is equivalent to 
nrd((O*)°) = f discrd(O)! 


i.e. f = discrd(O) nrd((O#)®). Finally, nrd(O#) = nrd((O#)°) is proven in Exercise 
22.10. o 


Lemma 24.2.22. Let O’ C B be an R-order. Then O = O’ as R-orders if and only if 
Gor(O) ~ Gor(O’) and f(O) = f(O’). 


Proof. Immediate from the uniqueness claim in Proposition 24.2.15. o 


The translation of the Gorenstein property in terms of primitivity of the ternary 
quadratic form makes it quite accessible. For example, we have the following result 
that shows that the Gorenstein condition is stable under base change. 


Proposition 24.2.23. Let K 2 F be a finite extension and let S be the integral closure 
of R in K. Then O is a Gorenstein R-order if and only if O8p S is a Gorenstein S-order. 


Proof. Let Q: M —> L be the ternary quadratic module corresponding to O; denoting 
extension of scalars by subscripts, we have Qs : Ms — Ls corresponding to Os. 
We want to show that Q is primitive if and only if Qs is primitive, which is to say 
Q(M) = L if and only if Qs(M) = Q(M)s = Ls, and this statement is true as it holds 
for fractional R-ideals (Exercise 23.2). o 


Remark 24.2.24. Gorenstein rings were introduced by Gorenstein [Gor52] in the con- 
text of plane curves (the results of his Ph.D. thesis); Bass [Bas62, Footnote 2] writes: 
“After writing this paper I discovered from Professor Serre that these rings have been 
encountered by Grothendieck, the latter having christened them ‘Gorenstein rings.’ 
They are described in his setting by the fact that a certain module of differentials is 
locally free of rank one.” Bass [Bas63] gives a survey of (commutative) Gorenstein 
rings, noting their ubiquity; see also the later survey by Huneke [Hun99]. Gorenstein 
rings are truly abundant: they include coordinate rings of affine plane curves and 
curves with only double points as singularities, complete intersections, and integral 
group rings of finite groups. 

The above-mentioned paper of Bass [Bas63] also gave rise to the class of epony- 
mous orders in which every superorder is Gorenstein; over a complete DVR, these 
orders were completely classified (and related to hereditary and Gorenstein orders) by 
Drozd—Kirichenko-Roiter [DKR67]. Gorenstein orders in the context of quaternion 
algebras, were studied by Brzezinski [Brz82]. (Brzezinski [Brz87] also considers more 
general orders in which every lattice is locally principal: but for quaternion algebras, 
these are again the Gorenstein orders.) 
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24.3 Eichler symbol 


Just as local quadratic extensions are classified as being either ramified, inert, or split, 
it is helpful to have a similar classification for quaternion orders. 

We first work locally, and we suppose until further notice that R is local (as in 
23.201); 


24.3.1. The k-algebra O/radO is semisimple and has a standard involution (since 
rad O is preserved by it), so by Example 7.4.9 this standard involution is nondegenerate; 
by classification (Main Theorem 4.4.1 and Theorem 6.4.1) we have one of three 
posibilities: O/radO is either k, a separable quadratic k-algebra, or a quaternion 
algebra over k. 
We give symbols to each of the possibilities in 24.3.1 as follows. 

Definition 24.3.2. Let J := rad O. We define the Eichler symbol 

*, if O/J is a quaternion algebra; 

5 ifO/J = k x k, and we say O isresidually split; 

(9) := 40, ifO/J = k, and we say Ois residually ramified; and 

-1, ifO/Jis a (separable) quadratic field extension of k, and 

we say O isresidually inert. 


O 
For formatting reasons, we will also write ($) = (O | p). The similarity of the 


Eichler symbol to other quadratic-like symbols is intentional: because of the arguments 
to the symbol, it should not be confused with the others. If the reader finds this 
overloading of symbols unpleasant, they may wish to use the symbol £ẹ(O) instead. 


24.3.3. Recall the definition of the discriminant quadratic form 
A: BF 

J (24.3.4) 

A(@) = trd(a@)* — 4 nrd(a) 


that computes the discriminant of F[a] = F[x]/ (x? — trd(x)x + nrd(x)) in the basis 
1, a. The form factors through a map A: B/F —> F. 


Suppose that #k < oo. Fora € R, let *) denote the generalized Kronecker symbol, 
defined to be 0, 1,—1 according as if F ef (x? — a) is ramified, split (isomorphic to 
F x F), or inert. If chark # 2, then 2 is the Legendre symbol. We then have the 
following characterization (Exercise ot A): 


A 
(a) (2) = x if and only if (*) takes on all of the values —1, 0, 1 for a € O. 


Ao) 


(b) (9) = € if and only if | p takes the values {0, €} for a € O. 
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We now consider each of possible values of the Eichler symbol in turn. 
O sae es 
24.3.5. We have p = x if and only if radO = pO, by dimension considerations. 


O 
If further #k < oo, then m = x if and only if O ~ M2(R), since then the only 


quaternion algebra over k is M2(k), and we can lift matrix units using Hensel ’s lemma. 


Lemma 24.3.6. The order O is Eichler if and only if O is maximal or residually split. 


Proof. If O is Eichler, then either O is maximal or O/radO = k x k by (23.4.17). 
Conversely, a maximal order is an Eichler order by definition, so suppose O is residually 
split, i.e., O/radO = k x k. Then (as in Lemma 20.6.8), the nontrivial orthogonal 
idempotents of k x k lift to orthogonal idempotents e1,e2 € O. Since eļe2 = 0, 
immediately B ~ M2(F). Conjugating in B, we may suppose e; = e11 and therefore 
€2 = en, and then the result follows from Proposition 23.4.3. 

For fun, here is a second proof. We have a decomposition O = J 8h = Oe; 90Oe2. 
Thus O = OL(4) N OL (h). Tensoring up to F, we get B = (1))r ® (h)r, so by 
dimensions we have (J})r = (h)r = F? the simple B-module, and each J; is isomorphic 
to an R-lattice M; © F*. Thus O,(/;) = Endr(M;) = Mo(R), and each O,(/;) is 
maximal, so O is the intersection of two maximal orders. oO 


24.3.7. Residually inert orders in a division quaternion algebra B over Q, were studied 
by Pizer [Piz76b], and he described them as follows. Let K = Qp2 be the unramified 
extension of Q,. Then K —> B. Consider the left regular representation B > M2(K); 
it has image 
| S sf) with z,w € K (24.3.8) 
pw z 
The valuation ring of B consists of those with z, w € Zp2 where Zp2 is the valuation 
ring of K. Pizer then considers those orders with z € Z,, and p” | w. He [Piz80a, 
Remark 1.5, Proposition 1.6] connected the residually inert and residually split orders 
by noting the striking resemblance between (24.3.8) and the standard Eichler order, 
remarking: 


Thus O, and O, [the Eichler order and the Pizer order] are both essentially 


subrings of rl Rl fixed by certain (different!) Galois actions induced 


by the Galois group of L/Qp and thus they can be viewed as twisted 
versions of each other. Hence O and O’ are locally isomorphic at all 
primes q + p while at p they are almost isomorphic. Thus it should not 
be too surprising that there are close connections between [them]. 


Pizer works explicitly and algorithmically [Piz80a] with residually inert orders, with 
applications to computing modular forms of certain nonsquarefree level. 


24.3.9. We can also interpret the Eichler symbol in terms of the reduction of the 
associated ternary quadratic form Q. 
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We have (O | p) = x if and only if Q mod p is nondegenerate, defining a smooth 
conic over k. 

If (O | p) = 1, then by Lemma 24.3.6, O is Eichler, and Q mod p ~ Q(x, y) = xy 
is degenerate of rank 2 by 23.4.12, cutting out two intersecting lines over k; and 
conversely. 

Suppose (O | p) = —1. Let i € O generate the quadratic field £ = O/ rad O over k, 
let K = F(i) and let S be the integral closure of R in K. Then K is unramified over F, and 
pS is the maximal ideal of S. Now the S-order Os := O &pr S has Os/ rad Os = £ xf, 
and (Os | pS) = 1. Therefore Q mod p over £ is degenerate of rank 2, so the same is 
true over k, and since we are not in the previous case, it is defined by an irreducible 
quadratic polynomial (the norm form from £ to k). Therefore Q mod p cuts out two 
lines defined over £ and conjugate under Gal(¢| k). In particular, a residually inert order 
is Gorenstein. 

The only possibilities that remain are that Q mod p is identically zero or has rank 
1 (defined by the square of a linear factor), and correspondingly cuts out the whole 
projective plane or a double line. These are the cases (O | p) = 0. 


24.3.10. It follows from 24.3.9 that the ternary quadratic form associated to a residually 
inert order is similar to 


Q(x, y, z) = nf x? +Nmxyr(z + yi) 


just as in (23.2.7); we have e odd if and only if B is a division algebra and e even if 
and only if B is split. 

The residually ramified orders do not admit such a simple classification; we will 
pursue them further in the coming sections. 


Remark 24.3.11. Let R be a DVR, let O be a Gorenstein R-order, and let Q be a ternary 
quadratic form over R representing the similarity class associated to O. Then Q is 
primitive and so defines an integral model C of a conic C C P? over F = Frac R. By 
discriminants, this conic has good reduction if and only if O is maximal. Moreover, we 
saw by direct calculation that this conic has the simplest kind of bad reduction—regular, 
with just one node over k—if and only if O is hereditary. This is no coincidence: in 
fact, C is normal if and only if C is Bass. See Brzezinski [Brz80] for more on the 
relationship between integral models of conics and quaternion orders and the follow- 
up work [Brz85] where the increasing sequence of Bass orders ending in an hereditary 
order corresponds to a sequence of elementary blowup transformations. 


Another motivation to study the Eichler symbol is that it controls the structure of 
unit groups, as follows. 


Lemma 24.3.12. Let #k = q. Then 1 + pO C O%, and 

qalq- 1)}(q+1), if(Olp) =* 
a(q- 1)’, if(O|p) = 1; 
gq), if(O | p) = -1; 
q- 1), if(O |p) = 0. 


[OX :1+p0] = 
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Proof. Since J C pO, if u € 1+ pO then u — 1 is topologically nilpotent, 
hence u € O*. 

Now the indices. If (O | p) = x, then we are computing the cardinality # GL2(k) = 
(q4? — 1)(q* - q). If (O | p) = 1, then O/J = k x k and dim, J/pO = 2, so the index is 
(q — 1)*q’. Similarly if (O | p) = —1, we get (q? — 1)q?. If (O | p) = 0, then O/J = k 
and so the index is (q — 1)q°. o 


We conclude with a global definition, and so we restore R to a Dedekind domain. 


Definition 24.3.13. For a nonzero prime p C R, we define the Eichler symbol at p to 


` (2) =(%} 


i.e., the Eichler symbol of the completion at p. 


O 
We say O is locally residually inert if (9) € {*, —1} for all primes p. 


The analogously defined locally residually split orders already have a name: they 
are the Eichler orders that are not maximal. 


24.4 Chains of orders 


We have a few more classes of orders to consider, but before we continue our tour we 
pause to consider an aspect of the more general classification: we seek to put every 
order in a chain of superorders, ending in a maximal order. 

Suppose throughout this section that R is local. 


Definition 24.4.1. The radical idealizer of O is 
O! := O, (rad O) N Or(rad O). 


24.4.2. By Exercise 20.2, we have O, (rad O) = Or(rad O) = OÏ, so the symmetric 
definition can be replaced by either order in the intersection. 


24.4.3. We recall our motivation to study extremal orders 21.2.1: O? radically covers 
O, and by Proposition 21.2.3 we have O4 = O if and only if O is extremal. By Theorem 
21.5.1, O is extremal if and only if O is hereditary. Iterating, we have a canonically 
associated chain of orders 


O =O) G01 = OF E--- ÇO, (24.4.4) 


terminating in an order O, that is hereditary. (By 23.3.1 and Proposition 23.4.3(iv), 
either O, is maximal or O, is contained in exactly two possible maximal orders.) We 
call O, the hereditary closure of O. 


24.4.5. Suppose (O | p) = 1, i.e., O is an Eichler order (Lemma 24.3.6). Suppose O 
has level p°. Then O4 is an Eichler order of level p°~? from (23.4.18)—it had to be 
Eichler of some level by Corollary 23.4.10. So the chain (24.4.4) is of length |e/2| with 
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quotients dim,(O;/O;+1) = 2. On the Bruhat-Tits tree, the Eichler order corresponds 
to a path of length e by 23.5.16, and O! is the path of length e — 2 obtained by plucking 
away the vertices on the ends. (If desired, one can refine this chain by squeezing in an 
extra Eichler order in between each step.) 


24.4.6. If O = R + pO’ for an order O’, then O4 = O’, by Exercise 24.5. 

In general, write O = R + pf Gor(O) where pf = f(O) is the Gorenstein conductor 
of O and Gor(O) is the Gorenstein saturation as in Proposition 24.2.15. Then the chain 
of radical idealizers begins 


O Ç O1 = R + p^! Gor(O) Ș -++ € Of = Gor(O). 
For each i, we have dim;(O;/O;+1) = 3. 
We next consider the chain of superorders over a (local) residually inert order. 


Proposition 24.4.7. Let O be a residually inert R-order. Then the following statements 
hold. 


(a) rad O = rad O1 N O = pO’. 
(b) Suppose O is not maximal. Then O? is the unique minimal superorder of O. 
Moreover, O” is residually inert and we have [O4: Oly = p>. 


Proof. We begin with the first part of (a), and we show rad O! N O = radO. As 
in 21.2.1, OÏ is a radical cover so radO C rad O" N O. But arguing as in the proof 
of Lemma 21.2.4, we know that rad O" is topologically nilpotent as a O’-ideal and 
p’O’ C pO for large r, so rad O? N O is topologically nilpotent as a O-ideal, and 
radO4N O C radO. 

We therefore have a map 


O/rad O = O4/rad O’; (24.4.8) 


since O/radO is a quadratic field, we must have (O8 | p) = +,-1, i.e., O8 is either 
maximal or residually inert—and in the latter case, (24.4.8) is an isomorphism. 

Let i € O generate £ = O/rad O as a quadratic extension of k. Let K = F(i), and 
let S be the integral closure of R in K. Then K is (separable and) unramified over F. 
We claim that Os = O pr S is residually split. Indeed, we have an isomorphism of 
k-algebras T 

(O/pO) @ £ => Os/pOs 
and since f is separable over k, an identification (Exercise 7.19) 
rad((O/pO) @, £) = rad(O/p) 8 £ 
giving (rad O)s = rad(Os) and 
O/rad O & £ = Os/rad Os. 


But O/rad O = £ 8x € = £ x £. This shows (Os | pS) = 1. 

To conclude, the statements that rad O”NO = pO8 in (a) and that [OĦ : O]r = p? in 
(b) hold for Os by (23.4.18), so they hold for O. Minimality follows from Proposition 
24.4.12. o 
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24.4.9. As a consequence of Proposition 24.4.7, if O is a residually inert R-order with 
discrd(O) = pê, then the radical idealizer chain 


OC Op SO" C400, 


has length r = |e/2], with O, maximal, and dim,(O;/Oj;+1) = 2 for all i; accordingly, 
the case e even occurs exactly when B ~ M2(F) and e odd occurs exactly when B is a 
division algebra. 


We conclude the section showing that under certain hypotheses, the radical idealizer 
is a minimal (proper) superorder. The results are due to Drozd—Kirichenko-Roiter 
[DKR67, Propositions 1.3, 10.3]; we follow Curtis—Reiner [CR81, Exercises 37.5, 
37.7]. 


Definition 24.4.10. Let 7’ C J be left fractional O-ideals in B. We say I’ is (left) 
hypercharacteristic in / if for every left O-module homomorphism ¢: I’ — I we 
have 1’) CI’. 


Lemma 24.4.11. The map O’ > I’ = (0# gives an inclusion-reversing bijection from 
the set of R-superorders O’ 2 O to the set of right hypercharacteristic R-sublattices 
r co. 


Proof. Inclusing-reversing follows from Lemma 15.6.2(a). By the proof of Lemma 
17.3.3, we have a natural identification Homo(/’, I) = (I : I’)r given by right multi- 
plication. 

We first show that if O’ 2 O, then (O’)# c O is right hypercharacteristic; so we 
verify (OË : (O’)#)p c (OË. If (Oa c OF, then 


((O’)Fa)# = a '0’ 2 (0# = O 


so aO c O, thus a = a-1 € O = ORO’). Conversely, given I’ © of 
hypercharacteristic, we have œ € (J’)* if and only if trd(’a) C R if and only if 
æ €(I’: O)p = (I: I’) = ORC’), so (IË = Og’) is an R-order. m) 


Proposition 24.4.12. Let R be local and let O be a Gorenstein R-order that is not 
maximal and such that O is indecomposable as a left O-module. Then there is a unique 
minimal R-superorder O' 2 O and O' = O*. 


Proof. Since O is projective indecomposable as a O-module, and O is Gorenstein 
so OË is projective, we must have OË = Oa for some a € B*. Let J = rad O. By 
Corollary 20.6.9, JOË is the unique O-submodule of OË. If JO#8 = OË for some 
B € B*, then J = OaBae!, so O.(J) = O and O is extremal; but an extremal order 
that is indecomposable is already maximal, a contradiction. Therefore, if J o# BG of 
then the inclusion is strict, and by maximality we have J otp c JO#; that is to say, 
JO” is hypercharacteristic in of. By Lemma 24.4.11, and inclusion-reversing, we see 
that 
O' = (JO#)# = (JOa)! = a`! JË 


is the unique minimal R-superorder, and O’ = Og(J#) = OL (J) by Proposition 15.6.6. 
Oo 
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24.4.13. Let R be local and O be a Gorenstein R-order. By Lemma 20.6.8, the condition 
that O is indecomposable is equivalent to the condition that O/ rad O is simple as a 


O 
k-algebra, i.e., | — | # 1 or equivalently, O is not Eichler. We considered the Eichler 


case in 24.4.5, so suppose further that O is not Eichler (and in particular, not maximal). 
Then OË 2 O is the minimal R-superorder over O by Proposition 24.4.12. If O? is 
not itself Gorenstein, then we may associate its Gorenstein saturation Gor(O4). In this 
way, we obtain a canonical chain of Gorenstein superorders of O. 


24.5 Bass and basic orders 


Given the importance of the Gorenstein condition, we will want to give a name to the 
condition that every superorder is Gorenstein. 
In this section, we restore our hypothesis that R is a Dedekind domain. 


Definition 24.5.1. An order O is Bass if every order O’ 2 O is Gorenstein. 


24.5.2. Since the Gorenstein condition is local by 24.2.2, the Bass condition is also 
local. Moreover, an Eichler (i.e., residually split) order is Bass, because Eichler orders 
are Gorenstein by 24.2.14, and every superorder of an Eichler order is Eichler (Corol- 
lary 23.4.10). 


For the rest of this section, we investigate the local structure of Bass orders, and 
we suppose that R is local. We do not use the following proposition, but we state it for 
context. 


Proposition 24.5.3. Suppose R is local. Then the following are equivalent: 


(i) Every O-ideal is generated by two elements; 
(ii) O is Bass; and 
(iii) Every O-lattice is isomorphic to a direct sum of O-ideals. 


Proof. See Drozd—Kirichenko—Roiter [DKR67, Propositions 12.1, 12.5] or Curtis— 
Reiner [CR81, $37]; the implications (i) > (ii) > (iii) always hold, and the implication 
(iii) > (i) holds because B is a quaternion algebra. o 


The residually inert orders, those with Eichler symbol (O | p) = —1, give a source 
of Bass orders, following Brzezinski [Brz83a, §3]. 


Proposition 24.5.4. Let O be a residually inert R-order. Then O is Bass. 


Proof. We begin by arguing as in the proof of Proposition 24.4.7: letting i € O generate 
the field extension O/P 2 k, we then make a base extension to K = F(i) where i € O 
generates O/ rad O: then Os is residually split, i.e., Os is Eichler. 

We then appeal to Proposition 24.2.23: the Gorenstein condition is stable under 
base change. So for every superorder O’ 2 O, we have a superorder O% 2 Os and so 
O; is Gorenstein by 24.5.2, so O’ is Gorenstein. Therefore O is Bass. o 


24.5. BASS AND BASIC ORDERS 407 


24.5.5. Combining 24.5.2 with Proposition 24.5.4, we see that if O is not a Bass order, 
O 

then | — | = 0. 
p 


Another rich source of Bass orders are the basic orders. 


Definition 24.5.6. We say O is basic if O contains a maximal R-order in a maximal 
commutative F-subalgebra K C B. 


Remark 24.5.7. Local basic orders in a division quaternion algebra were studied by 
Hijikata—Pizer-Shemanske [HPS89b]; they gave the global orders O such that Op) is 
basic if p € Ram(B) and Eichler if p ¢ Ram(B) the name special. The remaining types 
of local basic orders (residually ramified and residually inert for the matrix ring) were 
studied by Brzezinski [Brz90], and further worked on by Jun [Jun97]. The role of the 
embedded maximal quadratic R-algebra S is that one can compute embedding numbers 
for them (see for example the epic work of Hijikata—Pizer-Shemanske [HPS89a]), and 
therefore compute explicitly with the trace formula. 

Other authors use the term primitive instead of basic, but this quickly gets confusing 
as the word primitive is used for the ternary quadratic forms and the two notions do not 
coincide. The following propositions show that the sound of the word basic conveys 
the right meaning. 


Proposition 24.5.8. Suppose R is local. Then O is basic if and only if O is Bass. 


Proof. See Chari-Smertnig—Voight [CSV2019, Theorem 1.1]. To illustrate the proof, 
we explain here only the case where 2 € R*, following Brzezinski [Brz90, Proposition 
1.11]. 

Let Q: M > R be the ternary quadratic form associated to O, a representative up 
to twisted similarity chosen so that Q is integral. We argue explicitly with a quadratic 
form (22.3.5) the multiplication table of the order as a Clifford algebra in a good basis 
(22.37): 

First, suppose O is Bass. Then O is Gorenstein, so after rescaling we may suppose 
Q ~ (-1, b, c}, and the multiplication table reads: 


i? =-be jk=-i 
pre ki = bj (24.5.9) 
k? =b ij =ck 


If ordy(b) < 1 or ordy(c) < 1, then correspondingly R[j] or R[k] are maximal (again by 
valuation of discriminant), so suppose ord, (b), ordy(c) > 2. Consider the R-submodule 
O’ generated by 1, i’, j, k with i’ = i/z. Then the integrality of the multiplication table 
remains intact so O’ is an order, with new coefficients a’ = 2, b’ = b/n, c’ = c/n. 
The corresponding ternary quadratic form Q’, by our hypotheses on valuations, is now 
imprimitive and O’ is not Gorenstein, and this contradicts the fact that O is Bass. 
Conversely, suppose O is basic. If the maximal R-order S C O is in an unramified 
subalgebra K = FS C B, then it generates an unramified extension over the residue 
field and (O | p) # 0, and then by 24.5.5 we know O is Bass. Now at least one of the 
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products bc, ac,ab has valuation 1 because O contains a maximal R-order; without 
loss of generality we may take, after rescaling, a = —1 and ordp(b) = 1 < ordy(c) with 
S = R[k]. Therefore Q is primitive and O is Gorenstein. 

We now compute that rad O = (z, i, j, k} and so 


O’ = O (rad O) = R + pli + Rj + Rk 


is the minimal superorder, and it is still basic. By minimality, to show that O is Bass 
it suffices to show that O’ = O" is Bass. In our new parameters we have a’, b’,c’ = 
=x, b/n, c/n, so ordy(b’) = 0 and ord,(c’) = ordy(c) — 1. Swapping i’, j’ interchanges 
a’, b’, and we are back in the original situation but with ord,(c) reduced. By induction, 
we can continue in this way until ordy(c) = 0, when then R[j] is a maximal order in 
an unramified extension, and we are done. oO 


Proposition 24.5.10. Suppose F = Frac(R) is a number field. Then for the R-order O, 
the following are equivalent: 
(i) O is basic; 
(ii) Ow) is basic for all primes p of R; and 
(iii) O is Bass. 
In other words, for orders in a quaternion algebra over a number field, being basic 
is a local property and it is equivalent to being Bass. 


Proof. See Chari—Smertnig—Voight [CSV2019, Theorem 1.2], building on work of 
Eichler [Eic36, Satz 8] for the case F = Q. o 


Remark 24.5.11. One can similarly define basic orders for a general Dedekind domain 
R, but the preceding results are not known in this level of generality. 


24.5.12. We established several other important features along the way in Proposition 
24.5.8 that we now record. Suppose R is local with 2 € R* and suppose that O is 
a residually ramified Bass (i.e., basic) order. Then the quadratic form associated to 
O is similar to (—1, b,c) with ordp(b) = 1 < ordp(c), and so the multiplication table 
(24.5.9) holds. The unique superorder O" has [O" : O]r = p and associated ternary 
quadratic form (—1, b, —c/b) (see Exercise 24.13). 


4 
Corollary 24.5.13. If O? is not hereditary, then (2) = (2). 


Proof. Since O} is not hereditary, we cannot have O maximal, so (O | p) = 1, 0, —1. If 


O 
2) = 1, then O is Eichler, and so too are its superorders. If | — | = —1 and O4 is not 


ot O 
maximal, then (Z) = —1 by Proposition 24.4.7 (b). For the case (9) = 0, we appeal 
to 24.5.12. o 


We can repackage what we have done for basic orders to give another description 
in terms of its hereditary closure. 
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Proposition 24.5.14. Let O be a basic, nonhereditary R-order with discrd(O) = p”. 
Suppose 2 € R*, and let S C O be a maximal R-order in the F-algebra K. Let J be the 
Jacobson radical of the hereditary closure of O. Then the following statements hold. 


(a) Suppose O is residually inert. Then O = S + J™ where m = n/2,n— 1 according 
as B is ramified or split. 
(b) Suppose O is residually ramified. Then O = S + J™ wherem=n-1. 


Proof. The statement follows by induction using the explicit descriptions of these 
orders in 24.3.7, 24.5.12, and 24.5.12. See Brzezinski [Brz90, Proposition 1.12]. O 


24.6 Tree of odd Bass orders 


To conclude this chapter, we draw a picture of the containments of Bass orders. 
Suppose R is local with finite residue field k and 2 € R*. We put together the radical 

idealizer chains in the residually split 24.4.5, residually inert 24.4.9, and residually 

ramified 24.5.12 cases. The resulting tree of Bass orders is shown in Figure 24.6.1. 


discrd (O) 

B ~ M2(F) split I B ramified 

R 

M2(R) 

det p 

° p? 
TEREE. 
e e e e pî e | 


(2) = —1 0 1 * —1 0 
p 


res. inert res. split res. inert 
(728) AL Nmx,,|F (Eichler) (r ett) J. Nmx,,,|F 
(r°) L H 


Figure 24.6.1: Tree of local Bass orders, odd characteristic residue field 


Each vertex of this graph represents an isomorphism class of Bass order; there is an 
edge between two vertices if and only if there is a minimal containment between them. 
In each case, such a containment is given by the radical idealizer except when the order 
is residually split (in which case it hops by two, skipping the minimal superorder). 

For the trees when 2 is a uniformizer in R, and many other explicit calculations, 
see Lemurell [Lem2011, §5], as well as Pacetti—Sirolli [PS2014, §5]. 
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Exercises 


Unless otherwise specified, let R be a Dedekind domain with F = Frac Rand let O € B 
be an R-order in a quaternion algebra B. 


1. 


> 3. 


> 4, 


> 5. 


Show that every Eichler order O is Gorenstein by showing OF is locally principal 
by direct computation. 


. Show that codiff(O) is right invertible if and only if 


codiff(O)(codiff(O)”)# = O 
and codiff(O) is left invertible if and only if 

(codiff(O))* codiff(O) = O. 
[Hint: show that (codiff(O)~!)# = codiff(O)?.] 
Suppose R is local. Show that if O is an R-order and B is a division algebra, 
then (9) = 1,0. 

p 
Suppose R is local with finite residue field. Recall the discriminant quadratic 
form and the generalized Kronecker symbol 24.3.3. 
O A 

(a) Show that ($) = x if and only if (=) takes on all of the values —1, 0, 1 


p 
for a € O. 


A 
(b) For e = —1,0, 1, show that ($) = € if and only if (*) takes the values 
{0, e} for a € O. 


Let R be local and suppose O = R + pO’ for an order O’. Show that rad O = pO’ 
and OË = O’. [Hint: Argue as in 23.4.15.] 


. Let Q: M — L be a ternary quadratic module, and let a be a fractional ideal. 


Write aQ = Q 8a: M 8a —> L @a’ for the twist. Show that there is a bijection 
C1Q «e ClaQ, and conclude that there is a bijection TypO « Typ Gor(O), 
where Gor(O) is the Gorenstein saturation. 


(a) Show that if discrd(O) is cubefree (i.e., there is no prime p of R such that 
p° | discrd(O)) then O is a Bass order. 


O 
(b) Show that there is a (local) Gorenstein order O with ($) = 0 that is not 


Bass with discrd(O) = pf. 


O 
. Suppose that O is Gorenstein with (9) + 1. Let O’ be the hereditary closure 


of O. Show that Ngx(O) < Ngx(O'), and further that equality holds when O is 
residually inert. 


. Let R be local and let O be a residually ramified quaternion R-order that is 


Gorenstein but not a Bass order. Let O! be the unique minimal order containing 
O. Show that f{(O*) = p. [See Brzezinski [Brz83a, Lemma 4.4].] 
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10. Let R be local with 2 € R*, and let Q = (-1, b,c): R? > R with 0 = ord, (b) < 
ordy(c). Let O = clf°(Q) be its even Clifford algebra. Show that 


11. Let R be local, let O = Mo(R), let O; be the standard Eichler order of level 
p° and let O2 = R + p°O. Show that O2 Ç O; Ç O, that Gor(O2) = O, but 
Gor(O,) = O;; conclude that the Gorenstein saturation is not (necessarily) the 
smallest Gorenstein superorder. 


12. Let R be local and let O be the standard Eichler order of level p° with e > 2. 
Show that O contains no integrally closed quadratic R-order that is a domain 
(even though O contains R x R). 


> 13. Let R be local with 2 € R”. 


(a) Show (using the proof of Proposition 24.5.8) that O is a local Bass order 
with | —]} = 0 if and only if its corresponding ternary quadratic form is 


similar to (—1, b,c) with ordy(b) = 1 < ordy(c). 
(b) In case (a), show that the minimal overorder O’ corresponds to the (simi- 
larity class of) ternary quadratic form (—1, b, —c/b). 
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Chapter 25 


The Eichler mass formula 


In Part II of this text, we investigated arithmetic and algebraic properties of quaternion 
orders and ideals. Now in Part III, continuing this investigation we turn to the use of 
analytic methods. In this first introductory chapter, we restrict to a special case and 
consider zeta functions of quaternion orders over the rationals; as an application, we 
obtain a formula for the mass of the class set of a definite quaternion order. 


25.1 > Weighted class number formula 


Gauss conjectured (in the language of binary quadratic forms) that there are finitely 
many imaginary quadratic orders of class number 1 [Gau86, Article 303]. Approaches 
to this problem involve beautiful and deep mathematics. Given that we want to prove 
some kind of lower bound for the class number (in terms of the discriminant of the 
order), it is natural to seek an analytic expression for it. The analytic class number 
formula of Dirichlet provides such an expression, turning the class number problem of 
Gauss into a (still very hard, but tractable) problem of estimation. 

We recall section 14.1, which gives a classification of quaternion algebras over Q, 
and sections 16.1 and 17.1, providing background on ideal classes in quaternion orders. 
With this motivation in hand, we are led to ask: what are the definite quaternion orders 
of class number 1? The method to prove Dirichlet’s formula generalizes to definite 
quaternion orders as well, as pursued by Eichler in his mass formula. This chapter 
gives an overview of the Eichler mass formula in the simplest case for a maximal order 
in a definite quaternion algebra over Q. (The reader who is already motivated and 
ready for action may consider skipping to the next chapter.) 


Theorem 25.1.1 (Eichler mass formula over Q,maximal orders). Let B be a definite 
quaternion algebra over Q of discriminant D and let O C B be a maximal order. Then 


1 _ gD) 
tieciso Y7 12 

where wy := #O_(J)*/{+1} and (D) := #(Z/DZ)* = Ipin- 1) is the Euler totient 

function. 
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The Eichler mass formula does not quite give us a formula for the class number— 
rather, it gives us a formula for a “weighted” class number. That being said, we remark 
that wy < 24 (see Theorem 11.5.14), and very often wy = 1 (i.e., OL(J) = {+1}. In 
order to convert the Eichler mass formula into a formula for the class number itself, 
one needs to understand the unit groups of left orders: this can be understood either 
as a problem in representation numbers of ternary quadratic forms or of embedding 
numbers of quadratic orders into quaternion orders, and we will take this subject up in 
earnest in Chapter 30. 

Over Q, the Eichler mass formula was first proven by Hey [Hey29, II, (80)], a Ph.D. 
student of Artin, along the same lines as the proof sketched below. This formula was also 
stated by Brandt [Bra28, $67]. We gradually warm up to this theorem by considering a 
broader analytic context. We see the analytic class number for an imaginary quadratic 
field as coming from the residue of its zeta function, and we then pursue a quaternionic 
generalization. 


25.2 > Imaginary quadratic class number formula 


To introduce the circle of ideas, let K := Qvd) be a quadratic field of discriminant 
d € Zand let Zx be its ring of integers. We encode information about the field K by 
its zeta function. 


25.2.1. Over Q, we define the Riemann zeta function 


o0 


¢(s) := 2 (25.2.2) 
n=l n: 


as the prototypical such function; this series converges for Re s > 1, by the comparison 
test. By unique factorization, there is an Euler product 


=j 
cs)=] | f - 2) (25.2.3) 
p 


where the product is over all primes p. The function ¢(s) can be meromorphically 
continued to the right half-plane Re s > 0 using the fact that the sum 


converges for Re s > 0 and 
f(s) + 2(s) = 2'3 ECs) 


so that i 
¿(s) = 5s 2 2) 
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and the right-hand side makes sense for Res > 0 except for possible poles where 
2!-s = 1, For real values of s > 1, we have 


1 œ dx dx sS 
= — < ģ¢(s)<1l+ — = 
s—1l 1 x 1 x s-l 


l<(s-D(s) < s; 


SO 


therefore, as s approaches 1 from above, we have lims\1(s — 1)¢(s) = 1, so ¢(s) has a 
simple pole at s = 1 with residue 


res,_; C(s) = 1. (25.2.4) 


25.2.5. For the quadratic field K, modeled after (25.2.2) we define the Dedekind zeta 
function by 


1 
ORDS a (25.2.6) 


aCZxK 


where N(a) := #(Zx/a) is the absolute norm, the sum is over all nonzero ideals of 
Zr, and the series is defined for s € C with Res > 1. (We recall that N(a) is the 
positive generator of Nmx g(a), so we could equivalently work with the algebra norm, 
if desired.) 

We can also write the Dedekind zeta function as a Dirichlet series 


an 


fk(s)= ) = (25.2.7) 


ns 
n=1 


where an := #{a C Zg : N(a) = n} is the number of ideals in Zg of norm n > 1. By 
unique factorization of ideals, we again have an Euler product expansion 


1 -l 
Cx(s) = I] f = say , (25.2.8) 


p 


the product over all nonzero prime ideals p C Zg. 


In order to introduce a formula that involves the class number, we group the ideals 
in (25.2.6) by their ideal class: for [b] € Cl(K), we define the partial zeta function 


1 
CK,[6](S) := 7, nO 


acZKk 


[a]=[b] 


so that 


CK(s) = > CK,{6](S)- (25.2.9) 


[b]eCl(K) 


In general, for [b] € Cl(K), we have [a] = [b] if and only if there exists a € K* such 
that a = ab, but since a C Zr, in fact 


aéb! = fa € Zk : ab C Zg}; 
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this gives a bijection 
{ac Zx : [a] = [b]} e b7'/Z%, 


(since the generator of an ideal is unique up to units). Thus 


1 1 
£K,[6)(s) = —~— pee 
N(6) O#¢aeb-! /Z% N(@) 


(25.2.10) 


for each class [b] € Cl(K). 

Everything we have done so far works equally as well for real as for imaginary 
quadratic fields. But to make sense of b7! /ZX in the simplest case, we want Z% to 
be a finite group, which by Dirichlet’s unit theorem means exactly that K is Q or an 
imaginary quadratic field. So from now on in this section, we suppose K = Q(Vd) with 
d < 0. Then w := HZ = 2, except when d = —3, —4 where w = 6,4, respectively. 

Under this hypothesis, the sum (25.2.10) can be transformed into a sum over 
lattice points with the fixed factor w of overcounting. Before estimating the sum over 
reciprocal norms, we first estimate the count. Let A C C be a lattice. We can estimate 
the number of lattice points 2 € A with |A| < x by the ratio mx?/A, where A is the area 
of a fundamental parallelogram P for A: roughly speaking, this says that we can tile a 
circle of radius x with approximately 2x7/A parallelograms P. 

More precisely, the following lemma holds. 


Lemma 25.2.11. Let A C C be a lattice with area(C/A) = A. Then there is a constant 
C such that for all x > 1, 


2 
#A€ A: Mls a- E < Cx. 


We leave this lemma as an exercise (Exercise 25.3) in tiling a circle with radius x 
with fundamental parallelograms for the lattice A. With a bit of manipulation (Exercise 
25.4), this lemma can be used to prove the analytic class number formula. 


Theorem 25.2.12 (Analytic class number formula, imaginary quadratic field). Let 
K = Q(Vd) be an imaginary quadratic field with discriminant d < 0. Then 


2th 
wyld] 


where h is the class number of K and w is the number of roots of unity in K. 


TeSs=1 €x(s) = 


This formula simplifies slightly if we cancel the pole at s = 1 with ¢(s), as follows. 
Like in the Dirichlet series, we can combine terms in (25.2.8) to get 


1 \! RS 
«= T1TT (age) eeo 


P pip 


where 
(1-T), ifp splits in K; 
Lp) := 41-T, if p ramifies in K; and (25.2.13) 
1-T?, if p is inert in K. 
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The condition of being split, ramified, or inert in K is recorded in a function: 


1, if p splits in K; 
X(P) := Xap) = 40, if p ramifies in K; and (25.2.14) 


-1l, ifpisinertin K 


for prime p and extended to all positive integers by multiplicativity. If p f d is an odd 


prime, then 
d 
(Pp) = \— 
P 


is the usual Legendre symbol, equal to 1 or —1 according as if d is a quadratic residue 
or not modulo p. Then in all cases, we have 


LT) = (0 -D4 — x(p)T). 
Expanding the Euler product term-by-term and taking a limit, we conclude 
rls) = C(s)L(s, X) (25.2.15) 


where 


x(p)\ | x(n) 
L(s, x) =[] {1-2 =Y L, (25.2.16) 


n S 
P 


The function L(s, v) is in fact holomorphic for all Res > 0; this follows from the 
fact that the partial sums }),,-, x(n) are bounded and the mean value theorem. So in 
particular the series 


nsx 


+ XQ) XO KO 


Ld =1 
a, x) 5) 3 4 
converges (slowly). Combining (25.2.15) with the analytic class number formula yields: 
2mh 
Hine +0. (25.2.17) 
wvld| 


For example, taking d = —4, so y(2) = 0 and y(p) = (-1/p) = (C1), 


1 1 1 1 1 
LAD =l-7+z77t5T t 


—1)P-9/2\7! 
-T(-§ ) ) = OTB 
p 4 


p23 


Remark 25.2.18. The fact that L(1, y) + 0, and its generalization to complex char- 
acters y, is the key ingredient to prove Dirichlet’s theorem on primes in arithmetic 
progression (Theorem 14.2.9), used in the classification of quaternion algebras over 
Q. The arguments to complete the proof are requested in Exercise 26.11. 
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Remark 25.2.19. To approach the class number problem of Gauss, we would then seek 
lower bounds on L(1, y) in terms of the absolute discriminant |d|. Indeed, the history 
of class number problems is both long and beautiful. The problem of determining all 
positive definite binary quadratic forms with small class number was first posed by 
Gauss [Gau86, Article 303]. This problem was later seen to be equivalent to finding 
all imaginary quadratic fields of small class number (as in section 19.1). It would take 
almost 150 years of work, with important work of Heegner [Heeg52] and culminating 
in the results of Stark [Sta67] and Baker [Bak71], to determine those fields with class 
number 1: there are exactly nine, namely d = —3, —4, —7, —8, — 11, — 19, —43, —67, — 163. 
See Goldfeld [Gol85] or Stark [Sta2007] for a history of this problem. For more 
specifically on the analytic class number formula for imaginary quadratic fields, see 
the survey by Weston [Wes] as well as the book by Serre [Ser73, Chapter VI]. 


25.3 > Eichler mass formula: over the rationals 


We are now prepared to consider the analogue of the analytic class number formula 
(Theorem 25.2.12) for quaternion orders: the Eichler mass formula, which is a weighted 
class number formula. We follow Eichler [Eic55-56, Eic56a], and in this section we 
give an overview with proofs omitted—a full development will be given in the next 
chapter, in more generality. 

Let B be a quaternion algebra over Q of discriminant D and let O C B be an order. 
We define the zeta function of O to be 


1 
tols) := Daas (25.3.1) 
3 NC)" 


where the sum over all invertible (nonzero, integral) right O-ideals and 
NZ) := [O : I] = #(O/D) € Z>o. 


(By Main Theorem 16.1.3, we have N(Z) the totally positive generator of nrd(J )?, so 
we could equivalently work with the reduced norm.) 

Let a, be the number of invertible right O-ideals of reduced norm n > 0 (with 
positive generator chosen, as usual). Then N(Z) = Nm(/) = nrd(1)? by 16.4.10, so 


fols)= >) = (25.3.2) 
n=l 


To establish an Euler product for Zo(s), in due course we will give a kind of factorization 
formula for right ideals of O—but by necessity, writing an ideal as a compatible product 
will involve the entire set of orders connected to O! A direct consequence of the local- 
global dictionary for lattices (Theorem 9.4.9) is that 


Amn = Aman (25.3.3) 


whenever m,n are coprime. Next, we will count the ideals of a given reduced norm 
q = p° a power of a prime: the answer will depend on the local structure of the order 
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Op. Indeed, ¢o(s) has an Euler product 
Cots) = | | éon (25.3.4) 


p 


with ćo,p(T) € 1 + TZ[T]. In particular, ¢o(s) only depends on the genus (local 
isomorphism classes) of O. 


For simplicity, we first consider the case where O is a maximal order. Since there 
is a unique genus of maximal orders, the zeta function is independent of the choice of 
O and so we will write g(s) := ćo(s) for O maximal. Then by a local count, we will 
show that 


= pty, if p | D; 
ĉB,p(T) = (1 T ) f z pT?, if p } D. (25.3.5) 
From (25.3.5), i 
als) = £28628 - 1| | (! - 5). 25.3.6) 


p|D 
In particular, since Z(s) has a simple pole at s = 1 with residue 1 and ¢(2) = 2/6 
(Exercise 25.1), 
ress- fa(s) = lim(s — D) = =P] (1+ (25.3.7) 
Sten fa ee ag pID p a 


(We could also look to cancel the poles of ¢g(s) ina similar way to define an L-function 
for B, holomorphic for Re s > 0.) 


Now we break up the sum (25.3.1) according to right ideal class: 


és) = J, Sains) 


[J]eCls O 
where i 
= ——., 25.3. 
Sau) = Dy agp (25.3.8) 
ICO 
MIEN] 


Since [7] = [J] if and only if J = aJ for some invertible a € J~', and uJ = J if and 
only if u € OL(J)*, we conclude that 


1 1 
CEO) = a ae 
NG) OžaeJ71/OL(J)* N(a) 


(25.3.9) 


where the sum is taken over the nonzero elements œ € J~! up to right multiplication 
by units O_(J)* in the left order. 


In order to proceed, we now suppose that B is definite (ramified at co) and hence 
that #OL(J)* < œ (see Lemma 17.7.13); this is the analogue with the case of an 
imaginary quadratic field, and each J has the structure of a lattice in the Euclidean 
space R* via the embedding 


JOB BegR~H=R*. (25.3.10) 
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Let wy = #O.(J)*</{+1}. We again argue by counting lattice points to prove the 
following proposition. 


Proposition 25.3.11. The function ¢g [)(s) has a simple pole at s = 1 with residue 
2 


TeCS,= S$) = Se. 
s=1 SB,[7(S) GD 


Proof sketch. From amore general result (Theorem 26.2.12, proven in the next section 
and used to prove the analytic class number formula itself), we will show that 


_ 1 vol((R*)<1) 
tess-1 SBS) = FNC covold) (aital 


where under (25.3.10) we have 


2 
vol((R*)<1) = vol({x € Rf : |x| < 1} = F 


a 1(O D/4 
covol(J/) = covol(O) = Di4 N 
N(J) N(J) 
Putting all of these facts together, 


2 4N p 
= OL os (25.3.13) 
4w NCJ) D wyD oO 


ress=1 Cai) = 


In particular the pole of each zeta function ¢g,jJ}(s) is almost independent of the 
class [J], with the only relevant term being wy the number of units. 
Combining Proposition 25.3.13 with Proposition 25.3.11, 


n? 1 n? 1 
ress=1 BC) = 5 > oo JH- : (25.3.14) 
p|D 


[J]eCls O 


and we conclude the following theorem. 


Theorem 25.3.15 (Eichler mass formula, maximal orders). Let B be a definite quater- 
nion algebra over Q of discriminant D and let O C B be a maximal order. Then 
1 D 
4t- r (25.3.16) 
tieciso W7 


Remark 25.3.17. The Eichler mass formula is also very similar to the mass formula 
for the number of isomorphism classes of supersingular elliptic curves: this is no 
coincidence, and its origins will be explored in section 42.2. 


To extend the Eichler mass formula to a more general class of orders, one only 
needs to replace the local calculation in 25.3.5 by a count of invertible ideals in the 
order. First we treat the important case of Eichler orders (see 23.1.3). 
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Theorem 25.3.18 (Eichler mass formula, Eichler orders over Q). Let O C B be an 
Eichler order of level M in a definite quaternion algebra B of discriminant D. Then 


1 _ Dym) 
tieciso Y7 12 
where 
m= | | +p) =u] | f + -| : 
pellM pim\ P 


The most general formula is written in terms of the Eichler symbol 24.3. Just to 


O 
recall two important cases: if p | N = discrd(O), then (9) = —1 if Op is the maximal 


O 
order in the division algebra B,, and (=) = 1 if O, is an Eichler order. 
P 


Main Theorem 25.3.19 (Eichler mass formula, general case over Q). Let B be a 
definite quaternion algebra over Q and O C B be an order with discrd(O) = N. Then 


1 N 
a I] A(O, p) 
Weco “7 pIN 
where 
E 1+1/p, f(Ol|p)=1; 
A(O, p) = — E = 31-1/p, (Ol p) =-1; and (25.3.20) 
l= (S) 1-1/pP, if(O|p)=0. 


Main Theorem 25.3.19 was proven by Brzezinski [Brz90, (4.6)] and more generally 
over number rings by Körner [Kör87, Theorem 1]. 


25.4 Class number one and type number one 


The Eichler mass formula can be used to solve the class number 1 problem for definite 
quaternion orders over Z, and in fact it is much easier than for imaginary quadratic 
fields! We begin with the case of maximal orders. 


Theorem 25.4.1. Let O be a maximal order in a definite quaternion algebra over Q of 
discriminant D. Then #Cls O = 1 if and only if D = 2,3,5,7, 13. 


Proof. A calculation by hand; see Exercise 25.5. o 


Remark 25.4.2. The primes p = 2,3,5,7,13 in Theorem 25.4.1 are also the primes 
p such that the modular curve Xo(p) has genus 0. This is not a coincidence, and 
reflects a deep correspondence between classical and quaternionic modular forms (the 
Eichler-Shimizu—Jacquet-Langlands correspondence): see Remark 41.5.13. 
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The list of all definite quaternion orders (over Z) of class number 1 was determined 
by Brzezinski [Brz95]. (Brzezinski mistakenly lists an order of class number 2, and so 
he counts 25, not 24; he corrects this later in a footnote [Brz98, Footnote 1].) 


Theorem 25.4.3 (Brzezinski). There are exactly 24 isomorphism classes of definite 
quaternion orders over Z with #Cls O = 1. 


The list of orders with class number 1 is given in Table 25.4.4. We provide N := 
discrd(O), D := disc(B), we list the Eichler symbols (O | p) for the relevant primes 
p < 13, we say if the order is maximal, hereditary (but not maximal), Eichler/residually 
split or residually inert (but not hereditary), Bass (but not Eichler), or non-Gorenstein. 
This list includes the three norm Euclidean maximal orders (Exercise 25.6) of discrim- 


inant D = 2, 3,5. 
(O|p) 
N D|2 3 5 711 13 Class Q 
2 2 {|-l* * * * * maximal xX? -xy -xz +y + yz+z 
3 3ļ|x*x -l* * x x maximal xX xy +y az 
4 2/0 x x * * * Bass x? + y? +z 
5 S| « -l* * * maximal x? — xy — xz + y? + yz + 2z? 
6 2|-11 x * * x hereditary x? — xy + y? +22? 
6 3/1 -l+ * * >% hereditary x? + xz +y yz + 2z 
7 7|» x * -l* x maximal x? xz + y? +277 
8 2 )-l x * x * x Bass xX? + xy- xz + y? yz + 32" 
8 2/0 x * x * x Bass x? + y? +27? 
10 2 |-l * 1 xx x hereditary x? — xz + y? — yz + 3z? 
10 5|1 x»x-—l x * x hereditary x? + xy +xz+ 2y? +2yz + 2z? 
12 2O 1 x * x * Bass x? + 2y? — 2yz + 2z? 
12 3 |-1-1 * » x | residually inert x + xy ty? +42? 
12 3|1 -l * x x x Eichler x? xytxzt 2y? —yzZt 2z? 
12 3/0 -l* x * x Bass x? + y? +32 
13 13) * «*« * x * -l maximal x? — xy + 2y? + yz +22? 
16 2/0 x x x x x | non-Gorenstein 2(x? — xy- Xz+ y? +yz+ 2°) 
1462/0 * k * * * Bass x? + 2y? +22? 
18 2 j/-1l-l* x * x Bass x + xz + y? — yz + 5z 
18 2/0 l so*s Eichler x? + xz + 2y? + 2yz + 3z? 
20 5/0 x-l * * >% Bass x? + 2y? — 2yz + 327 
22 2ļ-l x xx 1 x hereditary x? + xz + 2y? + 327 
24 3/0 -l » * * * | non-Gorenstein 2(x? -xy + y + 2) 
28 7 -l x * -l* >% Bass x? + xy — xz + 3y? — 2yz + 327 


Table 25.4.4: Definite quaternion orders over Z with class number 1 


Proof of Theorem 25.4.3. Suppose #Cls O = 1. We apply the mass formula (Main 
Theorem 25.3.19). We note that A(O, p) => 1 — 1/p, in all three cases, so 


(25.4.5) 
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and therefore y(V) < 12. By elementary number theory, this implies that 
2 <N < l6orN = 18,20, 21, 22, 24, 26, 28, 30, 36, 42. 


This immediately gives a finite list of possibilities for the discriminant D = disc B € 
{2, 3, 5, 7, 11, 13, 30, 42}, as D must be a squarefree product of an odd number of primes. 

By Exercise 17.3, if O C O’ then there is a natural surjection Cls O — Cls O’, 
which is to say an order has at least as large a class number as any superorder. So 
we must have #Cls O’ = 1 for a maximal order O’ C B, and by Theorem 25.4.1, this 
then reduces us to D = 2,3,5,7, 13. Because #Cls O’ = 1, the maximal order O’ is 
unique up to conjugation, and so fixing a choice of such a maximal order O’ up to 
isomorphism we may suppose O € O’. But now the index [O’ : O] = M = D/N is 
explicitly given, and there are only finitely many suborders of bounded index; for each, 
we may compute representatives of the class set in a manner similar to the example in 
section 17.6. (We are aided further by deciding what assignment of Eichler symbols 
and unit orders would be necessary in each case.) o 


There is similarly an interest in definite quaternion orders O of type number 1: 
these are the orders with the property that the “local-to-global principle applies for 
isomorphisms”, i.e., if O, ~ Op for all primes p then O’ = O. If an order has class 
number | then it has type number 1, by Lemma 17.4.13, but one may have # Typ O = 1 
but # Cls O > 1. Since an order has the same type number as its Gorenstein saturation, 
i.e. Typ O = Typ Gor(O), it suffices to classify the Gorenstein orders with this property. 

By the bijection between ternary forms and quaternion orders, this is equivalently 
the problem of enumerating one-class genera of primitive ternary quadratic forms. 
The list was drawn up by Jagy—Kaplansky—Schiemann [JKS97] (with early work due 
to Watson [Wats75]), and has been independently confirmed by Lorch—Kirschmer 
[LK2013]. 


Theorem 25.4.6 (Watson, Jagy—Kaplansky—Schiemann, Lorch—Kirschmer). There are 
exactly 794 primitive ternary quadratic forms of class number 1, corresponding to 794 
Gorenstein quaternion orders of type number 1. The largest prime dividing a discrim- 
inant is 23, and the largest (reduced) discriminant is 28337? = 338688. There are 
exactly 9 corresponding to maximal quaternion orders: they have reduced discrimi- 
nants 


D = 2,3,5, 7, 13, 30, 42, 70, 78. 


Remark 25.4.7. The generalization of the class number | problem to quadratic forms 
of more variables was pursued by Watson, who showed that one-class genera do not 
exist in more than ten variables [Wats62]. Watson also tried to compile complete lists 
in low dimensions, followed by work of Hanke, and recently the complete list has been 
drawn up in at least 3 variables over Q by Lorch—Kirschmer [LK2013] and over totally 
real fields for maximal lattices by Kirschmer [Kir2014]. 


Exercises 


1. A short and fun proof of the equality (2) = 27/6 is due to Calabi [BCK93]. 
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(a) Expand (1 — xy y! ina geometric series and integrate termwise over 
S = [0, 1] x [0, 1] to obtain 


[fa-srtagy=14 545 4..=(1- 3] co. 


(b) Show that the substitution 


sin u =| 


(x, y) := | 


cos v’ cosu 
has Jacobian 1 — x? y? and maps the open triangle 

T := {(u,v): u,v > Qandu +v < 2/2} 
bijectively to the interior of S. 


(c) Conclude that 
2 


J[0 -2 axay= ff away = a 
s T 8 


and thus ¢(2) = n?/6. 


2. In this exercise, we give a very jazzy proof that ¿(k) € Qr* for all k € 2Z>1, 
due to Zagier [Zag94, p. 498]. 


(a) We start with k = 4. Define 


1 
(m,n) := + + 
f mn? 2m?n? mĉn 


for m,n € Zs, and prove that 


f(m,n) -— f(m+n,n)— f(m,m +n) = 


mn? 
(b) Prove 
ca? = Y Flam) = 224) 


n=1 
Conclude that ¢(4) = n*/90 using Exercise 25.1. 
(c) In general, for k € 2Z>2, let 


k-2 
1 1 1 1 
f(m, n) = mnk! + 5) 2, m'nk-r T mk-!n 


and check that 
1 


minki 


f(m,n)- f(m +n,n)— f(m,m +n) = > 
0<j<k 
j even 


EXERCISES 427 


Conclude in a similar way as in (b) that 


k+1 
D LOSE =) = Fe 
O<j<k 

j even 


so by induction Z(k) € Qa*. 


> 3. Prove Lemma 25.2.11 as follows. 


(a) Let P be a fundamental parallelogram for A, and for A € A let Pa := P +4. 
For x > 1, let 


D(x) := {z€ C: |z| < x} 


and 
N(x) :=#ĦHAEA:4E D(x)} 
Np(x) := HAE A: Pa C D(x)} 
Np(x) := HA € A: Pa N D(x) # 0}. 
Show that 


Np(x) < M(x) < N$). 
(b) Show that Np(x) < 7x? /A < N(x). 


(c) Let / be the length of a long diagonal in P. Show that for all A € AN D(x), 
we have Pa € D(x + 1), so 


2 
N(x) < Np(x +) < T 


Similarly, show that if Pa O D(x — 1) + 0 then Pa € D(x) and A € D(x), so 


_ 72 
-a < Ni(x—1) < N(x). 


(d) Conclude that Lemma 25.2.11 holds with C := (21 + 1?)/A. 


> 4. Using the previous exercise, we now prove the analytic class number formula 
(Theorem 25.2.12). 


(a) Let b c C be a fractional ideal, and let 


bn := #{a € b7! : Nm(a) = n}. 


NX 
ya 


nsx 


Show that 


< Cvx 


for a constant C that does not depend on x and 


A= No!) VAT 
2 
[Hint: Apply Lemma 25.2.11 to 67. 
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(b) Consider the Dirichlet series 


f= tag SoA 


n= 


Show by the comparison test that f(s) converges for all Re s > 1/2. 
(c) Show that 


2n 
wvldI 
(d) Sum the residues over [b] € Cl(K) to derive the theorem. 


Tess=1 K($) = lim(s — 1)ćkx,{o1(s) = 


> 5. In this exercise, we prove Theorem 25.4.1: if O is a maximal order in a definite 
quaternion algebra over Q of discriminant D, then #ClsO = 1 if and only if 
D = 2,3,5,7,13. By the Eichler mass formula (Theorem 25.3.15), we have 
#Cls O = 1 if and only if 
1 _ 9D) 
w 12 
where w = #O/{+1}. 


(a) Show (cf. 11.5.13) that if D > 3 then w < 3. 
(b) Show that #ClsO = 1 for D = 2,3. (The case D = 2 is the Hurwitz 
order and D = 3 is considered in Exercise 11.11. In fact, these orders are 


Euclidean with respect to the reduced norm: see the next exercise.) 

(c) Show that if D is a squarefree positive integer with an odd number of prime 
factors and y(D)/12 € {1, 1/2, 1/3}, then D € {5, 7, 13, 42}. 

(d) Prove that #Cls O = 1 for D = 5,7, 13 (cf. Exercise 17.10). 

(e) Show that #Cls O = 2 for D = 42. 


6. Let O be a definite quaternion order over Z. If O is Euclidean, then #Cls O = 1, 
and we saw in 11.3.1 that the Hurwitz order O of discriminant D = 2 is Euclidean 
with respect to the reduced norm. 


(a) Show that if O is norm Euclidean, then O is maximal. 

(b) Show that O is Euclidean with respect to the reduced norm if and only if 
for all y € B, there exists u € O such that nrd(y — u) < 1. 

(c) Show that if O is maximal, then O is norm Euclidean if and only if 


D= 2,3,3. 

7. Generalizing the previous exercise, we may ask for the Euclidean ideal classes 
in maximal orders. We will show that there are no nonprincipal Euclidean two- 
sided ideal classes in maximal definite quaternion orders over Z. [This exercise 
was suggested by Pete L. Clark.] 


Let O be a maximal definite quaternion order over Z of discriminant D, and let 
I be a two-sided O-ideal. We say that J is (norm) Euclidean if for all y € B, 
there exists u € J such that nrd(y — u) < nrd(/). 


(a) Show that if I is principal, then Z is Euclidean if and only if O is Euclidean. 
In general, show that J is Euclidean if and only if œI is Euclidean for all 
a € B*, so we may ask if [7] is Euclidean. 
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(b) Show that if [7] is nontrivial and Euclidean, then # Cls O = 2 and Picz(O) 
is cyclic, generated by [J]. [Hint: argue by induction on nrd(J). Then use 
the fact that Pic(O) is a group of exponent 2.] 

(c) Show that #Cls O = 2 if and only if D = 11, 17, 19, 30, 42, 70, 78. 

(d) Show that Picz(O) = 1 for D = 11,17, 19, and for the remaining discrimi- 
nants D = 30, 42, 70, 78 that the nontrivial class [7] is not norm Euclidean. 


This chapter is licensed under the terms of the Creative Commons Attribution- 
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Chapter 26 


Classical zeta functions 


In this chapter, we prove the Eichler mass number for a definite quaternion order over 
a totally real field using classical analytic methods. 


26.1 > Eichler mass formula 


In the previous chapter, we saw a sketch of how analytic methods with quaternionic 
zeta functions provide a weighted class number formula for a quaternion order in a 
definite quaternion algebra over Q, analogous to the analytic class number formula of 
Dirichlet for a quadratic field. The main result of this section is then the generalization 
of the Eichler mass formula to a definite quaternion order over a totally real number 
field. In this section, we give the statement of this result. 


26.1.1. Let F be a totally real number field of degree n = [F : Q], absolute discriminant 
dr, and ring of integers R := Zr. Let hp be the class number of F. Let 


1 
ér (s) := ; 
2, N(a)' 


be the Dedekind zeta function of F, where N(a) = [R : a] € Zo. Let B be a totally 
definite quaternion algebra over F of discriminant ©. Let O C B be an R-order with 
reduced discriminant discrd(O) = N. 


For a prime p | N with N(p) = q, let (9) € {-1,0,1} be the Eichler symbol 
(Definition 24.3.2), and let 


1- N)? 1+1/q4, if(Olp)=1; 


AOp) = — y =)! la if(Olp) ==1: (26.1.2) 
L- (2) 1-1/2, if(Olp)=0. 


26.1.3. We saw in Lemma 17.7.13 that for each definite order O, the group O! of 
units of reduced norm 1 is a finite group; we will see in Lemma 26.5.1 that further the 
group O*%/R* is finite. For a right O-ideal J, the automorphism group of J (as a right 
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O-module) consists of right multiplication maps by elements u € B* with uJ = J, 
i.e., HE OLJ. 


It was already evident in the Eichler mass formula (and remains a general principle 
in mathematics) that one often gets a better count of objects when they are weighted 
by the inverse size of the automorphism group, so we weight a right ideal class [J] by 
[OL(J)* : R*]7! and make the following definition of a weighted class number. 


Definition 26.1.4. Define the mass of Cls O to be 
mass(ClsO)= %, [OL : RT. 
[J]eCls O 
Main Theorem 26.1.5 (Eichler mass formula). With notation as in 26.1.1, we have 


2čr(2) 


mass(Cls O) = (Oman 


dp hrN | |0, p). (26.1.6) 
plot 

26.1.7. The functional equation for the Dedekind zeta function relates s to 1—s, giving 

an alternative way of writing (26.1.6) as 


26r(2) 3/2 _ IrL) 
ye eae (26.1.8) 


We notice that the Eichler mass formula then implies that -(—1) € Q. 


Remark 26.1.9. More generally, the rationality of the values ¢-(—n) with n € Zso is a 
theorem of Siegel [Sie69] and Deligne—Ribet [DR80]. 


Remark 26.1.10. The weighting in the mass is what makes Main Theorem 26.1.5 so 
simple. In the (unlikely) situation where wy = wo is independent of J, we would have 
a formula for the class number, but more generally we will need to take account of unit 
groups by computing embedding numbers of cyclotomic quadratic orders: we will do 
this in Chapter 30. 


We now make the formula (26.1.6) a bit more explicit for the case of Eichler orders. 


26.1.11. Let O be an Eichler order of level W, so that N = OM with D, M coprime. 
Then 


-l, ifp|9; 
($) =<], if p | M; 
p x ifp{ Ot. 


Accordingly, we define the generalized Euler y-function and Dedekind w-function by 


= -l= =n 
g) := | [NH -d = so] ral 


pD 
1 
WM) := N(p)*!(N(p) + 1) = NOM) (+p) 
IT MEES 


(recalling D is squarefree, with the natural extension y(D) = #(R/D)* for all D). The 
yw-function computes a unit index: see Lemma 26.6.7. 
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The Eichler mass formula (Main Theorem 26.1.5) for Eichler orders then reads as 
follows. 


Theorem 26.1.12 (Eichler mass formula, Eichler orders). With notation as in 26.1.11, 


we have 
2¢r(2) 


mass(Cls O) = (me 


dp hr DN). (26.1.13) 


Remark 26.1.14. The Eichler mass formula in the form (26.1.13) for maximal orders 
was proven by Eichler (working over a general totally real field) using the techniques in 
this chapter [Eic38b, Satz 1], and was extended to squarefree level Nt (i.e., hereditary 
orders) again by Eichler [EicS6a, §4]. This was extended by Brzezinski [Brz90, (4.6)] 
to a general formula over Q and by Körner [K6r87, Theorem 1], using idelic methods. 


The classical method to prove Main Theorem 26.1.5 is similar to the one we 
sketched over Q in chapter 25, with some added technicalities of working over a 
number field. We follow this approach, first proving the formula when O is a maximal 
order, and then deducing the general case. We will return in chapter 29 and reconsider 
the Eichler mass formula from an idelic point of view, thinking of it as a special case 
of a volume formula (for a finite set of “quotient points”). It is hoped that this chapter 
will serve to show both the power and limits of classical methods before we build upon 
them using idelic methods. 


26.2 Analytic class number formula 


In this section, in preparation for the quaternionic case we briefly review what we 
need from the analytic class number formula for a number field F. References for this 
material include Borevich—Shafarevich [BS66, Chapter 5], Lang [Lang94, Chapter 
VI], and Neukirch [Neu99, Chapter VI]. 

We begin by setting some notation that will be used throughout the rest of this 
chapter. Let F be a number field of degree n := [F : Q] with r real places and c 
complex places, so that n = r +2c. Let R := Zr be the ring of integers in F, and let dF 
be the discriminant of F. Let wr be the number of roots of unity in F, let hr := # C1 R 
be the class number of F, and let Reg, be the regulator of F (the covolume of R* 
under the Minkowski embedding). 

Define the Dedekind zeta function for s € C with Re(s) > 1 by 


1 
ér(s) := saa 
2, N(a)* 


where the sum is over all nonzero ideals of R and N(a) = #(R/a) = [R : a] is the 
absolute norm; we have N(a) = Nm(a) with norm taken from F to Q and positive 
generator chosen. 


26.2.1. The Dedekind zeta function converges for Re s > 1 and has an Euler product 


1 -1 
éw =] | (! - vad (26.2.2) 
p 
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where the product is over all nonzero primes of R—this follows formally from unique 
factorization of ideals after one shows that the pruned product converges. 

The Dedekind zeta function has properties analogous to the Riemann zeta function, 
which is the special case F = Q. In particular, we can extend ¢r(s) in a manner 
analogous to 25.2.1 to Re s > 0. For a € C we write ¢7(a) for the leading coefficient 
in the Laurent series expansion for fp at s = a. 


Theorem 26.2.3 (Analytic class number formula). ¿rF (s) has analytic continuation to 
Res > 0, with a simple pole at s = 1 having residue 


2" (2 m) 
Wr Vid 
Remark 26.2.5. The formula (26.2.4) is known as Dirichlet’s analytic class number 


formula (even though the original form of Dirichlet’s theorem concerned quadratic 
forms rather than classes of ideals, so is closer to Theorem 25.2.12). 


GO = lim(s ~ p(s) = he Reg. (26.2.4) 


Example 26.2.6. When F is an imaginary quadratic field (r = 0 and c = 1) we have 
Reg, = 1 and Theorem 26.2.3 is Theorem 25.2.12. 


Before we finish this section, we review a few ingredients from the proof of the 
analytic class number formula (26.2.4) to set up the Eichler mass formula. 


26.2.7. We first write the Dedekind zeta function as a sum over ideals in a given ideal 
class [b] € CI(R): we define the partial zeta function 


1 
CF,fo|(S) := >» NG (26.2.8) 
acR 
[a]=[b] 


convergent for Re s > 1 by comparison to the harmonic series, so that 
fr(s)= >) iro). (26.2.9) 
[b]ECIR 
Now note that [a] = [b] if and only if a = ab for some nonzero 
aéb!={xeF:xbC R}, 
so there is a bijection between nonzero ideals a C R such that [a] = [b] and the set of 
nonzero elements in b7! /R*. So 
1 


Cros) = me a (26.2.10) 
O#aeb-1/R* 


One now reduces to a problem concerning lattice points in a fundamental domain for 
the action of R*, and examining the residue of the pole at s = 1 fits into a more general 
framework (invoked again below). 
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Definition 26.2.11. A cone X C R” is a subset closed under multiplication by positive 

scalars, so tX = X forall t € Ryo. 

Theorem 26.2.12. Let X C R” be a cone. Let N : X > Rso be a function satisfying 
N(tx) =t"N(x) forall x € X,t € Rso. 


Suppose that 
Xe = {x€ X : M(x) < 1 CR" (26.2.13) 


is a bounded subset with volume vol(X<,). Let A C R” be a (full) Z-lattice in R”, and 


let i 
ča, x (s) := >, Nay 


AEXNA 


Then €y_x(s) converges for Res > 1 and has a simple pole at s = 1 with residue 


i . vol(X<1) 
1)=1 -1 =. 
fax (1) ae Ja, x (s) ETIN 
Proof. See Borevich-Shafarevich [BS66, Chapter 5, Section 1.1, Theorem 1]. oO 


26.2.14. To apply Theorem 26.2.12 for fF,75](s), we embed F — Fr = R” x CÀ and 
we equip Fig with the inner product 


r C 


(xy) = $ xyi + X 2Reltr Iray) (26.2.15) 
j=1 


i=] Js 


for x = (xi) y = (yi)i € Fg. This inner product modifies the usual one by rescaling 
complex coordinates, and the volume form vol induced by (, ) is 2° times the standard 
Lebesgue volume on R” x C°. With this convention, we have (x, 1) = Trrig(x) and 
covol(R) = Vd. 

We then take A to be the image of b7! and take X to be a cone fundamental domain 
for the action of the unit group R*. The absolute norm N(x) = |NmFj9(x)| then satisfies 
the required homogeneity property, and X<; is bounded, so by Theorem 26.2.12, 


br po) = a woe (26.2.16) 
We have ae, Wasi 
covo. 
covol(A) = -A E N = (26.2.17) 
It requires a bit more work to compute vol(X<1). 
Proposition 26.2.18. We have 
siege Bee (26.2.19) 


WF 
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Proof. See Exercise 26.3: the proof is well-summarized as a “change of variables’, but 
the reader may prefer the idelic point of view (Chapter 29) instead, where the integrals 
are ‘easier’. A detailed proof can be found in Borevich—Shafarevich [BS66, §5.1.3], 
Lang [Lang94, §VI.3, Theorem 3], and Neukirch [Neu99, §VII.5]. oO 


Plugging (26.2.19) and (26.2.17) into (26.2.16), 
. 2r (2m): 
cr D = — == Regr; (26.2.20) 
WF VidF| 
note in particular that this does not depend on the class [b]! The analytic class number 
formula (Theorem 26.2.3) then follows as 


2" (2m) 
i=) Geos Ga Reg php. (26.2.21) 
[b] 


WF VidF| 


26.3 Classical zeta functions of quaternion algebras 


We now embark on a proof in our quaternionic setting, mimicking the above. We retain 
our notation on the number field F. We further let throughout B be a quaternion algebra 
over F of discriminant © and let O C B be an R-order. (Our emphasis will be on the 
case O a maximal order, but many definitions carry through.) 

To begin, in this section we define the classical zeta function and show it has an 
Euler product. 


26.3.1. Let J be an invertible, integral right O-ideal, so that J C O, and by definition / is 
sated so Op(/) = O. Recall we have defined N(/) = #(O/J); we have N(/) = N(nrd(1))? 
(Paragraph 16.4.10). 

For example, if a C R is a nonzero ideal then N(aO) = N(a)*. 


We then define the (classical) zeta function of O to be 


N 1 ay,(O) 
lols) := > T B eae (26.3.2) 


ICO n 


where the first sum is over all (nonzero) integral, invertible right O-ideals Z and in the 
second sum we define 
a,(O) := #{7 C O : nrd(7) = n} (26.3.3) 


(and an(O) is finite by Lemma 17.7.26). 
Lemma 26.3.4. If O, O’ are locally isomorphic, then ay(O) = ay(O’) for all n. 


Proof. We use the local-global dictionary for lattices (Theorem 9.4.9). To ease paren- 
theses in the notation, we work in the completion, but one can also work just in the 
localization. For all p, we have O, = v, 'Osvp for some vp € By, and we may take 
vp = | for all but finitely many p; the element vy is well-defined up to left multiplication 
by O% and right multiplication by O% . 
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Then to an integral, invertible right O-ideal /, we associate the unique lattice 7’ such 
that J, = vp Ty vp; such a lattice is well-defined independent of the choice of vp. By 
construction, Og(/;,) = Op so ORQ’) = O’. And since J is integral, Ip C Op whence 
I, & Vp 1I Vp c O, and J is locally integral hence integral. Since / is invertible, 
I is locally principal, so 7’ is also locally principal, hence invertible. Finally, again 
checking locally, we have nrd(/’) = nrd(J/). 

Repeating this argument going from /’ to J, we see that the corresponding sets of 
ideals are in bijection, as claimed. oO 


26.3.5. From Lemma 26.3.4, we see that Zo(s) only depends on the genus of O. Since 
there is a unique genus of maximal orders in B, following the number field case we 
will write €3(s) = o(s) where O is any maximal order. 


Our next order of business is to establish an Euler product for fo(s). We prove a 
more general result on the factorization of invertible lattices. 


Lemma 26.3.6. Let I be an invertible, integral lattice and suppose that nrd(1) = mn 
with m,n © R coprime ideals. Then there exists a unique invertible, integral lattice J 
such that I is compatible with J~' with IJ~! integral and nrd(J) = m. 


Proof. We use the local-global dictionary for lattices, and we define J C B to be the 
unique lattice such that 


Ip) = Og) if p { mn; 
Tig := 4 Tey, if p | m; (26.3.7) 
Ow), ifp|n. 


We have Or(J) = O and nrd(J) = m, since these statements hold locally. Integrality 
and invertibility are local; since these are true for / they are true for J. Finally, we 
compute that (1J~!)(p) = Og) for all p 4 n and (IJH) = Ig) for p | n, so IJ! 
is locally integral and hence integral. The uniqueness of J can be verified directly 
(Exercise 26.4). o 


26.3.8. Consider the situation of Lemma 26.3.6. Let I’ = IJ~'. Then J = I’ J, and I’ 
is integral, invertible (Paragraph 16.5.3) and compatible with J. Since nrd(/’) = n, we 
have “factored” T. 

We have Or(/’) = OL(J) by compatibility, but this common order is only locally 
isomorphic to O, since J’, J are locally principal but not necessarily principal. So in 
a sense, this factorization occurs not over O but over the genus of O—but this is a 
harmless extension. 


Proposition 26.3.9. If m, n are coprime, then amn(O) = am(O)ay(O). 


Proof. Write An(O) for the set of integral, invertible right O-ideals 7 with nrd(7) = n. 
Then #A,(O) = ay(O). According to Lemma 26.3.6, there is a map 
Amn(O) > An(O) 


26.3.10 
Ind ( ) 
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We claim that this map is surjective and that each fiber has cardinality am(O). Indeed, 
these statements follow at the same time from the following observation: if J € A,(O) 
with O’ = O,(J), then for each I’ € Am(O’), we have I’ compatible with J and 
I =I'J € Am, and conversely; so the fiber of (26.3.10) is identified with Am(O’), of 
cardinality ay,(O’) = am(O) by Lemma 26.3.4. o 


26.3.11. From Proposition 26.3.9 and unique factorization of ideals in R, we find that 
ćo has an Euler product 


fo(s) =| | 40, (8) (26.3.12) 
p 
where 
1 = ape (O) 
= = : 26.3.13 
ČO, (s) P N(p)° 2, N(p)2s ( ) 


Remark 26.3.14. Zeta functions of semisimple algebras over a number field can be 
defined in the same way as in (26.3.2), following Solomon [Sol77]: see the survey on 
analytic methods in noncommutative number theory by Bushnell—Reiner [BR85]. 


Remark 26.3.15. The world of L-functions is rich and very deep: for a beautiful 
survey of the analytic theory of automorphic L-functions in historical perspective, 
see Gelbart—Miller [GM2004]. In particular, we have not given a general definition 
of zeta functions (or L-functions) in this section, but it is generally agreed that the 
Selberg class incorporates the minimal essential features: definition as a Dirichlet 
series, meromorphic continuation to the complex plane, Euler product, and functional 
equation. See e.g. Conrey—Ghosh [CG93] and the references therein. 


26.4 Counting ideals in a maximal order 


We now count ideals of prime power norm. By the local-global dictionary, there is a 
bijection 
Uc oO: nrd) = pf} > {Ip C Op : nrd(/p) = p°}. 

so it suffices to count the number of ideals in the local case. In this section, we carry 
out this count for maximal orders. 

So let p C R be a (nonzero) prime and let q := Nm(p). Let Op C By be a maximal 
order. Let 

ape (Op) = Hp = @pOp E Op : nrd([p) = p°} 


count the number of right integral O,-ideals of norm p°. Since Op is maximal, every 
nonzero ideal is invertible; and because Ry is a DVR, all such invertible ideals are 
principal. 


Lemma 26.4.1. Let Op C By be a maximal order and let e € Zo. 


a is a division ring, then every right integra -ideal 1s a power of the 
(a) If By is a divisi ing, th y right integral Oy,-ideal is a p f th 
maximal ideal and age(Oy) = 1. 
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(b) If By ~ M2(Fp), so that Op =~ M2(Rp), then the set of right integral Oy-ideals 
of reduced norm p° is in bijection with the set 


Cc 


w fo] ev € Zaqu ty = eand ee R/p'} 


and 
dpe(Op) =1l+qt---+¢%. (26.4.2) 


Proof. For (a), if p is ramified then by the work of section 13.3, there is a unique 
maximal order O, with a unique (two-sided) maximal ideal J, having nrd(J,) = p, 
and all ideals of Op are powers of Jy. 

To prove (b), we appeal to the theory of elementary divisors (applying column 
operations, acting on the right). Suppose Op = M2(Rp). Let Ip = @pOp be a right 
integral O,-ideal of norm p° and let m be a uniformizer for p. Then by the theory of 
elementary divisors, we can write 

_ (x 0 
wù n) 


for unique u,v € Z>ọ with u + v = e and c € R is uniquely defined as element of 
R/p” (Exercise 26.6). It follows that the number of such ideals is equal to X} $-9 q” = 
l+q+: tgi. o 


26.4.3. There is an alternate bijection that is quite useful. We say an integral right 
O-ideal / is p-primitive if it does not contain pO (so we cannot write J = pl’ with I’ 
integral). 

For a commutative ring A, we define the projective line over A to be the set 


P!(A) := {(x, y) € A? : xA + yA = A}/A*™ 


and write equivalence classes (x : y) € P! (A). 
Then for Op = M2(Rọp), there is a bijection 


P'(R/p®) > {Ip S Op : Ip primitive and nrd(7p) = p°} 


0 e (26.4.4) 
(a: c) É 0) Or +p©Op 


Any ideal of the form in the right-hand side of (26.4.4) is a primitive right integral 
Oy-ideal with reduced norm p°. Conversely, suppose that Ip = œpOp is primitive. We 
have nrd(@y) = 0 (mod p°). We find a “standard form” for J, by looking at the left 
kernel of ay. Let 

L := {x € (R/p*)” : xæp = 0 (mod p*)}. 


We claim that L is a free R/p°-module of rank 1. Indeed, L is one-dimensional over 
R/p since Iy is primitive and so ay #0 (mod p); by Hensel’s lemma, it follows that L 
is also one-dimensional. Therefore, there is a unique generator (a : c) € Pl(R /p®) for 
L. We therefore define an map Ip +> (—c : a) and verify that this furnishes an inverse 
to (26.4.4). 
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Since #P!(R/p°) =qet+ go for e > 1, we recover the count (26.4.2) as 


Le/2] 
dye(Op) = X) #P(R/po™) = ge + gel te 4g 4h. 
i=0 


26.4.5. Lemma 26.4.1 implies a factorization of ¢g, (s) = fo, (s). Write 


S i ty 
= =|= 26.4.6 
OLDI am >) (26.4.6) 


e=0 


so that ¢r (s) = [Tp Sr, (5). 
Corollary 26.4.7. We have 
2 ane if p is ramified; 
sit). < : 
mm | =) T ~1/q?')"', ifp is split. 


Equivalently, 


er, (25), if p is ramified; 
éB (s) =” AE 
Cr, (2s)or, (25-1), ifp is split. 


Proof. We use Lemma 26.4.1. If By is a division ring, then Lemma 26.4. 1(a) applies, 
and the result is immediate. For the second case, we compute 


Ss dl+qg+ etg sy loge" 
és, (8s) = 2 ——3z— = ) = om 
p È qres 2, dd = qyqes 
1 
= -g -q Èz qres -4> d q5- aa 
e=0 (26.4.8) 
o 1 1 q 
~ 1-q\1-1/q@s 1-1/0! 
i t+? 
daal (ga 
as claimed. o 


We have proven the following result. 


Theorem 26.4.9 (Factorization of g(s), maximal order). Let B be a quaternion 
algebra of discriminant D = disc B. Then 


fp(s) = [| ér, (8) = Sr(2s)fe(2s - 1) | [0 - Nip). (26.4.10) 


plD 


Proof. Combine the Euler product 26.3.11 with Corollary 26.4.7. oO 
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Corollary 26.4.11. g(s) has a simple pole at s = 1 with residue 


ae (1) g 
GC) = lim(s — Déa(s) = cE [ Ja-Na@. (26.4.12) 


plD 


Proof. Since €r(s) has only a simple pole at s = 1, with residue computed in Theorem 
26.2.3, there is a single simple pole of (g(s) at s = 1. oO 


26.5 Eichler mass formula: maximal orders 


We now finish the proof of the Eichler mass formula (Main Theorem 26.1.5) for 
maximal orders (26.1.13). In the next section, we will deduce the general formula 
from it: for a nonmaximal order, there are extra factors at each prime dividing the 
discriminant, and it is simpler to account for those in a separate step. 

In this section, we now suppose that B is definite, so F is a totally real field (and 
dr > 0). In particular, B is a division algebra. We saw in 26.1.3 that it was natural 
to weight ideal classes inversely by the size of their automorphism group (modulo 
scalars). To this end, and noting R* < O% is central so normal, we prove the following 
lemma. 


Lemma 26.5.1. The group O* /R* is finite. 


Proof. In Lemma 17.7.13, we proved that 
O! := {y € O* : nrd(y) = 1} 


is a finite group by embedding O —> Bg ~ R“ as a Euclidean lattice with respect to 
the absolute reduced norm (see 17.7.10). Since O! N R* = {+1}, the reduced norm 
gives an exact sequence 

O! O* ma R* 


a = ——_ 


fel) R RR 


(26.5.2) 


By Dirichlet’s unit theorem, the group R* is finitely generated (of rank r + c — 1), so 
the group R*/R*? is a finite abelian 2-group. The result follows. o 


We will examine unit groups in detail in Chapter 32. With this finiteness statement 
in hand, we make the following definition. 


Definition 26.5.3. Define the mass of O to be 


1 
mass(Cls O) := ) — 
WJ 
[J]JeCls O 


where wy; = [OL(J)* : R*] € Zs. 
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Theorem 26.5.4 (Eichler’s mass formula). Let O be a maximal order in a totally 
definite quaternion algebra B of discriminant D. Then 


mass(Cls O) = hrd?’ oD) 


2 
(2n)2" 
where p(D) = [Ipo (N) - 1). 


Following the strategy in the classical case (to prove the analytic class number 
formula), to prove Theorem 26.5.4 we will write o(s) as a sum over right ideal classes 
and analyze its residue at s = 1 by a volume computation. 


26.5.5. For an integral invertible right O-ideal J, let 


1 
Sos) := 2 NO (26.5.6) 
=] 


Then 
fols)= Y, ove) 


[J]eCls O 


We have [/] = [J] if and only if J ~ J if and only if J = aJ for nonzero a € J~!. Since 
4J = J if and only if u € OL(J)* (Exercise 16.3), it follows that 


foi) = 5a an (26.5.7) 
O#aeJ~! /OL(J)* 
By Lemma 26.5.1, we have 
wy := [OL(J)* : R*] € Zso. (26.5.8) 
Then (26.5.7) becomes 
So,(8) = — Bu (26.5.9) 
WINDY zaera jr NOY 


Proposition 26.5.10. Let ÑN := discrd(O). Then €o,,(s) has a simple pole at s = 1 
with residue 
C = 2" (27) Regp 

GEI SwA NN) ` 
Proof. We relate residue to volumes using Theorem 26.2.12. We recall (again) 17.7.10: 
this gives 

J! —> B > Bg := B Q&Q R = H” =~ R™ 
the structure of a Euclidean lattice A C R*” with respect to the absolute reduced 


norm. We take the function N in Theorem 26.2.12 to be the absolute norm N (recalling 
16.4.8). 
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We claim that 
d?, NW) 


2n 
By compatible real scaling, it is enough to prove that this relation holds for a single 
order O, and we choose the R-order 


O=RƏRIİI®RjƏ® Rk. (26.5.12) 


covol(O) = 


(26.5.11) 


The lattice R C Fg has covolume Vdr, so R* has covolume Var = ae the Z-order 
Z@ZieZj Zk has reduced discriminant 4 and covolume 1; and putting these together, 
the formula (26.5.11) is verified. 
Then (26.5.11) and N(J) = [O : J]z = [J~! : O] imply that 
covol(O) _ d7-N(M) 
NJ) 2"NVJ)” 


covol(A) = (26.5.13) 


Next, the group O.(J)* acts on J~! (and on Bg); and this group contains R* with 
finite index wy = [OL(J) : R*], so 


vol(OL(J)*\Br) = + vol(R*\ Br). (26.5.14) 


Multiplication provides an identification 


Bae = Frei XH)", 


so 
Xe. = R*\Br,<1 = (E\Fr,<1) x (=1)\ŒH)”) (26.5.15) 
where E < R* is acting by squares. Thus 
oe! 1 
vol(R*\FR,<1) = dan Reser = qResr- (26.5.16) 
Therefore j 
27°)"R 
igs ee (26.5.17) 
8wy 
From Theorem 26.2.12 together with (26.5.13) and (26.5.17), 
: 4"(27n7)"Re 2" (27) "Re 
Paie e e ie all et (26.5.18) 
Swrdp NN) Swyddi NN) 
o 


We now conclude the proof. 


Proof of Theorem 26.5.4. We now suppose that O C B is a maximal order, and write 
ép(s) and fp ,7)(s). We compare the evaluation of residues given by Corollary 26.4.11 
and Proposition 26.5.10. Since ¢g(s) and each €p ,7)(s) have simple poles at s = 1, we 
get 

GO= >) Gin 


[J]eCls O 
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From (26.4.12), 


* 1 1 * 1 D 
GA) = po ) I] (! = | = oo 0), (26.5.19) 
pid 


From the analytic class number formula (Theorem 26.2.3), 


n 


2 
=(1) = —— hr Re 
pC) dp QF 


since wr = 2 (as F is totally real). 
Adding the residues from Lemma 26.5.10, we find that 


2" Gr (2 D 2" (2)?"R 1 
Srl J Res. O22 il eSF E2 (26.5.20) 
4Vdr ND) 8d-N®) ko WI 
Cancelling, we find 
1 2 
mass(Cls O) = b> —= Gn SF Odp he GD) (26.5.21) 
tieciso W7 (27) 
and this concludes the proof. o 


Remark 26.5.22. For an alternative direct approach in this setting using Epstein zeta 
functions, see Sands [San2017]. 


26.6 Eichler mass formula: general case 


We now consider the general case of the Eichler mass formula, involving two steps. 
First, we relate the class set of a suborder to the class set of a (maximal) superorder; 
second, we compute the fibers of this map via a group action of the units. 

For these steps, we refresh our notation and allow B to be a definite or indefinite 
quaternion algebra over F. 


26.6.1. Let O’ 2 O be an R-superorder, and suppose that there is a prime p such that 
O = Og for all primes q + p. We refine the map from Exercise 17.3(b) as follows. 
For J C O aright O-ideal, we define the right O’-ideal p(J) = IO’ C O’ obtained by 


extension. Then p induces a map 
Cls O = Cls O’ (26.6.2) 
[U = [0] E 


that is well-defined and surjective (Exercise 26.5(a)). Let [/’] € Cls O’ and consider 
the set 
pid’) ={1cO:10 =1'j, 


the fiber of the extension map over /’. 
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We define an action of the group O; on p '(1’)as follows. Write 7 p = BpO,. Then 
to Up € O% , we associate the unique lattice 7(up) (the notation to suggest “the lattice 
generated by up”) such that 


Kipp = BpHpOp 
and I({p)q = Iq = 14 for all q + p, using the local-global dictionary (Theorem 9.4.9). 
This defines a right action of O’*; it acts simply transitively on p~!(J’), and the kernel 
of this action is visibly the subgroup O%. Therefore 
#p '(I') = [OÑ : OF]. 


We now look at the classes in the fiber. If 4p, vp € Of have [I(Up)] = [Ivp)] € 
Cls O, then there exists a € B* such that 


app) = Ip) 


and by extension al’ = I’, so a € OL(l’), and conversely. Therefore, we have a 
bijection 
ClsO © |_| OLATU’). (26.6.3) 
[I’JeCls O’ 


(See also Pacetti—Sirolli [PS2014, §3].) 


Proposition 26.6.4. Let O’ 2 O be an R-superorder, and suppose that there is a prime 
p such that O% = Oq for all primes q # p. Then 


mass(Cls O) = [0 : OF] mass(Cls O’). 
Proof. By (26.6.3), we conclude that 


mass(Cls O) = 3 ea 3 aa 


tneciso Y!  pmyeaiso rover W! \ WI 
-5 10%: 0% 1 (26.6.5) 
p Pwr 


[I’JeCls O’ 
= [0% : OF] mass(Cls O’) 


as claimed. o 


In order to apply Proposition 26.6.4, we need to compute the index of unit groups, 
a quantity that depends on the (locally defined) Eichler symbol. For a prime p, we 
define 


i l+1/q, if(|p)=1; 

_ 2 

A(O, p) := 1 =41-1/q, if(O|p)=-1; (26.6.6) 
1~ (F) ino 1-1/4, if (O|p) =0. 
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Lemma 26.6.7. Let O’ 2 O be a containment of R-orders with O' maximal. Then 


if p is split in B; 


[Op + Op] = [9p : OplaO,p) {i 1/q)"!, ifp is ramified in B. 


Proof. We follow Korner [K6r85, §3]. To prove the lemma, we may localize at p and 
so we drop the subscripts. Let n € Zs, be such that p"O’ C pO. Then 
[O'X :1+pO’][1 + pO’: 1+ p"O’] 

[OX :1+pO][1+pO:1+p"07] ` 


lO : 0%] = 


For y, 6 € 1 + pO, we have yo"! € 1 + pO’ if and only if y — 6 € p"O’. Therefore 
[1+ pO: 1+p"0'] = [pO : p"0’] = [O : p*'0'] 
and similarly with O’, all indices taken as abelian groups. Therefore 
[1+pO’:1+p"O0'] 
[1+pO:1+p"0'] 


For the other terms, we recall Lemma 24.3.12. We divide up into the cases, noting 
that if(O’ | p) = —1 then we must have £ = —1, 0 by classification (Exercise 24.3); this 
leaves 6 cases to compute. For example, if (O’ | p) = » and (O’ | p) = 1, then 

[O%:1+pO']  g(q-1°(@+1) _ TE. 


[O0*:1+p0]  @q-1? q 


The other cases follow similarly (Exercise 26.8). oO 


= [0 : 0]. 


We can now finish the job. 


Proof of Main Theorem 26.1.5. We first invoke Theorem 26.5.4 for a maximal order 
O’ 2 O to get 


PO 2 3/2 _ i 
mass(Cls O’) = Onn hrd“ Nm(®) I] f sma) ; 


By Proposition 26.6.4 and Lemma 26.6.7, we have 


mass(Cls O) = mass(Cls O’) | [roe : OX] 
pN 


= mass(Cls O’)[O’ : O]z I] (i- “am! I] A(O, p) (26.6.8) 


plD plot 


— hrd; Nm) | | ACO,p) 
plot 


using Nm(t) = Nm(®)[O’ : O]z. Oo 


~ (Qa <i 
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26.7 Class number one 


It is helpful to get a sense of the overall size of the mass, as follows. 


26.7.1. Let m(®, W) be the mass of an(y) Eichler order of level Wt. Then in analogy 
with the Brauer—Siegel theorem, 


log m(D, W) ~ Slog dr + log hr + log Nm DM) (26.7.2) 


as dr Nm(DMt) — oo with the degree n fixed: see Exercise 26.9. In particular, for 
F = Q, 
logm(D, M) ~ log(DM). 


Since F is totally real, one typically expects hr to be small in comparison to 
dp—but there is a family of real quadratic fields with small regulator first studied by 
Chowla with log hr ~ 5 log dr, a result due to Montgomery—Weinberger [MW77]. 


To conclude this section, as in section 25.4 (over Q), the Eichler mass formula 
can now be used to solve class number one problems for quaternion orders for definite 
quaternion orders (over totally real fields). This effort was undertaken recently by 
Kirschmer—Lorch [KL2016]: a complete list of definite orders of type number one is 
given, and again because #Typ O < #Cls O, the following theorem can be proven. 


Theorem 26.7.3 (Kirschmer—Lorch). There are 4194 one-class genera of primitive, 
positive definite ternary quadratic forms (equivalently, definite quaternion orders O 
with #TypO = 1, up to isomorphism): they occur over 30 possible base fields of 
degrees up to 5. 

There are exactly 154 isomorphism classes of definite quaternion orders O with 
#Cls O = 1; of these, 144 are Gorenstein and 10 are non-Gorenstein. 


Remark 26.7.4. Kirschmer—Lorch [KL2016] also enumerate two-class genera; a com- 
plete list is available online [KLwww]. Cerri-Chaubert—Lezowski [CCL2013] also 
consider totally definite Euclidean orders over totally real fields, giving the complete 
list over Q and over quadratic fields: all of them are Euclidean under the reduced norm. 


26.8 Functional equation and classification 


To conclude this chapter, we discuss the functional equation and important applications 
to the classification of quaternion algebras over number fields. This section serves as 
preview of the material in chapter 29 (motivation for it!) but we frame results in the 
same vein as the results of this chapter. 

Following Riemann, we complete ¢(s) := }%_; n™* to the function 


ECS) = nP £(s)T(s/2), 


where I (s) is the complex I -function (Exercise 26.2). Riemann proved that &(s) extends 
to a meromorphic function on C and satisfies the functional equation 


EC -= s) = &(s). (26.8.1) 
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(It is also common to multiply €(s) by s(1 — s) to cancel the poles at s = 0, 1.) We will 
prove this in section 29.1—the impatient reader is encouraged to flip ahead! 


26.8.2. This result extends to Dedekind zeta functions (retaining the notation from 
section 26.2). Define 


Tr(s) := 2 °?T(s/2), Tels) := 22a) ST (s). (26.8.3) 
We then define the completed Dedekind zeta function to be 
Er(s) := Idr TRC) Tels) fr (s). (26.8.4) 
Then ér (s) satisfies the functional equation 
ér(1— s) = ér (s) (26.8.5) 


for all s e C. We prove (26.8.5) as Corollary 29.10.3(a) using idelic methods; this 
proof will also motivate the completion defined above. For now, we borrow from the 
future. The functional equation gives ¢p (s) meromorphic continuation to C via 


s—1/2 r Cc 
\dr| ) M(s/2)'T(s) (26.8.6) 


_ = k . 
or(1— s) = ér(s) (s r- s)/2)'Td — s)° 
26.8.7. Using the functional equation (26.8.6), we can rewrite (26.2.4) to obtain the 
tidier expression 


CEO) = lim 77+ DeO) = (26.8.8) 


hr Regr | 

WF > 
in particular, Cr has a zero at s = 0 of order r + c — 1, the rank of the unit group of R 
by Dirichlet’s unit theorem. 

In terms of the completed Dedekind zeta function, we find p(s) has analytic 
continuation to C\{0, 1} with simple poles at s = 0, 1 and residues 
27+ hpRegp 

WF ` 


ép (0) = & (1) = (26.8.9) 


Example 26.8.10. When F is an imaginary quadratic field (r = 0 and c = 1) we have 
Reg, = 1 and ¢7,(0) = ¢r (0), so hr/wr = ¢r(0), and in particular if |dF | > 4 then 
hr = 2¢r (0). 


We now turn to a quaternionic generalization. 


26.8.11. The factorization of Zg(s) in Theorem 26.4.9 implies a functional equation 
for g(s) via the functional equation for ¢p (s). This functional equation is simplest to 
state for a completed zeta function. We recall that 


fn(s) = r2) 2s- 1)] | (1 = Nm(p)!=>) ; 


plD 
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just as the function p completes to p with a simple functional equation, we analo- 
gously complete Zp to 


Ex(s) = Er(2s)Er(28 — 1) | [Nm (1 - Nm) ™) | ]@s-1) 26.8.12 


plD ved 


where Q C Ram B be the set of real, ramified places in B. The definition (26.8.12) is 
motivated by the simplicity of the functional equation; see also Remark 26.8.15 below. 
Written in a different way, 


&n(s) = (22)! (Ide? Nm(D)°)*7Tp(s)Za(5) (26.8.13) 
where ż is the number of split real places, so that #PI(F) = #Q + t, 
Ta(s) := Tp(2s)’Tp(2s + 1)" TRS — 1)'Te(2s)°Te(2s — 1)°, (26.8.14) 
and we have used the formula 
(2s — 1) p(2s — 1) = Xs — 1/2)n-@5-P/7T(5 — 1/2) 
= (Qm)a P+ DPT (5 + 1/2) 
= 2nTp(2s + 1). 


Remark 26.8.15. The completion factors (26.8.12) are not arbitrarily chosen; they have 
a natural interpretation from an idelic perspective. Perhaps this serves as a motivation 
for working idelically: namely, that it helps to nail down these kinds of quantities! For 
more, see section 29.8. 


Some properties can be read off easily from (26.8.12). 


Proposition 26.8.16 (Analytic continuation, functional equation). Let m = #Ram B. 
Then the following statements hold. 


(a) g(s) has meromorphic continuation to C and is holomorphic in C\{0, 1/2, 1} 
with simple poles at s = 0, 1. 


(b) g(s) satisfies the functional equation 
&a(1 — s) = (-1)""Ez(5). (26.8.17) 


(c) €g(s) has a pole of order 2 — m at s = 1/2; in particular, ifm > 2 then p(s) is 
holomorphic at s = 1/2. 


Proof. Statement (a) follows from (26.8.12), recalling that p(s) is holomorphic in 
C\x{0, 1} by 26.8.7 with simple poles at s = 0, 1. Part (c) follows similarly from (b), 
since the other factors in (26.8.12) have a simple zero at s = 1/2. 

To prove (b), we consider each term in the definition of (26.8.12). The functional 
equation (26.8.5) for ér (s) with s — 1 — s implies 


&r(2(1 — s))€r(20. — s) - 1) = r (2 — 25)Er (1 — 2s) 
= ép (1 - (2 - 2s))\ér (1 — (1 - 2s)) (26.8.18) 
= r (2s — lép (2s). 
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For 
Es) = gl = g”) = q° _ a 


and q > 0 we have £(1 — s) = —€(s), so with q = Nm(p) the factors p | © are taken into 
account. Finally, 2(1 — s) — 1 = —(2s — 1) takes care of v € Q, and (b) follows. o 


Proposition 26.8.16 shows how algebraic properties of B correspond to analytic 
properties of £g. A deeper investigation ultimately will reveal the following fundamen- 
tal result. 


Theorem 26.8.19 (Sign of functional equation, holomorphicity). g(s) satisfies the 
functional equation 


éB(l - s) = €n(s). (26.8.20) 


Moreover, if B is a division algebra, then g(s) is holomorphic at s = 1/2. 


Proof. This theorem was proven by Hey [Hey29, §3] (more generally, for division 
algebras over Q) following the same general script as in the proof of the functional 
equation for the Dedekind zeta function (26.8.5), as proven first by Hecke: the key 
ingredient is Poisson summation. The argument is also given by Eichler [Eic38a, Part 
V]. We instead prove this theorem in the language of ideles (Main Theorem 29.2.6), 
as it simplifies the calculations—and so for continuity of ideas in the exposition, we 
borrow from the future. o 


Assuming Theorem 26.8.19, we can now deduce the main classification theorem 
(Main Theorem 14.6.1) for quaternion algebras over number fields. First, we have 
Hilbert reciprocity as an immediate consequence. 


Corollary 26.8.21 (Hilbert reciprocity, cf. Corollary 14.6.2). # Ram B is even. 


Proof. Immediate from (26.8.17) and (26.8.20). oO 
Next we conclude the all-important local-global principle. 


Corollary 26.8.22. We have B ~ Mo(F) if and only if By ~ Mo(F,) for all (but one) 
places v € PIF. 


Proof. The implication (=) is immediate. For the converse (<), by Proposition 
26.8.16(c), g(s) has a pole of order 2 — m at s = 1/2, so if m < 1 then &g(s) is 
not holomorphic at s = 1/2; but then by Theorem 26.8.19, B is not a division algebra, 
so B ~ M2(F) (and the order of pole is necessarily 2, and B, ~ M2(F,) for all v). oO 


From this corollary, we are able to deduce the Hasse norm theorem for quadratic 
extensions. 


Theorem 26.8.23 (Hasse norm theorem). Let K 2 F be a separable quadratic field 
extension and let b € F*. Then b € Nmx /F(K%) if and only if b € Nmx, jr, (K%) for 
all (but one) places v € PIF. 
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Proof. Consider the quaternion algebra B = (K, b | F). Then by Main Theorem 5.4.4, 
we have b € Nmgjr(K*) if and only if B ~ Mo(F). By Corollary 26.8.22, this holds if 
and only if B, ~ Mo(F,) for all (but one) places v. Repeating the application of Main 
Theorem 5.4.4, this holds if and only if B, ~ M2(F,) for all (but one) v. o 


We may similarly conclude the all important local-global principle for quadratic 
forms. 


Theorem 26.8.24 (Hasse—Minkowski theorem). Let Q be a quadratic form over F. 
Then Q is isotropic over F if and only if Q, is isotropic over F, for all places v of F. 


Proof. The implication (=) is immediate, so we prove (=). We may suppose without 
loss of generality that Q is nondegenerate. If n = dimp V = 1, the theorem is vacuous. 

Suppose n = 2. Then after scaling we may suppose Q = (1, —a), and Q is isotropic 
if and only if a is a square. Suppose for purposes of contradiction that K = F(Va) 
is a field. Since Q, is isotropic for all v, we have K, = F, x F, for all v, and thus 
bx(s) = fr(s). But as Dedekind zeta functions, both @r(s) and ¢x(s) have poles 
of order | at s = 1 (we evaluated the residue in the analytic class number formula, 
Theorem 26.2.3), a contradiction. 

Suppose n = 3. Again after rescaling we may suppose Q = (1, —a, —b), and Q is 
isotropic if and only if b is a norm from F[va]: then the equivalence follows from 
Theorem 26.8.23. 

Next, suppose n = 4, and Q = (1, —a, —b, c}. Let K = F(Vabc). By extension, Q is 
isotropic over K and all of its completions. But now Q ~ (1, —a, —b, ab) over K. Let 
B = (a,b | K). Then by Main Theorem 5.4.4, we have By ~ M2(K,,) for all w; thus 
by Corollary 26.8.22 we have B ~ M2(K), so K splits B. By 5.4.7, we have K => B, 
so there exist x, y, z € F such that 


nrd(a) = nrd(xi + yj + zij) = —ax’ — by? + abz? = -abc; 


dividing by ab we have z- a(y/a) — b(x/b)? +c = 0 so Q(z, y/a, x/b, 1) = 0. 
Finally, when n > 5, we make an argument like at the end of proof of Theorem 
14.3.3: we follow Lam [Lam2005, Theorem VI.3.8] and Milne [Milne-CFT, Theorem 
VIII.3.5(b)], but we are brief. Write Q = Qı L Q2 where Q; = (a, b} and dimp V2 > 3. 
Choosing a ternary subform and looking at its quaternion algebra, we find a finite set 
T C PIF such that Q3 is isotropic for all v ¢ T. For each v € T, let Q(z,) = 0 and 
let cy = Q\(zy) = —Q2(z,). Choose x,y € F* close enough so that z = Q(x) has 
zzy € FX?. The form Q’ = (c) L Qo in n — 1 variables is isotropic for all v: for v € T 
this was arranged, and for v ¢ T already Q2 was isotropic at v. By induction on n, 
we conclude that Q’ is isotropic; diagonalizing, we may write Q = (d) L Q’, and it 
follows that Q is isotropic. o 


Corollary 26.8.25. Let Q, Q’ be quadratic forms over F in the same number of vari- 
ables. Then Q ~ Q’ if and only if Qy ~ Q}, for all places v € PIF. 


Proof. Apply the same method of proof as in Corollary 14.3.7: see Exercise 26.10. 
o 


We may now conclude the classification with one further input. 
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Theorem 26.8.26 (Infinitude of primes in arithmetic progression over number fields). 
Letn C Zr be a nonzero ideal, let a € (Zr /n)*, and for each v | œ real let 6, € {+1}. 
Then there are infinitely many prime elements p € Zr such that sgn(v(p)) = & and 
P =a (mod n). 


Proof. The theorem generalizes Dirichlet’s theorem on the infinitude of primes in 
arithmetic progression (Theorem 14.2.9): see Lang [Lang94, Theorem VIII.4.10] or 
Neukirch [Neu99, Theorem VII.13.4]. oO 


Proof of Main Theorem 14.6.1, F a number field. First, the map B œ> RamB has 
the correct codomain by Hilbert reciprocity (Corollary 26.8.21). Surjectivity follows 
by Exercise 14.17 (using Theorem 26.8.26. To conclude, we show injectivity. We 
refer to Corollary 5.2.6, giving a bijection between quaternion algebras over F up to 
isomorphism and ternary quadratic forms of discriminant 1 up to isometry; and we 
recall Theorem 12.3.4, that (rescaling) there is a unique anisotropic ternary quadratic 
form of discriminant | up to isometry. Therefore Corollary 26.8.25 implies that the 
map B +> Ram B is injective, since the set Ram B records those places v where the 
ternary quadratic form attached to B is anisotropic. o 


We will give another proof of Main Theorem 14.6.1 over global fields using 
the characterization of idelic norms in Proposition 27.5.15 (avoiding fiddling with 
quadratic forms and the use of primes in arithmetic progression). 


Remark 26.8.27. For the readers who accept the fundamental exact sequence of class 
field theory as in Remark 14.6.10, the arguments above can be run in reverse, and the 
analytic statement in Theorem 26.8.19 can be deduced as a consequence. 


Exercises 


1. Prove Proposition 26.2.18 that 
2" (27) Regp 


WF 


vol(X<1) = 


in the special case of a real quadratic field. 


> 2. Let co j 
I(s) := f er Z 
0 X 


be the complex I-function, defined for Res > 0. Verify the following basic 
properties of I (s). 


(a) T(1) = 1 and (1/2) = y7. 

(b) T'(s + 1) = sI(s) for all Re s > 0, and T(n) = (n — 1)! forn > 1. 

(c) I'(s) has meromorphic continuation to C, holomorphic away from simple 

poles at Zo. 
(d) T(s) has no zeros in C. 
> 3. In this exercise, we prove Proposition 26.2.18. Let F be a number field with ring 

of integers R, let X C Fp be a cone fundamental domain for R*. Let Reg; be 
the regulator of F and wp the number of roots of unity in F. 
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(a) Let V = Fig be the ambient space. Let u(R*) < R* be the group generated 
by a fundamental system of units, so R*/p(R*) =~ RX... Show that 


vol((V/R*)<1) = — [ dx dz 
WF JV /u(R*) 
with x;, z; standard coordinates on R” x C° in multi-index notation. 
(b) Let p;,6; be polar coordinates on C°, and restrict the domain V to the 
domain V* with x; > 0 for all i. Let W* be the projection of V* onto the 
x, p-coordinate plane and let x,+; = Pj: Show that 


J dx dz = ae | dx. 
V<ı/u(R*) Ws! /u(R*) 


(c) Apply the change of variables u; = log x; to obtain 


I d= f du 
WA /H(R*) P 


where P is the fundamental parallelogram for the additive (logarithmic) 
action of R*. Conclude that 


2” (2x) Reg 


WF 


vol(X<1) = 


> 4. Show that the ideal J in Lemma 26.3.6 is unique: more specifically, show that 
if J is an invertible, integral lattice and suppose that nrd(J) = mn with m,n Cc R 
coprime ideals, then an invertible, integral lattice J such that J is compatible 
with J~! with 7J~! integral and nrd(J) = m is unique. 

> 5. Let F be a number field with ring of integers R, let B be a quaternion algebra 
over F, and let O C O’ C B be R-orders. For J C O a right O-ideal, we define 
the right O’-ideal p(J) = IO’ c O’ obtained by extension. 


(a) Show that p induces a (well-defined) surjective map 
Cls O > Cls O’ 
W => 0") 


of pointed sets with finite fibers. 
(b) For the case where O is the Lipschitz order and O’ the Hurwitz order, show 
that the map in (a) is a bijection (cf. Lemma 11.2.9). 


> 6. Let R be a DVR with uniformizer mz and let J be a (invertible) integral right 
M)2(R)-ideal. Show that J is generated by 


where u,v € Zso and c € R/x” are unique. 
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7. Generalize Exercise 11.13 as follows. For n € Z, let 


ra(n) := #{(t, x, y, Z) € LP 4x + y? +z =n} 
and let rj(m) := r4(n)/8. 

(a) Show that rj(2°) = 1 for all e > 1 and r,(p*) = 1 +p +--+- + p® for all 
e > l and p odd. [Hint: relate the count to the number of right ideals and 
inspect the coefficients of the zeta function. ] 

(b) Show that r} is a multiplicative function: rį(mn) = ry(m)ry(n) if 


gcd(m, n) = 1. 
(c) Conclude that 


8 Zan d, n odd; 
ra(n) = . 
24 alma, n= 2°m even with m odd. 


> 8. Finish the proof of Lemma 26.6.7 by checking the remaining cases. 


9. Prove (26.7.2). Specifically, for a number field F and coprime ideals D, 9t with 
D squarefree and coprime to Wt, define the mass 


2¢r (2) 
(27)?" 
Let N := DM. Show for fixed n that 


mF, D,M) := dhe g@w(M). 


3 
log m(F, D, IN) ~ z logdr + log hr + log Nm() (26.8.28) 


as dr Nm(t) > ov, as follows. 
(a) Show that 


1 -n 1 -1 
a" =| | (! - =| <<] | (: - =| = oln) 
p P p P 
so F (2) x 1. 
(b) Show that 
Nm(N) 


log log Nm(D) < PDM) « NMN) (log log Nm(MN)) . 


[Hint: you may need some elementary estimates from analytic number 
theory, adapted for this purpose; you may wish to start with the case 
F=Q.] 

(c) Conclude (26.8.28). 


10. Prove Corollary 26.8.25: if Q, Q’ are quadratic forms over F in the same number 
of variables, then Q =~ Q’ if and only if Q, =~ Q!, for all places v € P1 F. [Hint: 
see Corollary 14.3.7.] 
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> 11. Use Dirichlet’s analytic class number formula to prove the theorem on arithmetic 
progressions (Theorem 14.2.9) as follows. 


(a) Let F = Q(G,,). Show that 
ér(s) = £9) | [ Lo.) 


x+l 


where y runs over all nontrivial Dirichlet characters y: (Z/mZ)* — C*, 
and 
x(n) 
L o—., 
(s, xX) 2, = 


[Hint: Factor according to the decomposition m = ref.] 

(b) Use partial summation and the fact that the partial sums are bounded to 
show that each L(s, y) for y + 1 is holomorphic at s = 1. 

(c) Conclude from the analytic class number formula that L(1, y) + 0 for 


X #1. 
(d) For gcd(a, m) = 1, using (c) show that as s N 1 that 
1 
-s _ log e(s) +00) 
m) 


p=a (mod m) 
and conclude that the set of primes p with p = a (mod m) is infinite. 


> 12. Let F be a nonarchimedean local field with valuation ring R having maximal 
ideal p and residue field k of size q := #k. For n > 1, let B, := M, (F) and 
On := M, (R). In this exercise we generalize Lemma 26.4.1 (b). 


(a) Show that the set of right integral O,,-ideals is in bijection with the set 


ql 0 0 prs 0 

cC n” 0 aor 0 

c31 c32 WB >- O : u1,- . -, Un € Z>0, cij € R/p! 
0 

Cni Cm Cm ott oe 


[Hint: appeal to the theory of elementary divisors, applying column oper- 
ations acting on the right.] 
(b) Let ape (On) be the number of right ideals of reduced norm p° in O. Show 
p 8 p 
that 


OD = Š aO ogen. 
F=0 
(c) Let 
1 
moe, ai 


IcO, 


the sum over nonzero right ideals of On. Show that ćo,(s) = ér (s) = 
(= gyi, 
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(d) Show for n > 2 that 


o0 > On 
AORDI i ? = to, nsin- 1))o,(ns = (1—1). 


nes 7 
e=0 


(e) Conclude that 


n-1 
Co, (s) = ér (ns)čr (ns — 1) ++: ¢r (ns —(n-1)) = I] Cr (ns — i) 


i=0 
(cf. Corollary 26.4.7). 
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Chapter 27 


Adelic framework 


We have already seen that the local-global dictionary is a powerful tool in understanding 
the arithmetic of quaternion algebras. In this section, we formalize this connection by 
consideration of adeles and ideles. 

The basic idea: we want to consider all of the completions of a global field at once. 
There are at least two benefits to this approach: 


e We will gain notational efficiency, resulting in brief and well-behaved proofs 
that would be difficult or impossible to state clearly in classical language. 


e Each completion is a locally compact field and so amenable to harmonic analysis, 
and by extension to the adele ring and its group of units, we can do harmonic 
analysis on global objects. 


The adelic framework, and its use in class field theory, is a vast topic whose complete 
development deserves its own book. We do our best in this chapter to develop this 
notation and state what is needed for the case of quaternion algebras. For further 
background reading, see Childress [Chi2009] and the references given at the start of 
section 27.4 


27.1 > The rational adele ring 


In this first section, we work purely over Q to give a concrete flavor to the abstract 
definitions to come. 


27.1.1. Recall in section 12.1 that for a prime p we defined Z, = lim Z/p"Z as a 
projective limit, and each Z, is compact. We can package these together to make the 
direct product ring 
Z = ial Zp (27.1.2) 
P 


equipped with the product topology: as a profinite group, it is Hausdorff, compact, and 
totally disconnected. 
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We can see Z itself as projective limit as follows. By the Sun Zi theorem (CRT) 
theorem, we have an isomorphism 
Z= | [imz Z> lim Z/nZ 
P r n 


of topological rings, with the projective limit indexed by positive integers partially 
ordered under divisibility; so under this isomorphism, we may identify 


Z= lim Z/nZ 
i (27.1.3) 
= faa) € []P_,Z/nZ : am = an (mod n) for all n | m} 


The natural ring homomorphism Z > Z which takes every element to its reduction 
modulo n is injective; the image of Z is discrete and dense in Z again by the CRT. One 
warning is due: Z is not a domain. 


27.1.4. We now make the ring Z a bit bigger so that it contains Q as a subring. If 
we were to take the ring [],, Qp, a product of locally compact rings, unfortunately 
we would no longer have something that is locally compact (see Exercise 27.1): the 
product [],, Qp is much too big, allowing denominators in every component, whereas 
the image of Q will only have denominators in finitely many positions. We should also 
keep track of archimedean information at the same time. 


With these in mind we define, for each finite set S of primes, the ring 
Us =Rx]]Q x | |z» (27.1.5) 
pes p¢s 


equipped with the product topology, so that Us is locally compact. For example, 
Up = RXZ. (27.1.6) 


To assemble these rings together, allowing more denominators and arbitrarily large 
sets S, we take the injective limit of Us under the natural directed system Us — Ugy 
for S C S’. The resulting object is the restricted direct product of Q, relative to Zp 
and is called the adele ring Q of Q: 


@=Rx[[@=[][@ 


(27.1.7) 


:= Rx {x = (xp)p € []pQp : Xp € Zp for all but finitely many p} 
= {x = (xy) € [LO : lxvly < 1 for all but finitely many v} 
We declare the sets Us C Q with the product topology to be open in Q; and with 
this basis of open neighborhoods of 0 (open in Us for some S), we have given Q the 


structure of a topological ring. The sets Us C Q are also closed. Note that the topology 
on Q c [],, Q, is not the subspace topology. 
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27.1.8. For a finite set S, we write 


Q=] (27.1.9) 


vés 


for the projection of Q onto the factors away from S. We also write 


Qs := | [o- (27.1.10) 


ves 


We embed each of these into Q extending by zero and identify them with their images, 
so that Q = Qs x Qs. 


Remark 27.1.11. Our notation Q for the adele ring returns to the notation of Weil 
[Weil82] but is not standard; more typically, the adele ring is denoted A (which we 
find markedly problematic). 


Remark 27.1.12. Although Qs and Qs are rings and (via projection) are naturally 
quotient rings of Q, they are not subrings because they do not contain 1. This subtlety 
should cause no confusion in what follows (especially because we will be focused on 
the multiplicative case and working with groups, where there is no issue extending by 
the multiplicative identity 1). 


We have a natural continuous embedding Q ~ Q, for all v € PIQ and this 
extends to a diagonal embedding Q > Q. 


Lemma 27.1.13. The diagonal embedding Q — Q is a continuous injective ring 
homomorphism and the image is closed and discrete as a subring of Q. 


Proof. The embedding is continuous because it is so in each component. Because Q 
is a topological group under addition, to prove the remaining part it is enough to find 
a neighborhood 0 € U C Q such that U N Q = {0}. We take 


U := (-1,1)x Z = {aay : [Xcoloo < 1 and |xp|p < 1 for all primes p}. 


By definition, U is open in Q as it is open in Ug (for a reminder, see (27.1.6)). And if 
a € U N Q, then a € Zp for all p, soa € Z, and |a|.. < 1, and thus a = 0. Oo 


Lemma 27.1.14. The image of Q => Q is cocompact, i.e., Q/Q is compact. 


Proof. Let W := [0,1] x Z. Then W is compact. By strong approximation—for a 
snapshot review, flip ahead to Theorem 28.1.9 and its corollary—we have Q = Q + W. 
Therefore the continuous quotient map Q — Q/Q restricted to W is surjective, so 
Q/Q equal to the image of the compact set W is compact. (W is a fundamental set for 
the action of Q on Q: see Exercise 27.2.) o 


27.1.15. The proof of Lemma 27.1.14 shows that the natural map R x Z > Q/Q is 
surjective; its kernel is Z diagonally embedded, so we have an isomorphism 


Q/Q > (Rx Z/Z 
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of topological groups. The resulting topological group Sol := (R x Z)/Z is called a 
solenoid: it is compact, Hausdorff, connected, but not path-connected (Exercise 27.5), 
which can be visualized as in Figure 27.1.16. 


Figure 27.1.16: The solenoid 


Very often, we will want to tease apart the nonarchimedean and archimedean parts 
of the adele ring Q, and will write 


Q := Qj = I] Q, =~ ZezQ (27.1.17) 
P 
so that extending by zero we have Q = Q xR. 


27.2 > The rational idele group 


Having dealt with the additive version in adeles, now we talk about the multiplicative 
version, ideles. 


27.2.1. We define the rational idele group to be 


Q* = R* x BEY = R* x Q* 
p (27.2.2) 
= {x = (xy) € [I QX : |x,| = 1 for all but finitely many v} 
That is to say, Qe is the restricted direct product of the spaces Q% with respect to ZX. 
The topology is such that for S a finite set of primes, the set 


Vs = R* x | [Q x | |z% 
pES ps 
is open and closed as a subgroup of Qx. 
Remark 27.2.3. Chevalley first used the name élément idéal for elements of Q*, but at 


Hasse’s suggestion he abbreviated it to idéle; the name adéle was then shorthand for 
an “additive idele”. Anglifying, we drop the accents on these words. 
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27.2.4. The topology on Q” is not the subspace topology Q* C Q, because inversion 
need not be continuous. Instead, we think of Q” as a subset of Q x Q via the map 


x + (x,x7!), and then Q* inherits the structure of a topological group. 


Lemma 27.2.5. The diagonal map Q* — Q* is an injective continuous group homo- 
morphism, and the image of Q* is closed and discrete. 


Proof. Since Qis closed and discrete in Q and Q* C Qx Q has the subspace topology, 
so too is Q% closed and discrete. o 


27.2.6. We now give an explicit description of the quotient QX /Q*: we will see it is 
not compact. 
There is a canonical isomorphism of topological groups 


QX = (p) x ZX 
by p-adic valuation. Since (p) = p” = Z, we have a topological group isomorphism 


Q* = R* x [To = {#1} x R>o x | [Z x Dz. (21.2.7) 
P P P 


A direct sum appears because an element of the restricted direct product is a p-adic 
unit for all but finitely many p. We project Q* onto the product of the first and last 
factor, getting a continuous surjective map 


QW > tt x Gz. (27.2.8) 
p 


Looking at r € Q* c QX, if we write r = ell, p”®, where e € {+1} and 
n(p) = ordp(r), then r > (e, (n(p))p) in the projection. Therefore Q* is canonically 
identified with {+1} x p, Z in Q. So the projection map (27.2.8) restricts to an 
isomorphism on the diagonally embedded Q*. Therefore 


Q* = Q* x Reo x | [Z%. (27.2.9) 
p 


By the logarithm map, there is an isomorphism R>»ọo ~ R, so 


Q* = Q* x R x Z* (27.2.10) 


and we have an isomorphism of topological groups 
Q*/Q* = R x Z*. (27.2.11) 


(This is not a solenoid!) = 7 
In a similar way, we see that Q*/Q*, = Z*, where Q*®, = {x € Q : x > 0}, and so 


Q*/Q* is compact. 
Remark 27.2.12. In 27.2.6 we used that Z is a UFD and Z* = {+1}; for a general 


number field, we face problems associated with units and the class group of the field, 
and the relevant exact sequences will not split! 
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27.3 > Rational quaternionic adeles and ideles 


In the remainder of this chapter, we generalize the above construction to the adele ring 
and idele group of a global field and then a quaternion algebra over a global field. For 
the reader on a brisk read, in this section we briefly consider the constructions for a 
quaternion algebra over Q. 

Let B be a quaternion algebra over Q, and let O C B be an order. 


27.3.1. The adele ring of B is 
= Bi B, = {a = (œv) € TyepigBy : ay € O, for all but finitely many v}, 


(27.3.2) 
the restricted direct product of the topological rings B, with respect to O, for places v 
of Q; this definition is independent of the choice of order O, because any two orders are 
equal at all but finitely many places by the local-global dictionary for lattices (Theorem 
9.4.9). We embed B —> B diagonally: the image is discrete, closed, and cocompact 
since the same is true for Q — Q (Lemmas 27.1.13—27.1.14). We write 


B:= IT Bp, (27.3.3) 
P 


so extending by zero we may identify B = B x Boo. We also define 


O:= I] Op CB. (27.3.4) 


The idele group of B is B* := []{, BX, the restricted direct product of the topo- 
logical groups B* with sie to O*; we similarly define 


=e; < BX := IT By (27.3.5) 


Working adelically is notationally quite convenient, as the following lemma illus- 
trates (Lemma 27.6.8 for F = Q). 
Lemma 27.3.6. The set of invertible right fractional O-ideals is in bijection with 
BX /O* via the map I > @O%, where In = @pOp and Q = (Qp)p; this map induces a 
bijection 

Clsr O © B*\B*/O* 
Ur > BXaO™. 

The most fundamental result in this chapter is the following (see Main Theorem 
27.6.14, taking F = Q). 
Theorem 27.3.7. Let B be a division quaternion algebra over Q. Then B* < B* is 
cocompact and the set B*\B* /O* is finite. 

In particular, combining Lemma 27.3.6 and Theorem 27.3.7, we conclude that the 


class set of O is finite, something we proved earlier using the geometry of numbers for 
B definite in section 17.5. 
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27.4 Adeles and ideles 


In pursuit of more generality, we now repeat the constructions of adeles and ideles over 
a global field. For further reference, see e.g. Cassels [Cas2010], Goldstein [Gol71], 
Knapp [Kna2016, Chapter VI], Lang [Lang94, Chapter VII], Neukirch [Neu99, Chapter 
VI], or Ramakrishnan—Valenza [RM99, Chapter 5]. 

Throughout the rest of this chapter, let F be a global field. 


27.4.1. We recall notation from section 14.4 for convenience. The set of places of F is 
denoted PI F. For a place v of F, we denote by F, the completion of F at the place v, 
with preferred (normalized) absolute value ||, so that the product formula holds in F: 
see 14.4.12. If v is nonarchimedean, we let 


R, := {x € F : |x < 1} = {x € F, : v(x) 2 0} (27.4.2) 


be the valuation ring of Fp, where we write v also for the discrete valuation associated 
to the place v. If F is a number field, we will sometimes denote an archimedean place 
by writing v | œ, and for an archimedean place we just take R, = F,. A set SC PIF 
of places is eligible if it is finite, nonempty, and contains all archimedean places. 


27.4.3. The adele ring of F is the restricted direct product of F, with respect to R,: 


F=||4 
: 


= {(xy)y € [I5 : x € R, for all but finitely many v} (21.44) 


= KED c ILA :lxvly < 1 for all but finitely many v} 


with the restricted direct product topology. The topology is uniquely characterized 
(as a topological ring) by the condition that R := [],, R, (with the product topology) 
is open. Accordingly, a subset U C F is open if and only if for all a € F, the set 
(a+ U)NA ]], R is open in the product topology. 


Giving F the discrete topology, we have an isomorphism of topological rings 
Q gQ F — F: see Exercise 27.14. 


27.4.5. We embed F C F under the product of the embeddings F = F,, i.e., by 
x }> (x),; this map is well-defined because |x|, < 1 for all but finitely many places v 
of F. The image of F in F has the discrete topology and is closed in F; the quotient 
F/F is compact (i.e., F is cocompact in F): for x € F, finda € F such that x,- ay € Ry 
for all v by the Sun Zi theorem (CRT). 


27.4.6. Let S C PIF be an eligible set. We will write 
Fs := Bi a Beffa (27.4.7) 
pes veS 


and extending by zero we identify these sets with their images in F, so that F = Fs x Fs; 
we call Fs the S-finite adele ring of F. 
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We pass now to the multiplicative situation. 


27.4.8. The idele group of F is the restricted direct product of F% with respect to R*: 


Pe | 
Vv 


= {(xy)v € [EFX : |x|, = 1 for all but finitely many v} . 


(27.4.9) 


27.4.10. The topology on F* (as a topological ring) is uniquely characterized by the 
condition that [],, R% (with the product topology) is open. Thus, U © F* is open if 
and only if for all a € F*, the set aU N [Į], RŽ is open in the product topology. 

Note that F* does not have the topology induced from being a subspace of F, 
since inversion is not a continuous operation. In general, if A is a topological ring, A% 
becomes a topological group when A” is given the relative topology from 


AX > AXA 
xR (x, x71). 
(See Exercise 27.13.) 
Just as F C F is discrete, F* C F* is also discrete. 
Definition 27.4.11. The group Cr := F*/F* is the idele class group of F. 
The justification for calling this the idele class group is given in section 27.5. 


27.4.12. As above, if S c PI F is an eligible set, we define the S-finite ideles 
Ri l px 
Fs P I] Fy 
vés 
which is missing the product at the places in S (a finite product), namely 
Fé := I] Fx; 
veS 
extending by 1, we identify these with their images in F*, so that F* = (F5)* x F. 
27.4.13. With respect to the normalized absolute values 14.4.12, we have a natural 


map 
F* > Ro 


(vO | [lv (27.4.14) 


When F is a number field, the map (27.4.14) is surjective; when F is a function field 
with constant field F4, the image is q”, the cyclic subgroup of Rs 9 generated by q. Let 


FO i= {x = (wv Mylly = 1} (27.4.15) 
so that FO is the kernel of (27.4.14). Then F* < FO by the product formula (14.4.6). 
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The following theorem is fundamental. 
Theorem 27.4.16. The quotient F“ /F* is compact, i.e., F* is cocompact in F®. 


Proof. We give a proof in a more general context in Main Theorem 27.6.14 below. Or 
see e.g. Cassels [Cas2010, §16, p. 69] for a direct proof. o 


Theorem 27.4.16 is equivalent (!) to the Dirichlet unit theorem and the finiteness 
of the class group in the number field case, and finite generation of the unit group of a 
coordinate ring of a curve and the finiteness of the group of rational divisors of degree 
zero in the function field case [Cas2010, §$17-18]. 


Via the projection map F® > F 3° we have F* cocompact also in F 3° 


27.5 Class field theory 


Let F*? be a separable closure of F. In this chapter, we summarize the idelic approach 
to class field theory; unfortunately, we must omit most proofs, as a full treatment would 
require a lengthy development—but the reader who is willing to accept the statements 
should be able to digest what follows and will hopefully be motivated to dig deeper! 
For further reading, see Tate [Tate2010], Lang [Lang94, Chapters XI-XI], Neukirch 
[Neu99, Chapters IV-VI], or Janusz [Jan96, Chapter V]. 


27.5.1. Let R = Rs) be a global ring (the ring of S-integers) for the eligible set 
S C PIF. Then R is a Dedekind domain with field of fractions F. The class group of 
R admits an idelic description, embodying the definitions above, as follows. 

To simplify notation, throughout we abbreviate Fs = F, as we take the set S to be 
fixed. To an invertible fractional ideal a C F of R, we have ay = Ry for all but finitely 
many primes p, so we can consider its idelic image (ap)p S F under the product of 
completions. Since a is locally principal, we can write each dp = dyRp with ay € FX, 
well-defined up to an element of Res putting these together we obtain an element 


a = (ap )p € F* 
and = ae 
a=aR=aRCF. 
We recover a = 4M F from Lemmas 9.4.6 and 9.5.3. Therefore the group of invertible 
fractional ideals of R is canonically isomorphic to the quotient 


Id R = F*/R*. (27.5.2) 


The principal (invertible) fractional ideals correspond to the image of F™ in F>, 
Therefore there is a canonical isomorphism 


CIR © F*/R*F* 


e 27.5.3 
[a]  aR*F*. 
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27.5.4. Suppose F is a number field. If S consists of the set of archimedean places, 
then R = Zr is the ring of integers, and Cl R is the usual class group. For larger sets S, 
we have a natural quotient map Cl Zr — CIl R obtained by the quotient by the classes 
of primes p corresponding to nonarchimedean places in S. 

More generally, one may restrict (27.5.3) to the subgroup of principal fractional 
ideals which have a totally positive generator; we then obtain the narrow (or strict) 
S-class group 

CI+ R > FYR F”. (27.5.5) 


In idelic class field theory, it is often convenient to move between quotients of the 
finite ideles and quotients of the full idele group as follows. 


Lemma 27.5.6. Let Foo := I [vio Fy and let 


F% >0 := {(ay)y € F% : a > 0 for all v real}. 


Then the map 
F* => F* ~ F* x FX 


Boos (27.5.7) 
ab (a, 1) 
induces isomorphisms of topological groups 
F” /F* © F*/F*F 
(27.5.8) 


PPa PS sg 


Proof. Composing with the projection to F*/F*F% |, we see that the kernel is F% 
and that the induced map is surjective; similarly for the quotient by F*F. The few 
details are requested in Exercise 27.9. o 


Using Lemma 27.5.6, we have 
CIR © F*/(R*FXF*) 
CI R > PORE, sgl) 


27.5.9. Class field theory relates class groups to abelian extensions. For example, let 
Hoke rea | [Rx |< eee. 
vés vES 
Then H < CF is an open subgroup of finite index, and the projection map 
Cr/H > PYR 
is an isomorphism, which together with (27.5.3) gives an isomorphism to C1 R. So we 
are led to consider the finite-index open subgroups of CF. 


The main theorem of idelic class field theory for finite extensions is as follows. 
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Theorem 27.5.10. There is a bijection 


{K C FS? : K D F finite abelian} © {H < Cr : H finite-index open} 


se (27.5.11) 
K ® F” Nmk/F Cx 


together with functorial isomorphisms Cr /H — Gal(K | F). 
The map Cr/H æ Gal(K | F) is called the Artin isomorphism for H, K. 
Proof. See e.g. Tate [Tate2010, §5]. oO 


27.5.12. Rewriting the main theorem (Theorem 27.5.10) slightly, we see that if H < F* 
is an open finite-index subgroup containing F,, then there is a finite abelian extension 
K 2 F with the Artin isomorphism 


F*/H © Gal(K|F). 


27.5.13. Combining the surjections Cr — Gal(K | F), we obtain a continuous homo- 
morphism 
0: CF > lim Gal(K | F) = Gal(F® | F) 
K 

called the global Artin homomorphism, where F® c FP is the maximal abelian 
extension of F in FP. 

If F is a number field, then 8 is surjective; let Dr be the connected component of 
1 in Cr. Then DF is a closed subgroup with 


Dr = Rx (R/Z)* x Sol" +e! (27.5.14) 


(see Exercise 27.10). We therefore have an isomorphism Cp /DFf = Gal(F® |F). 
If F is a function field with finite constant field k, then 0 is injective, and 6(Cr) is the 
dense subgroup of automorphisms o € Gal(F' ab | F) whose restriction to Gal(k”! | k) = 


Z lies in Z, i.e., acts by an integer power of the Frobenius. See Tate [Tate2010, §5.4— 
5.7]. 


We conclude with a nice application to the classification of quaternion algebras. 


Proposition 27.5.15. Let £ C Pl F be a finite subset of noncomplex places of F of even 
cardinality. Then there exists a quaternion algebra B over F with Ram B = È. 


Proof. Let K 2 F be a separable quadratic extension that is inert (an unramified field 
extension) at every v € È: such an extension exists by Exercise 14.21. By the main 
theorem of class field theory, we have [Cr : F* Nmx;r Cx] = [K : F] = 2, where 
Cr = F*/F* and similarly Cx are idele class groups. Therefore 


[Cr : FX Nmg;r Cx] = [F* : FX Nmg;r(K%)] = 2 (27.5.16) 


as well. 

For each v € È, let m, be a uniformizer for R, and if v is real let m, = —1. Since 
K, 2 F, is an unramified field extension, we have my ¢ Nmg, /F,(K%). For v € Ł, let 
Ty = (1. +25 1,426) € E”, Then 7m, ¢ Nmg;r(K*). 
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We claim that my ¢ F* Nmx/(K*). Otherwise, there would exist a € F* such 
that am, € Nmx/r(K*), so a € Nmx,,,/F,,(K3;,) for all w’ | v’ with v’ + v, and 
a ¢ Nmr, jr, (Ky); but then a is a local norm at all but one real place, so by the 
Hasse norm theorem (Theorem 26.8.23), a € Nmx/r(K%) is a global norm, but this 
contradicts that a is not a local norm at v. 

Now let p = [ [peg my. Since #Z is even, by the previous paragraph and (27.5.16) 
we get ~ E 

p= bu e F* Nmx/F(K*). 
; , K,b 
Consider the quaternion algebra (2) For all places v € £, we have K, a field and 
b= myu}! ¢ Nmr, /r,(K%), so v € Ram B. At every other place v’ ¢ X, we have either 
that K, is not a field or b = u,! € Nmx, /r, (K~), and in either case v’ ¢ Ram B. oO 


27.6 Noncommutative adeles 


We retain notation from the previous section, in particular the abbreviations R = R,s) 
and Fs = F. Let B be a finite-dimensional simple algebra over the global field F. In 
this section, we extend idelic notions to B; the main case of interest is where B is a 
quaternion algebra over F. We recall the topology on B, for places v, discussed in 
section 13.5. Let O C B be an R-order. 


27.6.1. The adele ring of B is the restricted direct product of the topological rings B, 
with respect to O,: 


B:= Bi B, = {(ay)y €[], B, : œ € O, for all but finitely many v ¢ s} 
vePI(F) 


The topology on B (as a topological ring) is uniquely characterized by the property 
that the subring [],, O, is open with the product topology. 

By the local-global dictionary for lattices (Theorem 9.4.9), the definition of B is 
independent of the choice of order O and eligible set S (and base ring R = Ris). 


27.6.2. Just as in 27.4.5, we embed B —> B diagonally. A basis for B as an F-vector 
space shows that B is a free F-module of finite rank. Then, since the image F > F 
is discrete, closed, and cocompact, arguing in each coordinate (with respect to the 
chosen basis), we conclude that B — B is discrete, closed, and cocompact. (Details 
are requested in Exercise 27.11.) 


We now turn to the multiplicative structure, the main object of our concern. 


27.6.3. The idele group of B is the restricted direct product of the topological groups 
B* with respect to O%: 


B” i= IT B = {(ay)y € [], Bx : a, € O% for all but finitely many v} : 


equivalently, B% is the unit group of B with the topology as in 27.4.10. The topology on 


B* as a topological group is characterized by the condition that the subgroup [],, OX% 
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is open with the product topology. Again, B* is independent of the choice of O and 
eligible set S because any two such constructions differ at only finitely many places. 


27.6.4. The S-finite adele ring is 
B= [| Ba (27.6.5) 
vés 

extending by zero, we may identify Bg with its image in B. The S-finite adele ring has 
a compact open subring 

Os:= | [Ov £ Bs. (27.6.6) 

vés 

We similarly define the S-finite idele group with its compact open subgroup 


kalie ken (27.6.7) 


vés vés 


When no confusion can result (S is clear from context), we will drop the superscript 
and replace with hats, writing simply B = Bs and O% = O%, etc 


Just as in 27.5.1, the ideles provide a convenient way of encoding fractional ideals, 
as follows. 


Lemma 27.6.8. The set of locally principal, right fractional O-ideals is in bijection 
with B*/O* via the map I +> @O*, where Ip = QpOy and @ = (ap)p; this map induces 
a bijection 


Clsp O © B*\B*/O* 


= (27.6.9) 
Uap BXaO”. 


Proof. Let I bea locally principal right fractional O-ideal, so Ip = ap Oy for all primes 
p of R, with œp well-defined up to right multiplication by an element of OX, so to 7 


we associate (@pO})p = @O* e€ BY/O™. Conversely, given @ € B*/O* we recover 
I = GON B from Lemmas 9.4.6 and 9.5.3. 


The equivalence relation defining the (right) class set is given by left multiplication 
by B*, so the second statement follows. o 


Remark 27.6.10. We recall by Main Theorem 16.6.1 that for B a quaternion algebra, 
an R-lattice J C B is locally principal if and only if it is invertible. 


27.6.11. In analogy with 27.4.13, we have a natural multiplicative map 
IIl: BY > R>o 


a = (ay) H> | |Inrd(a,)|y (27.6.12) 


and we define 
BO = kerli || = {æ = (a), : II, lnrd(ay)h = 1} (27.6.13) 
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By the product formula (14.4.6), we have B* < B®. 
By comparison, we have also the groups 
B! = {a € B : nrd(a) = 1} 
B! = {a € B : nrd(a) = 1} 
satisfying B! < B! < B®. 


The following theorem is fundamental (see Fujisaki [Fuj58, Theorem 8.3], Weil 
[Weil82, Lemma 3.1.1]). 


Main Theorem 27.6.14 (Fujisaki’s lemma). Let B be a finite-dimensional division 
algebra over a global field F. Then the following statements hold. 


(a) BX < B is cocompact. 
(b) For any eligible set S, the subgroup B* < B% = B* is cocompact and the set 
B*\ B*/O* is finite. 


Proof. The natural place to prove this result is after some more serious analysis has 
been done—but it is too important to wait for this. The small amount of input needed, 
which can be seen as an (ineffective) idelic version of the Minkowski convex body 
theorem (Theorem 17.5.5), is as follows. There exists a compact subset E such that for 
all Be BY, 
the map B > B\B is not injective when restricted to BE. (27.6.15) 
For the proof of (27.6.15), see Exercise 29.11: in a nutshell, there is a measure u on B 
in which u(B\B) < co, anda compact E with u(E) satisfies 27.6.15, as u(BE) = u(E). 
We first quickly prove part (b), assuming part (a). We have Ox open in B*, so the 
open cover {B*@O*} can be reduced to a finite cover, whence the double coset 


space is finite. 
We now turn to prove (a), which we do in steps. 


@EBX 


Step 1: Setup. From the set E granted above, we define 
X:=E-E={n-71' : 7,7 € E}. (27.6.16) 


Since E is compact, we conclude that X C B is compact (subtraction is continuous). 
We will also use the set of products XX, which is again compact in B. 


Step 2: Measuring differences. We claim that for all 8 € B®, we have BX N BX + 0. 
Indeed, let 8 € B®. By (27.6.15), there exist distinct n, 7" € E (depending on £) such 
that 
BQ - 1) EB. (27.6.17) 
Because 7 # ņ and B is a division algebra (here is where we use this hypothesis), we 
have B(n — 7’) € BX, as claimed. 
A similar statement holds on the right, multiplying by B!. 


27.6. NONCOMMUTATIVE ADELES 471 


Step 3: Cocompactness. We claim that there exists a compact set K C B* x B* such 
that for all £ € B®, there exist 8 € B* and v € B such that £ = By and (y, y~ Nek. 
To prove this claim, first define T := B* N XX. Since T is the ilersecton of a 
discrete set and a compact set, we conclude that T is finite. Let T7! := {y7} : y € T} 
and let 
K :=T!XxX. (27.6.18) 
To check the claim, let £ € B®. By Step 2, BX OB™ + Oand (similarly) X67 "A BX #0. 
Therefore there exist y, v’ € X and £, p’ € ‘BX such that 


Bv=8 and vp =f". (27.6.19) 


Therefore 


B'B = (v'B')\(By) = v'v € B* N XX. 


Then v~! € T-!X, and T~!X is compact. We have shown that 6 = By"! with B € B* 
and (v~!,v) € K = T7!X x X, and this proves the claim. 


Step 4: Conclusion. By the definition of the topology on B*, the set 
eB”: Q, eK} 


is compact; then, by the claim in Step 3, this set surjects onto B*\B®. We conclude 
that B*\B® is compact, completing the proof of part (a). o 


Corollary 27.6.20. If B is a division algebra or B ~ M,(F), and O C B is an R-order, 
then the class set Clsp O is finite. 


Proof. If B is a division algebra, we combine Lemma 27.6.8 and Main Theorem 
27.6.14. Otherwise, B ~ M,,(F). If O is maximal then O =~ M,,(R) and the result 
follows from 17.3.7; and then the result for a general order O follows from Exercise 
17.3(b). o 


Remark 21.6.21. Corollary 27.6.20 covers all quaternion algebras B. This finiteness 
statement generalizes to the theorem of Jordan—Zassenhaus: see Remark 17.7.27. 


27.6.22. The idelic point of view (Lemma 27.6.8) also makes it clear why the class 
number is independent of the order within its genus (Lemma 17.4.11): the idelic 
description only depends on the local orders, up to isomorphism. 


We have two other objects that admit a nice idelic description. 


27.6.23. The genus of an order and its type set (see section 17.4) can be similarly 
described. Let O be an R-order, and let O’ € Gen O be an order in the genus of O. 
Then O’ is locally isomorphic to O, so there exists 7 € BX such that 7O! = O’, well 
defined up to right multiplication by an element of the normalizer Ngx (O). We recover 


O’ = O' N B, so this gives a bijection 


Gen O © BY/Nz.(O). 
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Two such orders are isomorphic if and only if there exists 6 € B* such that BOB"! = O’, 
so we have a bijection 
Typ O © B*\B*/N5.(O). (27.6.24) 


Corollary 27.6.25. The type set Typ O is finite. 


Proof. In view of (27.6.24), the double coset B*\B*/N Bx (O) is a further quotient of 
the set B*\B*/O* which is finite by Main Theorem 27.6.14. Oo 


27.6.26. Referring to section 18.5 locally, we see that the group of locally principal 
two-sided O-ideals Id1(O) is in bijection with 


O*\Ng.(O)/O* = Ng.(O)/O* = O%\N 5. (O) (27.6.27) 


where = pi SEREN 
Ngx(0) = {@ € BX: @0 = Oa} 


is the normalizer of O in B™. Furthermore, the group of isomorphism classes of locally 
principal two-sided O-ideals is therefore in bijection with 


Npx(O)\N5(O)/O* = Np.(O)/(Nax(O)O*) = (Nax(O)O*)\Np.(O). 


27.7 Reduced norms 


To conclude, we consider reduced norms; we specialize and suppose that B is a 
quaternion algebra. 


27.7.1. Since S contains all archimedean places, by the local norm calculation (Lemma 
13.4.9), we have nrd(B*) = F*, By the Hasse—Schilling theorem on norms (Main 
Theorem 14.7.4), we have nrd(B*%) = F% o where Q C RamB is the set of real 
ramified places and F% o is the set of elements positive at all v € Q (recalling 14.7.2). 
Therefore, the reduced norm (in each component) yields a surjective map 

nrd: B*\B*/O* > F* 


>a! 


ae? nrd(O*). (27.7.2) 


We will now see that the group de de] nrd(O*) in (27.7.2) is a certain class 
group of R. 


Lemma 27.7.3. The subgroup nrd(O*)FX o < FX isa finite-index open subgroup 
containing Ey 
then nrd(O*) = R*. 


the group of totally positive elements of F”. If moreover O is maximal, 


Proof. In Lemma 13.4.9, we saw that if Op is maximal, then nrd(O%) = RŠ; for 
the finitely many remaining p C R, the Rp-order Oy is of finite index in a maximal 
superorder, so nrd(O;) is a finite index open subgroup of Ry. Putting these together, 


we conclude nrd(O*) is a finite index open subgroup of R. 
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But F Y R* FZ o = Clo R is a finite group and therefore 


[F* : nrd(O*)FX 9] = LF* : RX FS [RX FZ o : nrd(O”)FX 9] < 0%. 


Q 


Finally, we have F* , > F% 


S90 = Foo as the latter possibly requires further positivity. o 


27.7.4. Let oo 
G(O) := FX nrd(O*) < F”. (27.7.5) 


From Lemma 27.7.3, G(O) is a finite-index open subgroup containing F,. By class 
field theory 27.5.12, there exists a class field K for G(O), i.e., there exists a finite 
abelian extension K 2 F and an Artin isomorphism 


Cle(o) R = F*/G(O) > Gal(K | F). (27.7.6) 


The group Clgo) R only depends on the genus of O: if O’ € Gen O then O’ is locally 
isomorphic to O, so there exists V € B* such that O’ = 7-107 so nrd(O”) = nrd(O*). 


Example 27.7.7. Suppose F is a number field and S is the set of archimedean places 
of F, so that R = Zr is the ring of integers in F. Suppose further that O is maximal. 
Then G(O) = F% oR”. Recalling 17.8.2, let Q be the set of ramified (necessarily real) 
archimedean places of B, and let Clo Zp := F> AF x/ R*, equivalently, Clo Zr is the 
group of fractional ideals of Zp modulo the subgroup of principal fractional ideals 
generated by elements in FY 0° Then Cleo) Zr = Clo Zr by definition, so we have a 
surjective map 
ord: Cls O > Cleo) Zr = Clo Zr. 


The two extreme cases: if B is unramified at all real places, then Q = Ø, and 
Clo Zr = Cl Zr is the class group; if B is ramified at all real places, then Clo Zr = 
Cl* Zr is the narrow class group. 


Remark 27.7.8. It is a fundamental result of Eichler (Theorem 17.8.3) that whenever 
there exists v € S such that B is unramified at v, then the reduced norm map (27.7.2) 
is injective, and hence bijective, giving a bijection between the class set of O and a 
certain class group of R. This topic is the main result in the next chapter. 


Exercises 


Unless otherwise specified, let F be a global field and let B be a quaternion algebra 
over F. 


1. If we take the direct product instead of the restricted direct product in the 
definition of the adele ring, we lose local compactness. More precisely, let 
{Xiker be a collection of nonempty topological spaces. Show that X = []j<7 Xi 
is locally compact if and only if each X; is locally compact and all but finitely 
many X; are compact. 

2. Review the language of group actions and fundamental sets (section 34.1). 
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a) Equip Q with the discrete topology. We have a group action Q U Q (by 
addition). Show that Zx [0,1] € Q x R is a fundamental set for this action. 
[Hint: Review the arguments in Lemmas 27.1.13—27.1.14.] 


b) Similarly, show that Q* OQ” and that Z*x Rso € Q x R is a fundamental 
set for this action. 


3. For a prime p, let P = (p,1,...,1,p,1,...) € Q be the adele which is equal to p 
in the pth and coth component and | elsewhere. 


a) Show that the sequence p, ranging over primes p, does not converge in Q*; 
conclude that Q” is not compact. 7 

b) However, show that the sequence p has a subsequence converging to the 
identity in the quotient Q*/Q*. 


4. Recall that Z = lim | Z/[nZ = |], Zp. 


(a) Prove that each @ € Z has a unique representation as @ = |”, Can! where 
Cn E€ Zand 0 < cy <n. 

(b) Prove that Z*= 2x II}; Z/nZ as profinite groups. /Hint: Consider the 
product of the p-adic logarithm maps and use the fact that for every prime 
power p° there are infinitely many primes q such that p° || (q — 1).] 

(c) Prove for every n € Z>o that the natural map Z/nZ > Z/nZ is an isomor- 
phism. 

(d) Prove that there is a bijection from Z>o to the set of open subgroups of Z 
mapping n œ> nZ. 


5. We recall from 27.1.15 the solenoid Sol = (R x ZZ (with Z embedded diago- 
nally, and given the quotient topology). 


(a) Prove that Sol is a compact, Hausdorff, and connected topological group. 

(b) Prove that Sol ~ Q/Q as topological groups. 

(c) Prove that Sol = lim | R/ 17, with respect to the directed system n € Zs, 
under divisibility. 

(d) Show that the group of path components of Sol is isomorphic to Z /Z, and 
conclude that Sol is not path connected. / Hint: the neutral path component 
is the image of {0} x R € ©) xR./ 


6. Recall that by definition, a set U C F is open if and only if U N Fs is open in Fs 
for all eligible S c Pl F. Show that U C F is open if and only if for all a € F 
that (a + U) NT], Ry is open in [],, Ry. 


7. Show that the topology on F* agrees with the subspace topology induced on 
FX œ F x F by x | (x, x7!). 
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8. Show that if Sis eligible and O C B is an Ris)-order, then 
B= La) € [I By : x € O, for all but finitely many v} 


and 
BX = {(xv) € [], BX : xy € O% for all but finitely many v} 


and therefore that this definition is independent of the choice of O (and S). 


> 9, Prove Lemma 27.5.6. 


10. Returning to 27.5.13, let Dr < Cr be the connected component of 1 in the idele 
class group of F. Show that Dr is a closed subgroup with 


Dr ~Rx(R/Z) x Sol”*! 


where r is the number of real places of F and c the number of complex places. 
Interpret this isomorphism explicitly for F a quadratic field for both F real and 
imaginary: what ‘explains’ the factors that appear? 


> 11. Let B be a finite-dimensional F-algebra. 


(a) Show that B is discrete and closed in B. [Hint: F is discrete in F by the 
product formula. ] 

(b) Show that B is cocompact in B (under the diagonal embedding), i.e., that 
B/B is compact. 


12. Give a fundamental system of neighborhoods of 0 in B and of 1 in B*. 
> 13. Let A be a topological ring. 


(a) Suppose that A* C A has the induced topology. Give an example to show 
that the map x + x7! on A* is not necessarily continuous. 


(b) Now embed 
AX >AXxA 
xR (x, x) 
and give A* the subspace topology. Show that A% in this topology is a 
topological group. 
> 14. Let Sc PIF be an eligible set. 


(a) Show that Rgs) is discrete in Fs = [ [yes Fy. [Hint: it is enough to show this 
for a neighborhood of 0, and then use the fact that the norm must be an 
integer. ] 

(b) Prove that if O is an Rs)-order in B, then O is discrete in Bs = [ [yes By. 


15. Let F be a global field and let K be a finite separable extension of F. 


(a) Show that K = F @f K. [Hint: Use the fact that F, 8f K = |], Kw where 
w runs over the places above v.] 
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(b) Show that 
K= {aww € [],Kw : INmx,,/F, Xvlv < 1 for almost all v} 
but that the inclusion 
B=B8F FC {æ € [[ By : |nrd(x,)|, < 1 for almost all v} 
is strict, so the corresponding statement is false for B. 


16. Let R = Ris) be a global ring and O be an R-order in B. Show that the set 
of R-orders which are connected to O is in bijection with BX [N gx (O), where 
N x(O) is the normalizer of O in BX. 


17. Extend Lemma 27.7.3 as follows: if O is an Eichler order, then nrd(O*) = RX. 
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Chapter 28 


Strong approximation 


28.1 > Beginnings 


We have already seen in several places in this book how theorems about quaternion 
algebras over global fields are often first investigated locally, and then a global result is 
recovered using some form of approximation. Approximation provides a way to transfer 
analytic properties (encoded in congruences or bounds) into global elements. In this 
chapter, we develop robust approximation theorems and investigate their arithmetic 
applications. 

We begin by reviewing weak and strong approximation over Q, taking a breath in 
preparation for the idelic efforts to come. 


28.1.1. The starting point is the Sun Zi theorem (CRT): given a finite, nonempty set 
S of primes, and for each p € S an exponent np € Zs; and an element x, € Z/p"” Z, 
there exists x € Z such that x = x, (mod p”») for all p € S. These congruences can 
be equivalently formulated in the p-adic metric by lifting to xp € Zp and asking for 
|x — xp| < p”? for p € S; or equivalently, the map Z > [],,<s Zp has dense image for 
any finite, nonempty set of primes S (giving the target the product topology). We may 
therefore think of the CRT as an approximation theorem, in the sense that it allows us 
to find an integer that simultaneously approximates a finite number of p-adic integers 
arbitrarily well. 


We may generalize 28.1.1 (recalling notation from section 27.1, in particular 27.1.8) 
as follows. 


Theorem 28.1.2 (Weak approximation). Let S C PI1Q be a finite, nonempty set of 
places of Q. Then the image of Q => Qs := [yes Q is dense. 


Proof. Let x, € Q, for each v € S, and let € > 0. We want to show 
there exists x € Q such that |x — x,|, < € forall v € S. (28.1.3) 


We proceed by considering increasingly more general cases; our intent is to com- 
municate (concrete, effective) meaning behind the symbols and to set us up to prove 
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strong approximation below; for a short, uniform proof (which reproves the CRT), see 
Exercise 28.1. 


Case 1. Suppose co ¢ Sand xp € Zp for all p € S. Then (28.1.3) holds by the CRT, as 
in 28.1.1; in fact, we have infinitely many x € Z for this purpose. 


Case 2. Suppose œ ¢ S, but xp € Qp for p € S. We employ continuity of multiplication 
to reduce to the previous case, as follows. We consider the least common denominator: 


d:= [ [pete € Zso. (28.1.4) 


pes 


Then dx, € Zp for all p € S. By the case just established by the CRT, there exists 
x’ € Z such that |x’ — dxp|p < eld|p for all p € S, so taking x := x’/d and dividing 
through we conclude that (28.1.3) holds. 


Case 3. To conclude, suppose co € S. We employ an additive translation: we find a 
rational number close to x» and add to it a small solution to the previous case, as 
follows. Since Q € R is dense, there exists y € Q such that |y — xo|o < €/2. Let 
Yp := Xp — y. From case 2, we find y’ € Q such that |y’ — yp|p < € for all p € S. By 
case 1, there exist infinitely many m € Z such that |1 — m|p < min(1,|y’ — yp|p) for all 
p € S; for such m, we have m = 1 (mod p) so |m|, = 1 and 


ly’ E MyYplp 


my 7+0- mpl =y -yplp <€ (28.1.5) 
p 


\(y’/m) = Yplp = 


by the ultrametric inequality. By choosing m large enough, we may ensure that 
ly’ /Mlo < €/2. Let x := y’/m + y. Then by (28.1.5) 


|x — Xplp = \(y’/m) +y- p + Wp = \(y’/m) = Yplp <E (28.1.6) 

and 
|x — Xcoloo < |X — Vloo + Ly — Xælæ = |y /Mlo + €/2 < € (28.1.7) 
proving (28.1.3). m 


Theorem 28.1.8 (Strong approximation). Let S C P1Q be a nonempty set of places. 
Then the image of Q > Q; := Iesv is dense. 


28.1.9. Written out in the standard basis of open sets, strong approximation is equiv- 
alent to: given a finite set T C PI Q disjoint from S, elements x, € Q, for v € T, and 
e > 0, there exists x € Q such that |x — x,|, < € for all v € T and x € Zp for all 
p ¢ SUT with p # œ. 


Weak approximation follows from strong approximation by forgetting S and weakly 
approximating x, for v € T. Indeed, the difference between the ‘weak’ and the ‘strong’ is 
meaningful here. In weak approximation, we satisfy only a finite number of conditions, 
with no control over the rational number at places v ¢ S. By contrast, in strong 
approximation, the role of the set S is switched, and have specified conditions at all 
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primes v ¢ S, either by approximation at finitely many places in Tas in 28.1.9 or by the 
assertion of integrality at the rest. (This results in some asymmetry in the conclusion 
for the real place; we could restore this by defining Zæ := (—1, 1), but this would not 
add content as we could just include this interval in T.) 


Proof of Theorem 28.1.8. We prove the statement in its formulation 28.1.9. Naturally, 
we return to the proof of weak approximation. Without loss of generality (proving a 
stronger statement), we may assume #S = 1. 

If S = {co}, we apply step 2 of weak approximation over the set T: the result 
x = x’'/d already has x € Z, for q ¢ T, since then q { d. 

So suppose S = {£} with £ prime. We return to case 3. To define y, we note instead 
that Z[1/p] & R is dense, so we may take y € Z[1/€], so in particular y € Z, for 
q + £. We just showed that we may take y’ € Z, for q ¢ T. And for the integers m, we 
claim we may take m = €* for k € Zs: indeed, we are applying case 1 (CRT) and, as 
in 28.1.1, this asks for m = 1 (mod p”?) for p € T (with np € Z>1, and £ # p), so we 
just need to take k to be a common multiple of the orders of £ € (Z/p”? Z)*. With this 
strengthening, we have x = y’/m+y € Z4 for all q ¢ SUT. o 


28.1.10. As is hopefully evident from the proof of strong approximation when S = {oo}, 
aside from continuity of multiplication, the key statement needed was that the map 
Z — Z/mZ is surjective for all m € Z, as provided by the CRT. Or more zippily, what 
is needed is that the image of Z — Z is dense. 


28.1.11. In weak approximation, we can replace the additive group Q with with the 
multiplicative group Q*: the image of Q* — [],<5 Q% is dense a fortiori. 
However, the embedding Q* = Q% is not dense: that is to say, we do not have 


strong approximation for Q*. Indeed, taking S = {co} we have Q% = QX; and since 
Z* n Q* = Z* = {+1}, the open set 7X \{+1} is disjoint from Q%*. In view of 28.1.10, 
the problem is also indicated by the fact that Z* = {+1} does not surject onto (Z/mZ)”* 
form > 7. 


28.2 > Strong approximation for SL? (Q) 


We now consider approximation in the noncommutative context. For motivation in this 
second phase of the introduction, we consider the simplest case where B = M2(Q) 
and take S = {co}, and Qs = Q; by analogy, this is like considering a noncommutative 
generalization of the CRT. 


28.2.1. Recall that B, = M2(Q,) ~ Qt has the coordinate topology (see section 13.5); 
therefore weak and strong approximation for B = M2(Q) follow from these statements 
for Q, and weak approximation for GL2(Q) follows as the determinant is continuous. 


28.2.2. We should not expect the embedding GL2(Q) = GL2(Q) to be dense any 
more than it was for Q* = GL; (Q), as in 28.1.11. In fact, we rediscover the same issue 
by taking determinants: the map GL2(Z) > GL2(Z/mZ) is not surjective, because 
det(GL2(Z)) = +1 whereas det(GL2(Z/mZ)) = (Z/mZ)*. 
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Once we restrict to the subgroup of determinant 1, we find a dense subgroup once 
again. 


Theorem 28.2.3. The image of SL2(Q) => SL2(Q) is dense. 


Theorem 28.2.3 is known as strong approximation for the group SL2(Q). We give 
a quick proof of Theorem 28.2.3 in two steps. In preparation, we recall from 27.2.6 
that 


Q* = Q*Z* 


(“denominators can be handled globally”) and prove an analogous decomposition. 


Lemma 28.2.4. We have 
GL2(Q) = GLQ) GL2(Z) 


2 A (28.2.5) 
SL2(Q) = SL2(Q) SL2(2). 
Proof. We begin with the first statement. The inclusion GL2(Q) GL,(Z) (a GL2(Q) 
holds; we prove the other containment. Let @ € GL2(Q). Consider the collection of 
lattices (Lp)p with Lp = pZ, Cc QZ. Since a, € GL2(Zp) for all but finitely many 
p, we have Lp = Zi, for all but finitely many p. By the local-global dictionary for 
lattices (Theorem 9.4.9), there exists a unique lattice L C Q? whose completions are 
Lp. We now rephrase this adelically (and succinctly): letting L=@2c Q, we take 
L=Ln Q?. Choose a basis for L and put the columns in a matrix œ, so L = aZ?. 
Then L = aZ? = @Z?, and there exists y € GL2(Z) such that @ = a7. This completes 
the inclusion. 
To get down to SL2, we take determinants. Let @ € SL2(Q) and write it as @ = ay 
with œ € GL2(Q) and F € GL»(Z). Then 


1 = det(@) = det(a) det(7) € Q*Z* = Q* 


but Q* NZX = {+1}; multiplying both a, y by D on the right and left respectively, 


0 1 
if necessary, we may take det(a) = det(7) = 1, i.e., œ € SL2(Q) and 7 € SL2(Z). (m 


Now for the slightly magical second step. 


Theorem 28.2.6. The map 
SL2(Z) > SL2(Z/mZ) 
is surjective for all m € Z; equivalently, the image of SL7(Z) => SLZ) is dense. 
The statement is nontrivial: a matrix modulo m can certainly be lifted to a matrix 


in Z whose determinant will be congruent to 1 modulo m, but the hard part is to ensure 
that the lifted matrix has determinant equal to 1. 
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Proof of Theorem 28.2.6. Let a € Mo(Z) be such that œ maps to the desired matrix in 
SL2(Z/mZ), so in particular det(a) = 1 (mod m). By the theory of elementary divisors 
(Smith normal form), there exist matrices u, v E€ SL2(Z) such that uav is diagonal; so 


without loss of generality, we may suppose that œ = i 4 with ab = 1 (mod m). 
Let ee 
a’=|, ie r ie B J r (28.2.7) 
Then a’ = œ (mod m) and 
det(a’) = ab(2 — ab) + (1 — ab)’ = 1 (28.2.8) 
soa’ € SL2(Z), as claimed. (Compare Shimura [Shi71, Lemma 1.38].) Oo 


Remark 28.2.9. The proof of Theorem 28.2.6 extends in two ways. First, we can replace 
Z with a PID and the same proof works. Second, arguing by induction, one can show 
that the map SL,,(Z) > SL,,(Z/mZ,) is surjective for all n > 2 and m E Z. 


We are now ready to prove strong approximation for SL2(Q). 


Proof of Theorem 28.2.3. Consider the closure of SL2(Q) in SL2(Q) in the idelic 
topology; we obtain a closed subgroup. Since SL2(Q) > SL2(Z), by Theorem 28.2.6 the 
closure contains SL(Z), but then by Lemma 28.2.4, it contains all of SL2(Q) SLZ) = 
SL2(Q)! Therefore SL2(Q) < SL2(Q) is dense. o 


With the preceding context, we are now ready to state a more general formulation of 
strong approximation for indefinite quaternion algebras over Q. The following theorem 
is a special case of Main Theorem 28.5.3. 


Theorem 28.2.10 (Strong approximation). Let B be an indefinite quaternion algebra 
over Q. Then B! is dense in B'. 


If B is definite, then O! is a finite group () and so cannot be dense in the infinite 
idelic group B!. This theorem has the following important applications. 


Theorem 28.2.11. Let O be an Eichler order in an indefinite quaternion algebra over 
Q. Then the following statements hold. 


(a) Every locally principal right O-ideal is in fact principal, i.e., #Cls O = 1. 
(b) Every order O’ locally isomorphic to O is in fact isomorphic to O, i.e., 
#TypO = 1. 


(c) For any integer m, the reduction map O! > (O/mO)! is surjective. 


Proof. Specialize Main Theorem 28.5.3, Corollary 28.5.6, and Corollary 28.5.14, 
respectively, using 28.5.16. o 
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28.3 Elementary matrices 


Before embarking on our more general idelic quest, we pause to give a second proof 
of strong approximation for SL, using elementary matrices. 


28.3.1. Let R be a domain. An elementary matrix (or transvection) in SL,(R) is a 
matrix which differs from the identity in one off-diagonal entry; such a matrix acts by 
an elementary row operation (add a multiple of a row to a different row) on the left and 
by an elementary column operation on the right. For n = 2, the elementary matrices 


are those of the form ae or 4 with b,c € F. 
0 1 c 1 


28.3.2. If F is a field, then SL, (F) is generated by elementary matrices by the theory 
of echelon forms (Exercise 28.3). 


Lemma 28.3.3. Let R be a Euclidean domain. Then SL2(R) is generated by elementary 
matrices. 


Proof. The calculation 


(ENII 


shows that ņ is in the subgroup of elementary matrices. 
We now follow the usual proof of the elementary divisor theorem. Let @ = 
a b 
d 
the second row to the first, we may suppose b = 1; then multiplying by 7 on the right 
we may suppose a = 1; elementary row and column operations then give b = c = 0, 
and then d = 1. Similarly, if a = 0, multiplying by 7 gives b = 0 and we repeat. 

So we may suppose a, b + 0. By the Euclidean algorithm under the norm N, there 
exists q,r € R such that a = bq +r and N(r) < N(b). Applying the elementary 
matrix which adds —q times the second column to the first, we may suppose a = 0 or 
N(a) < N(b). If a = 0, we are done by the previous paragraph; otherwise, we multiply 
on the right by 7 which swaps columns, and repeat. Because N takes nonnegative 
integer values, this procedure terminates after finitely many steps. o 


€ SL2(R). First, suppose b = 0. Then det(@«) = ad = 1; adding a times 


Remark 28.3.4. Lemma 28.3.3 holds for general n > 2, and it follows from the above 
by induction: see Exercise 28.4. 


This theory of elementary matrices has the following striking consequence. 
Proposition 28.3.5. Let R be a Dedekind domain. Then for all ideals m © R, the map 
SL2(R) > SL2(R/m) 


is surjective. 
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Proof. We may suppose m is nonzero. Then by the CRT, R/m is a finite product of local 
Artinian principal ideal rings. Therefore R/m is Euclidean and by a generalization of 
Lemma 28.3.3, the group SL2(R/m) is generated by elementary matrices: see Exercise 
28.6. Every elementary matrix in SL2(R/m) lifts to an elementary matrix in SL2(R), 
and the statement follows. o 


Corollary 28.3.6. Let F be a global field, and let R C F be a global ring with eligible 
set S. Then the image of the map 


SLa(F) > SLo(Fs) = | | sLv) 
vés 


is dense. 


Proof. For brevity, we write F= Fs and R= Iles Rv. We first show that SL2(R) —> 
SL2(R) is dense. If U C SL2(R) is open, then U contains a standard open neighborhood 


of the form pi PERETI 
{8 € SL2(R) : B = Am (mod m)} 


for some @m E SL2(R/m) and m C R. The surjectivity in Proposition 28.3.5 then 
implies that U N SL2(R) + 0. 

For the statement itself, we again argue with elementary matrices. Let @ = (a), € 
SLi(F ); then a, € SL2(R,) for all but finitely many v. For these finitely many v, we 
know SL2(F,,) is generated by elementary matrices by Lemma 28.3.2, so by strong 
approximation in F (Lemma 28.7.2) we can approximate œ, by an element of SL2(F) 
that belongs to any open neighborhood of œ,; for the remaining places we apply the 
previous paragraph, and we finish using the continuity of multiplication. o 


28.4 Strong approximation and the ideal class set 


In this section, we provide one more motivation for strong approximation, relating it 
to the ideal class set as previewed in Eichler’s theorem (see section 17.8). 

We adopt the following notation for the rest of this chapter. Let R be a global ring 
with eligible set S and F = Frac R its global field. Let B be a quaternion algebra over 
F and let O C B be an R-order. 


28.4.1. By 27.7.1, the reduced norm map 
nrd: Cls O = BX\B*/O* > FX )\F*/ nrd(O*) (28.4.2) 


is surjective. Then by class field theory 27.7.4, the codomain FX oF a nrd(O*) = 
Cleo) R admits a description as a class group. 

The important point that we will soon see: the reduced norm map is injective, 
therefore bijective, when strong approximation holds for the group B!. But before we 
get there, we have some explaining to do. 


We now investigate the injectivity of the reduced norm map (28.4.2). This map is 
only a map of (pointed) sets, so first we show that it suffices to look at an appropriate 
kernel. 
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28.4.3. For all 8 € BX, the map @O > GOR gives a bijection 
Cls O = B*\B* /0* «œ BX\B*/O”™ = Cls O' 


where O’ := BN BOB is connected (locally isomorphic) to O. So it is sensible 
to consider the maps (28.4.2) for all orders O’ connected to O, i.e., the entire genus 
Gen O. We recall that the type set is finite (Corollary 27.6.25, or Main Theorem 17.7.1 
in the number field case using the geometry of numbers). 


Our investigations will involve the kernels of the reduced norm maps: 


= {a € BX : nrd(a) = 1} < B! := {@ € BX : nrd(@) = 1} (28.4.4) 


Example 28.4.5. If B = (a,b | F), then B! admits the Diophantine description 
= {(x, y, z, w) E F* 4 x? - ay? = bz? + abw? = 1} 


and the group B! consists of local solutions at all primes that belong to R4 for almost 
all places v € PI F. 


Lemma 28.4.6. Let O C B be an R-order. Then the reduced norm map (28.4.2) is 
injective for all orders O' € Gen O if and only if B? c B*O'* for all O’ € Gen O. 


Proof. If (28. 4.2) is injective, then given @ € B! we have nrd(@O*) = = nrd(O*) so 
@O* = zO% for some z € BX and @ € zO* c BO, 

For the converse, since nrd: B* > F% o and nrd: OX 3 nrd(O*) are both surjec- 
tive, to show nrd is injective for O we may show that if nrd(@) = nrd(f) e F* then 
a@O* = zBO* for some z € B*. We consider (@B-!)(BOB-) = = @E NO where as 
above O’ = B n BOB"! € GenO. Since ap! e B}, by hypothesis ap LS = = 
z(BuB!) where z € B* and fi € O*, and consequently 20 = zßnO = = zBO, and 
hence the map is injective. o 


28.4.7. We have B*O*Nn B! = B'O! if and only if nrd(O*) = F% o Anrd(O*) (Exercise 
28.10). 


28.4.8. Suppose that B! is dense in B!. Then we claim that B! c B'O! c BXO% for 
all orders Ô. Indeed, if @ = (app e B! then ao! < B! is open, so there exists œ € B! 
such that «œ = @y with y € O!, and @ = &7! € B 19}, 


We should not expect hypothesis of 28.4.8 to hold for all quaternion algebras: see 
Exercise 28.7. 


28.5 Statement and first applications 


In this section, we set up and state the strong approximation theorem, and then derive 
some applications. Throughout, we abbreviate Bs = B. 
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Definition 28.5.1. We say B is S-indefinite (or B satisfies the S-Eichler condition) if 
S contains a place which is unramified in B. 


28.5.2. If F is a number field, then this definition agrees with Definition 17.8.1; and 
since a complex place is necessarily split and S contains the archimedean places, if B 
is S-definite over a number field F then F is a totally real number field. 


Main Theorem 28.5.3 (Strong approximation). Let B be a quaternion algebra over a 
global field and suppose B is S-indefinite. Then B! is dense in B!. 


28.5.4. One can think of strong approximation from the following informal perspective: 
if B! is not compact, then there is enough room for B! to “spread out” in B! so that 
correspondingly B! is dense in the S-finite part B!. 

The hypothesis that B! = [],cs B} is noncompact is certainly necessary for the 
conclusion that B! is dense in B!. Indeed, if Bi = [],es B} is compact, then since B! is 
discrete in B!, the subgroup B'B! < B! is closed in Bt, and B! B! + B!. On the other 
hand, if B! is dense in B!, then adding the components for v € S we have B! B! <B! 
dense. This is a contradiction. 


We give two proofs of strong approximation over the next two sections. For the 
moment, we consider some applications. 

Our main motivation for strong approximation is the following proposition. We 
recall the class group 27.7.4 associated to O. 


Theorem 28.5.5. If B is S-indefinite, then the reduced norm map (28.4.2) 
nrd: Cls O = B*\B*/0* > Clio) R = FX 9\F*/ nrd(O*) 


is a bijection for all R-orders O C B: in particular, if I is a locally principal right 
O-ideal, then I is principal if and only if nrd(T) is principal in the class group Cleo) R. 


Proof. Combine Lemma 28.4.6 and 28.4.8. oO 


Corollary 28.5.6. If B is S-indefinite and Clg(oy R is trivial, then Typ O is trivial, i.e., 
every order O locally isomorphic to O is in fact isomorphic to O. 


Proof. The class set Cls O maps surjectively onto Typ O by Lemma 17.4.13, and the 
latter is trivial by Theorem 28.5.5. o 


28.5.7. More generally, we can grapple with the type set of O, measured by a different 
(generalized) class group. Recall (27.6.24) that 


Typ O © B*\B*/N5x(0). 
Let 
GN(O) := F% 


Q 


onrd(Ng.(O)) < FX. (28.5.8) 
Since OX < N x(O), we have GN(O) > G(O). Define accordingly the class group 


Clen) R = F*/GN(O). (28.5.9) 
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Then there is a surjective map of abelian groups 
Cleo) > Clenoy R. 
Corollary 28.5.10. Jf B is S-indefinite, then the reduced norm map induces a bijection 
nrd: TypO > ClenioyR 


Proof. We take the further quotient by the normalizer in the bijection in Theorem 
28d: o 


28.5.11. Returning to 17.4.16, for B = Mo(F) and O = M2(R) we compute that 
nrd(N gx (O)) = F?R”, so Clen(o) R = CIR/(C1 R)?. Thus by Corollary 28.5.10, the 


R 
types of maximal orders in O are given by (E 4 for [a] in a set of representatives 


of (CI R)/(C1 RY. 


Two other immediate applications of strong approximation that served as motiva- 
tion are now apparent. 


Corollary 28.5.12. Suppose B is S-indefinite. Then 
B'=B'O! and B'=B'0!. (28.5.13) 
Proof. The inclusion B!O! c B! holds, and the converse holds when B! is dense in 


B! by 28.4.8. For the second statement, we have B = Bx Bs and O = O x Bs, so we 
take norm 1 units and multiply both sides of (28.5.13) by B= ves BI. Oo 


Corollary 28.5.14. Suppose B is S-indefinite. Let m C R be an ideal. Then the reduc- 
tion map 


! > (O/mO)' 
is surjective. Moreover, O! is dense in 0! 
Proof. For all am € (O/mO)!, by strong approximation the open set 
{Be O!: B= Qm (mod mO)} € 0! 


contains an element a € B! n O! = O! mapping to a in the reduction map. The 
second statement follows as above from the definition of the topology. o 


We now give a name to a large classes of orders where the group G(O) governing 
principality is explicitly given. 


Definition 28.5.15. We say that an R-order O C B is locally norm-maximal if 
nrd(O*) = R*. 


Equivalently, O is locally norm-maximal if and only if the reduced norm maps 
nrd: OF > R% are surjective for all nonzero primes p of R. 
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Example 28.5.16. If O is maximal, then O is locally norm-maximal (Lemma 13.4.9); 
more generally if O is Eichler then O is locally norm-maximal (Exercise 23.3). 


Certain special cases of Theorem 28.5.5 are important in applications. Recall that 
Q C Ram B is the set of real ramified places, and Clo R as defined in 17.8.2 is class 
group associated to ©, a quotient of the narrow class group. 


Corollary 28.5.17. Suppose F is a number field and let S be the set of archimedean 
places of F. Suppose B is S-indefinite and O C B is locally norm-maximal R-order. 
Then nrd: Cls O —> Clo R is a bijection. 


Proof. This is just a restatement of Theorem 28.5.5 once we note that Clg(o) R = Cla R 
by Example 27.7.7. o 


Proposition 28.5.18. Let T 2 S be a set of primes of R that generate Clg(o) R and 
suppose B is T-indefinite. Then every class in Cls O contains an integral (invertible 
right) O-ideal whose reduced norm is supported in the set T. 


Proof. Let Rn denote the (further) localization of R at the primes in T. We apply 
Theorem 28.5.5 to the order Or) := O 8r Rgn: we conclude that there is a bijection 
Cls Om = Cle@n) Rn. But Cle(o,p) Rin is the quotient of Clg(o) R by the primes in 
T, and so by hypothesis is trivial. Therefore if 7 is a right O-ideal, then Krn := I 8r Rin 
has Kr) = «Oqn for some a € B*. Let J = a™!1. Then [J]r = [I]r and Jp = Op for 
all primes p ¢ T and so J has reduced norm supported in T. Replacing J by aJ with 
a € R nonzero and supported in T, we may suppose further that J C O is integral, and 
the result follows. o 


Example 28.5.19. Let B be a definite quaternion algebra over a totally real (number) 
field F and let S be the set of archimedean places, so R = Zr. Let O be a locally 
norm-maximal R-order in B. Suppose that Clo R = {1} and let p C R be a prime of R 
unramified in B. Then by Proposition 28.5.18, every ideal class in Cls O contains an 
integral O-ideal whose reduced norm is a power of p. 

As a special case, we may take F = Q: then Clo Z = Cl* Z = {1}. Therefore, 
if B is a definite quaternion algebra of discriminant D over Q, and O C B a locally 
norm-maximal order (e.g., an Eichler order), then for a prime p { D, every invertible 
right O-ideal class is represented by an integral ideal whose reduced norm is a power 
of p. 


28.6 Further applications 


We continue with further applications of strong approximation. 
Our next consequence of strong approximation is a refinement the Hasse—Schilling 
theorem on norms (Main Theorem 14.7.4) as follows. 


Theorem 28.6.1 (Eichler’s theorem on norms). Suppose B is S-indefinite, and let 
neRn Fay Then there exists œ € B* integral over R such that nrd(q) = n. 


488 CHAPTER 28. STRONG APPROXIMATION 


Proof. By Main Theorem 14.7.4, there exists œ € B* such that nrd(@) = n. For each 
prime p, the set i 
Up = {Bp E B, : trd(Bya@) € Rp} (28.6.2) 
is (closed and) open since trd is continuous, and further Up is nonempty: if p € Ram B 
then already œp is integral over Ry and 1 € Up, and otherwise By ~ M2(Fp) and we 


0 -n\.. f 
may suppose œp =|; , Jisin rational canonical form whereby 


0 1\/0 -n 1 t 
vaf” Ale ") =wa(, j)=nt ler, (28.6.3) 


shows By = c l € Up. 

Let U := (II p Up) N B!; then U is open and nonempty. By strong approximation, 
there exists B € Ū N B!. Thus trd(Ba) € Np Ry = R and nrd(Bq@) = nrd(a) = n. 
Therefore Ga is as desired. o 


Let RX o := R* N F% o be the subgroup of units that are positive at the places 
v € Q (the set of real, ramified places in B). 


Corollary 28.6.4. Suppose B is S-indefinite and that Clo R is trivial. Let O C B be an 
Eichler R-order. Then 
nrd(O”) = RŽ 0: 
Proof. Let u € RZ 0" We repeat the argument of Theorem 28.6.1, with a slight modi- 
fication. Let M be the level of O. 
Let p | M be a prime that divides the level M. We choose an isomorphism 
By ~ M2(Fp) such that œp is in rational canonical form, and let Os be the standard 


Eichler order in M2(F,) of the same level as O. Define 
Up = {Bp € By : Bpa € Oj}. 


This is again an open condition because multiplication is continuous, and the calcula- 


tion 
tu! 1\(0 -u _{1 0 
-1 OJ z} \0 u 
shows also that U, # 0. 


For all other primes p { M, we define U, as in (28.6.2). As in the proof of Theorem 
28.6.1, we find 8 € B! such that nrd(Ba@) = u and y’ = Ba has Yp € Op for all p | M. 
Since y’ is integral, y’ belongs to an R-order O’ that is equal to O, at all p | M and is 
maximal at all p { Mt. The order O’ is therefore an Eichler order of level M. 

Finally, since Clg(o) R = Clo R is trivial, the type set Typ(O) is also trivial 
(Corollary 28.5.6), so we conclude O = vO’y7! for some v € B*, and y = vy’v-! € O 
has nrd(y) = u as desired. o 


Example 28.6.5. Suppose O is an Eichler order in an indefinite quaternion algebra 
over F = Q. Then by Corollary 28.6.4, nrd(O*) = {+1}, in particular, there exists 
y € O% such that nrd(y) = -1. 
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Remark 28.6.6. We will prove a stronger version of Corollary 28.6.4 after we have 
developed the theory of selectivity: see Corollary 31.1.11. 


To conclude this section, we consider a variant of principalization of right fractional 
ideals: we ask further that the generator has totally positive reduced norm. 


28.6.7. Suppose F is a number field. Let 
Bes = {a € B* : v(nrd(a)) > 0 for all real places v}. 


The reduced norm gives a map B*/B*, > F. > nor F% and the quotient is a finite abelian 
2-group, so in particular B, < B% has finite index. 

Let J, J C B be R-lattices. We say J, J are in the same narrow right class if there 
exists a € BX, such that aJ = J; accordingly, we let Cls% O be the narrow (right) 
class set of O. As in Lemma 27.6.8, there is a bijection 


Clst O © B%,\B/O* 


choosing a local generator. The projection map Cls O > Clsr O has finite fibers as 
BX) < B* has finite index, so since Clsp O is a finite set, so too is Cls% O. 


Corollary 28.6.8. Let F be a number field and suppose B is S-definite. Then the map 
nrd: Clsy O e BSy\BY/O* > FX\F*/ nrd(O*) =: Clio) R (28.6.9) 
induced by the reduced norm is a bijection. 


Proof. Repeating the argument in the proof of Lemma 28.4.6, we see that the map 
(28.6.9) is injective for all orders O’ € GenO if and only if Bic B% O% for all 
O’ € Gen O. And the latter holds by strong approximation (Corollary 28.5.12): 


pl 1571 A 
B C BO" c B30” 
for all R-orders O’. o 


Proposition 28.6.10. Let F be a number field. Suppose that B is S-indefinite and that 
ClEo) R = Clg) R. For each real place v not in O, let e € {+1}. Then there exists 
y € O such that sgn(v(nrd(y))) = & for all v real not in Q. 

In particular, if F = Q and O is a locally norm-maximal Z-order in an indefinite 
quaternion algebra B, then there exists y € O* with nrd(y) = -1. 


Proof. Leta € F? o be such that v(a) = & for all v real not in Q and the class of 
aR is trivial in Clg(o) R: these constraints together impose congruence conditions on 
elements in a real cone. By the Hasse-Schilling norm theorem, there exists a € B” 
such that nrd(q@) = a. Thus the class of nrd(@O) in Cleo) R = Cleo) R is trivial. 

But then by (28.6.9) (a consequence of strong approximation), there exists 6 € BX, 
such that BO = œO, and therefore 6 = ay with y € O*. Since £ is totally positive, for 
all real places v ¢ Q we have sgn(v(nrd(y))) = sgn(v(nrd(q@))) = &, completing the 
proof. 

For the second statement, we only need to note that Cl* Z = C1Z = {1} and recall 
that nrd(O”) < {+1}. o 
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Remark 28.6.11. More generally, let B be a central simple algebra over the global field 
F. We say B satisfies the S-Eichler condition if there exists a place v € S such that 
B, is not a division algebra. (In this text, for quaternion algebras we prefer to use the 
term S-indefinite because it readily conveys the notion, but both terms are common.) 
If F is a number field and S is the set of archimedean places of F, then B satisfies the 
S-Eichler condition if and only if B is not a totally definite quaternion algebra (Exercise 
28.5). So the condition is a mild condition, and the quaternion algebra case requires 
special effort. 

When B satisfies the S-Eichler condition, then B! —> B! is dense, and for O C Ba 
maximal R-order, a locally principal right fractional O-ideal J C B is principal if and 
only if its reduced norm nrd(/) C R is trivial in Clo R, where Q is the set of real places 
v € PIF such that B, ~ M,,(H), generalizing the quaternion case. 

Eichler proved Theorem 28.5.5 and the more general statement of the previous 
paragraph [Eic37, Satz 2], also providing several reformulations and applications 
[Eic38a, Eic38c]. For a full exposition, see Reiner [Rei2003, $34]. 


28.7 First proof 


Now we proceed with the proof of strong approximation in Theorem 28.5.3; we follow 
roughly the same lines as in the proof of Eichler’s theorem on norms, but here instead 
we will be concerned with traces. In other words, we replace strong approximation of 
elements by strong approximation of traces, and then we just have to chase conjugacy 
classes. 

We start with a statement of weak and strong approximation for the global field F. 


Lemma 28.7.1 (Weak approximation for F). Let S C PI F be a finite nonempty set of 
places. Then the images of the maps 


Fo Fsi=| |F and P> r=] |F 


veS veS 


are dense. 
Proof. See e.g. Neukirch [Neu99, Theorem II.3.4] or O’ Meara [O’Me73, 811E]. o 


Lemma 28.7.2 (Strong approximation for F). Let S C PIF be a finite nonempty set of 
places. Then the image of F —> Fs := J |] ygs Fy is dense. 


Proof. See e.g. Neukirch [Neu99, Exercise II.1.1] or O’ Meara [O’Me73, §33G]. o 
We recall that B is a quaternion algebra over F and we write Bs := [], <5 By. 


Proposition 28.7.3 (Weak approximation for B). Let S C PIF be a finite nonempty 
set of places. Then the images 


B= Bs and B* = Bx and B!= B! 


are dense. 
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Proof. By weak approximation for F (Lemma 28.7.1), we have F dense in [Į [yes Fy. 
Choosing an F-basis for B, we have B ~ F^ as topological F-vector spaces, and so 
by approximating in each coordinate, we conclude that B is dense in [[ņ es By. The 
multiplicative case follows a fortiori, restricting open neighborhoods. 

Finally we treat B!. By Exercise 7.31, we know that B! = [B*, B*] is the commu- 
tator. Let (yy) € [yes B}. Then for each v € S, we can write yy = @ypya; l By 1 with 
ay, By € B*. Then by weak approximation for B*, we can find a sequence a, € B* 
such that a, —> (@y) € BX, and similarly with 6, — (,),. Then since multiplication 
is continuous, we conclude that yn = @nBna;,'B,! > (Yv) € B!. o 


Next, we need to approximate polynomials: this kind of lemma was first performed 
in section 14.7 to prove the Hasse—Schilling theorem of norms, and here we need 
another variant. 


Lemma 28.7.4. Let S C PIF be a finite nonempty set of places and X C PIF a finite 
set of noncomplex places disjoint from S. Lett € B be such that x? — tyx + 1 € Fix] 
is irreducible for v € X. 

Let € > 0. Then there exists t € F such that: 


e f(x) = x? -tx + 1 is irreducible and separable over F; 
e |t- ly < €forallv € X; 

e f(x) is irreducible over F, for all v € X; and 

°|t-ah < lforallv € SUX. 


Proof. We argue as in Lemma 14.7.6 (and Corollary 14.7.8), but instead of weak 
approximation we now use strong approximation (Lemma 28.7.2). Our job is a bit 
easier because we are only asking for irreducibility, not separability. 

Since S is nonempty, by strong approximation for F we can find t arbitrarily close 
to T, thus ensuring that the desired inequalities hold and that f(x) is irreducible over 
F, for v € X; to ensure that f(x) is separable, we need only avoid the locus t? = 4, and 
similarly we may ensure f(x) is irreducible. o 


We now embark on the proof of strong approximation. 


Proof of Main Theorem 28.5.3. We follow Vignéras [Vig80a, Théorème III.4.3]; see 
also Miyake [Miy2006, Theorems 5.2.9-5.2.10] for the case F = Q. We show that the 
closure of B! is equal to B'. Let Y=) € B l. we will find a sequence of elements 
of B! converging to 7. 


Step 1: Setup. We claim it is enough to consider the case where y, = 1 for all but 
finitely many v, by a Cantor-style diagonalization argument. Indeed, for a finite set 
T C PIF disjoint from S, we let Yir be the idele obtained from ¥ replacing yy = 1 
for v ¢ T. Then for a sequence of subsets T eventually containing each place v, we 
have Fir, — 7. Thus, if we can find sequences in B! converging to each Fir) we can 
diagonalize to find a sequence converging to ¥, since Pl F is countable. 

So we may suppose without loss of generality that y, = 1 for all but finitely many 
v. To find a sequence in B! converging to F, our strategy in the proof is as follows: 
in shrinking open neighborhoods of ¥ we first find an element in B! whose reduced 
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characteristic polynomial is close to an element in the open neighborhood, and then 
we conjugate by B* to get the limits themselves to match. 

To this end, let O C B be a reference R-order, let T C PIF be a finite set of places 
disjoint from S containing the primes v where y, + 1 and the ramified places of B not 
in S. We consider open neighborhoods of the form 


u=| |wuwx I] o! 


veT v¢SUT 
where U, C B! is an open neighborhood of 1. 


Step 2: Polynomial approximation. Now comes the polynomial approximation step: 
we will show that there exists @ € B* such that B! N @!U@ + 0. (This is about as 
good as could be expected at this stage: if we argue by approximating polynomials, we 
should only be able to expect to get something up to conjugation.) 

We define the idele fin each component, as follows. 


- Ifv ¢ SUT, we take t, = trd(y,) = 2. 

¢ Ifv € Tis unramified in B, we take t, = trd(y,). 

e Ifv € Tis ramified in B, we choose u, € yU, such that u, ¢ F, is separable 
over F,, and take t, = trd(y,,). Since B, is a division algebra, we have F,[ u] is 
a field, and so its reduced characteristic polynomial is irreducible. 

+ Ifv € Sis ramified in B, we choose t, € F, such that x? — t,x + 1 is irreducible; 
such an element t, exists by Lemma 14.7.5). 


The only places that remain are those v € S such that v is unramified in B. By 
hypothesis that B is S-indefinite, we know that there is at least one such split place 
Vsp] E€ S remaining (and in particular, vsp é Ram B U T). 

We now apply our polynomial approximation Lemma 28.7.4, we conclude that 
there exists t € F such that f(x) = x? — tx + 1 is separable and irreducible over F, 
and irreducible over F, for all v € Ram B, and such that ¢ is arbitrarily close to T. Let 
K = F[x]/(f(x)). Then either Ram B = Ø and K —> B automatically, or Ram B + Ø 
so f(x) is irreducible and then K — B by the local-global principle for embeddings 
(Proposition 14.6.7). Let B € B! have minimal polynomial f(x). Since T € trd(U) and 
the reduced trace is an open (linear) map, with a closer approximation we may suppose 
trd(B) € trd(U), and therefore there exists y’ € U such that trd(8) = trd(y’) so that 
B, 7 have the same irreducible minimal polynomial. By the Skolem—Noether theorem 
(Corollary 7.7.3), there exists @ € B™ such that 


B=a'ya. (28.7.5) 

Step 3: Finding a sequence. We then repeat the above argument with a sequence of 
open neighborhoods U,, > @ such that (),, Un = {7}; we obtain a sequence 

B! > Bn = @,'¥,Gn € Q, Unn. (28.7.6) 


Since y/, € Un, we have y;, > 7, and in particular for v € PIFNS U T, we have 
Yny >W =l. 
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Step 4: Harmonizing the sequence. By ‘harmonizing’ the conjugating elements @,,, we 
will realize a sequence in B! tending to 7 as desired, in two (subset)steps. First, by 
Main Theorem 27.6.14, BY/BX is compact. So restricting to a subsequence, we have 

= = Snn with un € B* and Sn > 6 =(6))) € BX. Second, by weak approximation 
B (Proposition 28.7.3), B% is dense in [],,<7 BX, so there is a sequence v„ from B* 
such that v, — (6) yer. 


Step 5: Conclusion. To conclude, we consider the sequence 


(nbn )BuYnben) | = nbn YIn(On¥a'): (28.7.7) 
We claim that this sequence tends to 7. For v € T, we have vp, my — 6,6,! =1 s0 
On vôn Yn, vln v Yny) > Yv- (28.7.8) 
On the other hand, for v € Pl FN(S U T), we have yp, > 1, so 
On, vô EDA v (Ôn, v Yn, > l=y. (28.7.9) 
Putting together these cases, we conclude that (vn tn) Bn(vn Uny! — Y, and therefore 
7 is in the closure of B!. Oo 


Remark 28.7.10. Strong approximation has a more general formulation, as follows. 
Let G be a semisimple, simply-connected algebraic group over the global field F. Let 
S be an eligible set containing a place v such that G(F,,) is not compact. Then G(F) 
is dense in G(F's), and we say G satisfies strong approximation (relative to S). Over 
number fields, strong approximation was established by Kneser [Kne66a, Kne66b] and 
Platonov [Pla69, Pla69-70], and over function fields by Margulis and Prasad [Pra77]; 
see also Platonov—Rapinchuk [PR94, Theorem 7.12]. For a survey with discussion 
and bibliography, see Rapinchuk [Rap2014] and the description of Kneser’s work by 
Scharlau [Scha2009, §2.1]. 


28.8 Second proof 


Because of its importance, we now give a second proof of strong approximation; this 
has the same essential elements, but uses some facts from group theory to simplify away 
the final steps of the previous proof. We follow Swan [Swa80, §14], who references 
Kneser [Kne66a, Kne66b], Platonov [Pla69-70], and Prasad [Pra77]; see also the 
exposition by Kleinert [K1t2000, §4.2]. 

Let Z := cl(B') < B! be the closure of B! in B!. We embed B} œ B! by 
(...,1,@,, 1,...) in the vth component, for v ¢ S. 


Lemma 28.8.1. If B! C Z for almost all v ¢ S, then Z = B!. 


Proof. Suppose that B! c Z for all v ¢ T where T is a finite set. Let y € B!. If 
yy = | for all v € T, then ¥ is a limit of elements in Z (approximating at a finite level), 
so y € Z. Otherwise, by weak approximation for B (Proposition 28.7.3), there exists 
y € B! such that y is near y, for all v € T. Let B have By = 1 for v € Tand B, = yty, 
for v ¢ T; then £ € Z, and 7 is the limit of the y£. o 
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Now we consider 
Zi := {7 €Z: = 1 for all but finitely many v}. (28.8.2) 
Lemma 28.8.3. Z; < B! is a normal subgroup. 


Proof. Let ¥ € B! and let @ € Z; with a, = 1 for v ¢ T with Ta finite set. By weak 
approximation for B (Proposition 28.7.3), there exists y € B! with y close to y, for 
all v € T. Therefore y"'@y is near y-'@y for v € T and y"!ayy = y,!ayyy = 1 for 
v ¢ T, so is the limit of such in Z. Therefore y-'@y € Z, thus y-'Z,;y C Z, and 
Z, a B!. Oo 


Lemma 28.8.4. Let F be an infinite field. Then PSL2(F) is a simple group. 


Proof. See e.g. Grove [Grov2002, Theorem 1.13]. Briefly, the result can be proven 
using Iwasawa’s criterion, since SL2(F) acts doubly transitively on the linear subspaces 
of F?: the kernel of the action is the center {+1}, and the stabilizer subgroup of a standard 
basis element is the subgroup of upper triangular matrices, whose conjugates generate 
SLo(F). o 


Proof of Main Theorem 28.5.3 (Strong approximation). We show that Z = B!. By 
Lemma 28.8.1, it is enough to show that B} c Z for almost all v. By Lemma 28.8.3, 
each ZıNB} < B} is anormal subgroup; by Lemma 28.8.4, either this normal subgroup 
is either scalar, or we have Z; N B} = B} < Z, < Z and we are done. So it suffices to 
show that for almost all v, we have Z, N B! nonscalar, which is to say, Z4 N B! + {1}. 

So let w € PI F be unramified. We perform polynomial approximation, as in Step 2 
of the first proof. For v € Ram B, let x? — t,x + 1 be a separable irreducible polynomial 
(which exists by Lemma 14.7.5) with 4, € R,, and do the same for w; let T be the 
corresponding idele, with t, = 1 for the remaining places v. By Lemma 28.7.4, there 
exists t € F such that f(x) = x? —tx + Lis: 


irreducible and separable over F, 
e irreducible over F, for all v € Ram B, and 
+ such that ¢ arbitrarily well-approximates f € R, so we may suppose t € R. 


By the local-global principle for embeddings (Proposition 14.6.7), there exists 6 € B! 
with f(8) = 0; but moreover, £ is integral and so belongs to a maximal order. In this 
way, we manufacture a sequence Bn with trd(B,) — T. Repeating this with 4, > 1 for 
v € Ram B and diagonalizing, we may suppose t, = 1 for all v + w. 

Let O be a maximal R-order. The type set Typ O is finite by Corollary 27.6.25; 
choose representatives O; for TypO. After conjugating the elements £8,, we may 
suppose without loss of generality that each 6, belongs to one of the orders O;. By 
the pigeonhole principle, there is an order containing infinitely many, so restricting to 
a subsequence we may suppose Bn, € O! for all n. 

But now the kicker: O! is compact, so we may restrict to a convergent subse- 
quence fn > B e Ol. By construction, each 8„ has separable reduced characteristic 
polynomial, and trd(6,,,) —> 1 for all v + w, so Bay > y = 1 for all v + w. But 
trd(Bnw) > tw, and x? — ty.x + 1 is irreducible, so Bẹ ¢ FX, as desired. Oo 
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28.9 =» Normalizer groups 


In this section, we apply strong approximation to the normalizer of an order, and we 
compare the normalizers for locally isomorphic orders. We recall the notation from 
section 18.5. Let Idl(O) be the group of invertible two-sided fractional O-ideals, let 
PIdl(O) < Idl(O) the subgroup of principal fractional O-ideals, and let PIdl(R) < 
PIdl(O) be the image of the group of principal fractional R-ideals. 

We suppose throughout this section that PIdl(O) < Idl(O) is a normal subgroup. 
This is true whenever Idl O is abelian, which holds when O is Eichler order (using 
Lemma 23.3.13 for the primes where p is maximal and Proposition 23.4.14 the remain- 
ing primes). 


28.9.1. Recall there is a natural exact sequence 
1 > Npgx(O)/(F*O%) > Picr(O) > Idl(O)/ PIdl(O) > 1 (18.5.5) 


obtained by considering the class of a bimodule as a two-sided ideal modulo principal 
ideals. The idelic dictionary (27.6.26) gives another proof of exactness: there is a 
canonical bijection 

Idl(O) > Nz.(O)/O*, 


to obtain Idl(O)/ PIdl(O) we take the quotient by N(O) and to obtain Idl(O)/ PIdl(R) 
we take the quotient by F*; therefore the exact sequence (18.5.5) can be rewritten 


1 > F*\Ngx(O)/O*% > F*\Nęx(0)/0* > Ngx(O)\Ng.(O)/O% > 1 (28.9.2) 


and its exactness is now visible. 


28.9.3. We have a map of pointed sets 
Idl(O) — Cls O 
In] 


and PIdl(O) is the kernel of this map, the preimage of the trivial class in Cls O. The 
composition of this map with the reduced norm gives a group homomorphism: 


c: Idl(O) > Cleeo)(R) = FX 


0\EY nrd(O) 
I |> [nrd(J)] 


Lemma 28.9.4. Suppose that B is S-indefinite. Then PIdI(O) = kerc, i.e., 
PIdl(O) = {7 € Idl(O) : [nrd(7)] is trivial in Clgoy(R)}. 
Proof. By strong approximation (Theorem 28.5.5), the reduced norm gives a bijection 


nrd: Cls O — Clgo)(k); thus J € Idl(O) is principal, belonging to PIdI(O), if and 
only if [nrd(/)] is trivial in Cle(o)(R). oO 


Now let O, O’ be locally isomorphic orders (in the same genus) with connecting 
O, O’-ideal J. 
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28.9.5. By 18.4.7, there is an isomorphism of groups 
Idl(O) > Idl(O’) 

Š (28.9.6) 

Ie J IJ. 


which induces an isomorphism Picr(O) = Picr(O’). 
We now come to the first major result of this section. 


Proposition 28.9.7. Suppose that B is S-indefinite. Then the map (28.9.6) induces a 
commutative diagram 


1 ——+> Npgx(O)/(F*O*) ——> Picr(O) ——> Idl(O)/ PIdl(O) ——~+ 1 


| l | 
1 —> Ngx(0')/(F*0'*) —> Picr(O’) —> Idl(O’)/ PIdl(O’) —> 1 


with vertical maps isomorphisms. 


Proof. We verify that J7! PIdl(O)J = PIdI(O’), from which both statements follow; 
and this verification comes from Lemma 28.9.4, as 


[nrd(J~'J)] = [nrd()] € Cleo) R = Claon R 
(recall 27.7.4), so I € PIdl(O) if and only if J7!J € PIdI(O’). (m 


Proposition 28.9.7 says that when B is S-indefinite, then the structure of the nor- 
malizer group, the Picard group, and group of ideals modulo principal ideals are all 
isomorphic for all orders in a genus. The same is not true when B is S-definite; we 
always have an isomorphism in the middle, but for locally isomorphic orders, the Picard 
group may be distributed differently between the normalizer and the ideal group. 

By chasing a few diagrams, we can be more specific about the structure of Id1(O) 
by seeking out primitive ideals. 


Definition 28.9.8. The Atkin—Lehner group of O is 
AL(O) := {J € Idl(O) : [nrd(J)] € (Clee) R)’}/1dl(R). (28.9.9) 


The definition (28.9.9) makes sense because Idl(R) is indeed a subgroup: since if 
a € Idl(R) then [nrd(aO)] = [a]? € (Cleo) RY’. 


Example 28.9.10. Suppose that O is an Eichler order with discrdO = St. Then 
(23.4.21) gives an isomorphism 


Idl(O)/ IdI(R) = I] Z/2Z. 
pin 


The group AL(O) is therefore an abelian 2-group, isomorphic to [] pj; Z/2Z when 
(Cleo) RÝ is trivial. For example, for O = M2(R) c B = M2(F), we have AL(O) the 
trivial group. 
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28.9.11. There is a group homomorphism 
Idl(R) > Idl(O) 


(28.9.12) 
ar a0; 
this map is injective, since aO = O implies a = R. We obtain an exact sequence 
1 > CI R > Picr(O) > Idl(O)/ Idl(R) — 1 (28.9.13) 


(compare to (18.5.5)). From Lemma 28.9.4 and the fact that PIdl(R) © ker(c), we 
obtain an exact sequence 


1 > Npgx(O)/(F*O*) > Picr(O) = Cleo) (R). (28.9.14) 


From (28.9.14), we see that c(Idl(R)/ PIdI(R)) = (Cleo) R)*; further, we have c(Cl R) = 
(Cleo) RY with 


ker clair = (CIR)[2}o := img(Clg@(R)[2] > Cl(R)) < (CIR)[2]. (28.9.15) 


We write (Cl R)[2];o0 because this is the subgroup of ideal classes that lift to the group 
Clgo)(R)[2] (having order dividing 2). Therefore the following diagram commutes, 
with exact rows and columns: 


1 1 1 
Y 
1 > (CIR)[2};0 > Ngx(O)/(F*XO*) AL(O) 
Y 
1 CIR Picr(O) Idl(O)/Idl(R) ——>1 (28.9.16) 
Y 4 Y 


1 + (Cleo) R? —> Cleo (R) —> Clewo(R)/(Cleo R? > 1 


The second main result of this section is the following. 


Proposition 28.9.17. Suppose B is S-indefinite. Then there is a (non-canonically) split 
exact sequence 


1 > (CI R)[2]ro > Ngx(O)/(F*O*) => AL(O) > 1. (28.9.18) 


Proof. The snake lemma implies that the top row of (28.9.16) is exact; the sequence 
is split by choosing for each class in AL(O) a generator of a representative ideal. 
Here is second, self-contained proof which captures the above discussion. Say that 
a two-sided (integral) O-ideal 7 is R-primitive if Z is not divisible by any integral ideal 
of the form aO witha Ç R. (If I is integral but not R-primitive, with J C aO and a as 
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small as possible, then a~'J c O is integral and now R-primitive.) Let a € Ngx(O). 
Then OaO = I € Idl(O), so we can factor J = cJ uniquely with c a fractional ideal of 
Rand J an R-primitive ideal. We have 


aR = nrd(a)R = nrd(J) = c? nrd(J) = ĉa (28.9.19) 
and so 
1 = [@)] = [c] [a] € Cleo (R) 
and in particular [nrd(J)] € (Cleo) R)*. Therefore there is a map 
Npx(O) > AL(O). (28.9.20) 


This map is surjective by strong approximation (see Lemma 28.9.4): if J € Idl(O) 
has nrd(J) = a and [a] = [c1]? € (Cleo) Ry’, then [nrd(cJ)] = 1 € Cle(o)(R) so 
by Theorem 28.5.5, there exists a € B* such that O@O = cJ and since cJ € Idl(O) 
we have a € Ngx(O). The map is split by this construction, with a choice of J 
up to Idl R. The kernel of the map in (28.9.20) consists of œ € Ngx(O) such that 
OaO = cO with c € IdI(R), and from the preceding paragraph [c?] = 1 € Clecoy(R) 
so [c] € Clg(o)(R)[2]; however, the kernel also contains F*O*, whence the class of 
c is well-defined only in Cl(R). Therefore the kernel is canonically identified with 
(CI R)[2]to. o 


Corollary 28.9.21. Suppose that O is an Eichler order with discrd O = Ò% and that B 
is S-indefinite. Then 


Npgx(O)/(F*O*) = AL(O) x (C1 R)[2lho 


is an abelian 2-group with rank at most W(X) + h2(R), where w(N) is the number of 
prime divisors of N and h2(R) = dimp, (C1 R)[2]. 


Proof. Combine Proposition 28.9.17 and Example 28.9.10. o 
Corollary 28.9.22. We have 
Narar)\(M2(R))/(F* GL2(R)) = (C1 R)[2]. 


Proof. Apply Corollary 28.9.21 with Clg(o)(R) = CI(R) so (C1 R)[2]ro = (C1 R)[2] 
and AL(O) the trivial group by Example 28.9.10. m 


Remark 28.9.23. Corollary 28.9.21 corrects Vignéras [Vig80a, Exercise II.5.4] to 
account for possible class group factors. 
28.10 =» Stable class group 


We conclude this epic chapter with a final result on the stable class group, restoring 
some generality, we announced a special case of this theorem as Theorem 20.7.16. 


Theorem 28.10.1 (Fröhlich-Swan). Let R be a global ring with F = Frac R, let B be a 
central simple F-algebra, and let O C B be an R-order. Let Q C PIF be the set of real 
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places v of F such that By, ~ Mm (H) for some m > 1. Then the reduced norm induces 
an isomorphism 


nrd: StClO © FX ,\F*/nrd(O*) (28.10.2) 


Q 
of finite abelian groups. In particular, if B is a quaternion algebra and O is locally 
norm-maximal order, then 


StClO =~ Cla R 
where Q C Ram B is the set of real ramified places in B. 


Proof. We give only a sketch of the proof. For further details, see the references given 
for the proof of Theorem 20.7.16. 

We first show that the map (28.10.2) is well-defined. Suppose that /@O” = I’ O” 
with r > 0. If r = 0, we are done; so suppose r > 1. Extending scalars, we find 
an isomorphism ¢: B’*! > B’*! of left B-modules, represented by an element y € 
GL,+1(B) acting on the left. In a similar way, associated to J p O” we obtain a class 


GL-+1(B)@ GLy+1(O) € GL;+1(B)\ GLy+1(B)/ GLy+1(O) 


by choosing in each completion an isomorphism with ort! represented by a matrix, 
well-defined up to a change of basis on the right (and on the left by a global isomor- 
phism). Now by strong approximation in the advanced version announced in Remark 
28.7.10, the reduced norm induces a bijection 


nrd: GL,+1(B)\ GL,+1(B)/ GL;41(O) > FX o\F%/ nra(0%) 
after we check that the codomain is indeed the image of the reduced norm. Then 
nrd(a@’) = nrd(y@) = nrd(y) nrd(@) = nrd(@) € FZ APY nrd(O*) 


so the map is well-defined. 

Similarly, the map (28.10.2) is a group homomorphism: an isomorphism J @ I’ = 
J ® O gives nrd(yf) = nrd(8) = nrd(@@’). The map is surjective. If [I]są is in the 
kernel and nrd(/) is trivial, then so too is nrd(@) where @ corresponds to I 6 O; by 
strong approximation, this means that there exists 6 € GL2(B) and f € GL2(O) such 
that €; = By and so via B we have I 6 O = O®, which means [J]s, = [O]. Oo 


Exercises 


Unless otherwise specified, let R be a global ring with eligible set S and F = Frac R, 
and let B be a quaternion algebra over F, and let O C B be an R-order. 


1. Give another proof weak approximation for Q, as follows. 


(a) Let SC PIQ be a finite, nonempty set of places. Show that, for each v € S, 
there exists yy € Q% such that |y,|, < 1 and |yyly > 1 for all v’ € S\{v}. 
[Hint: let yoo := [pes 1/p and to get yp, multiply yoo by a power of p.] 

(b) For the elements y, constructed in (a), show that for all v € S we have 
1/d+y?) — 1 € Q, and 1/0. + y7) > 0 € Qv for v’ + v. [How does this 
relate to the proof of the CRT?] 
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(c) Prove weak approximation. /Hint: given x, E€ Q, for each v € S, show that 


Xy 
nie Gow eQ, 
1l+yJ 


veS 


asn — oo, ] 


. Show that for all N > 1, the group SL2(Z/NZ) is generated by two elements of 


order N. 


. Let F bea field and let n € Zs. Show that the elementary matrices (which differ 


from the identity matrix in exactly one off-diagonal place) generate SL, (F) 
as a group. [Hint: Argue by induction, and reduce a matrix to the identity by 
elementary row and column operations. ] 


. Letn > 2. 


(a) Let R be a Euclidean domain. Show that the elementary matrices generate 
SL, (R). [Hint: In view of Lemma 28.3.3, argue by induction. ] 


(b) Using (a), show that Proposition 28.3.5 and Corollary 28.3.6 hold for SL,,. 


. Let B be a central simple algebra over the global field F. We say B satisfies the 


S-Eichler condition if there exists a place v € S such that B, is not a division 
algebra. Show that if F is a number field and S is the set of archimedean places 
of F, then B satisfies the S-Eichler condition if and only if B is not a totally 
definite quaternion algebra. 


. Inthis exercise, we provide details in the proof of Proposition 28.3.5, considering 


elementary matrices of rings that are not necessarily domains. 

Let A be a commutative ring (with 1), not necessarily a domain. We say A is 
Euclidean if there exists a function N: A — Zso such that for all a,b € A with 
b + 0, there exists q,r € A such that a = qb +r and N(r) < N(b). 

In the first parts, we suppose A is Euclidean with respect to N and show that the 
situation is quite analogous to the case when A is a domain. 


(a) Show that for all b € A we have N(b) > N(O) with equality if and only if 
b=0. 

(b) Let m = min{N(b) : b + 0}. Show that if N(b) = m then b € A”. 

(c) Show that every ideal of A is principal. [We call A a principal ideal ring. ] 


We now consider examples. 


(d) Let A be an Artinian local principal ideal ring with a unique maximal ideal 
m = 7A. (For example, we may take A = R/p* where R is a Dedekind 
domain, p is a nonzero prime ideal, and e € Z 9.) Show that for all x € A 
with x # 0, there exists u € A* and a unique n € Zso such that x = un”. 
Conclude that A is Euclidean with N(x) = n. 

(e) If A ~ []}_, A; with each A; an Artinian local principal ideal ring, show 
that A is Euclidean under N(x) = $r; Ni(ai(x)) where N; is as given in 
(a) for A; and z;: A — A; is the projection. 


We conclude with the application. 
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(f) Let A be a Euclidean ring. Show that SL2(A) is generated by elementary 
matrices. [Hint: Show that the proof in Lemma 28.3.3 carries over. ] 


7. Consider the quaternion algebra B := (—11,-17| Q) and let O = Z@Zi@Zje@ 
Zij. Then B is definite, so strong approximation does not apply. Indeed, show 
that B! ¢ BXO% as follows. 


(a) Find a, b,c,d,m € Z with 3 { m such that 
a +11b* =c? + 17d? = 3m. 
(b) Now let @ = (@p)p € BX be such that 


+ bi)(c — dj 
ae jee a L 
3m 
and a, = 1 if p + 3. Show that @ € B! and @ ¢ BXO*. [Hint: Observe 
that nrd |o only represents 9 by +3.] 


(c) Prove that the right O-ideals J; := 30 + (a + bi)O and h := 30+(c+di)O 
are not principal. How does this relate to (b)? 


8. Let F be a number field with ring of integers R. Show that there is a finite set S 
of (rational) primes such that every totally positive element of R can be written 
as a sum of four squares of elements of F whose denominator is a product of 
primes in S. (We may not be able to write every such element as sum of four 
squares from R, but we only need denominators in S.) 

9. Suppose that B is S-indefinite. Suppose O is locally norm-maximal. Give a 
direct proof using strong approximation that if O C B is an R-order and 7 is an 
invertible right fractional O-ideal, then / is principal if and only if [nrd(/)] is 
trivial in Clo R. [Hint: If a € B* satisfies nrd(a)R = nrd(J), consider aL] 

> 10. Show that BXO* n B! = B'O! if and only if nrd(O*) = FX o N nrd(O%) (see 
28.4.7). 

11. Suppose that B is S-indefinite. Show that # Typ O is a power of 2. 

12. Suppose disc B = Ð and O is an Eichler order of level Wt and reduced discrim- 
inant N = DM. Show that Clea oy R is the quotient of (C1 R)/(Cl R)* by the 
subgroup generated by the classes of ideals p | D together with q | St such that 
ord, (9) is odd. 


Conclude that for B = M2(F)and O = p a , the type set Typ O is represented 


by the isomorphism classes of the orders a 1 4 for [b] € Clenoy R. 
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Chapter 29 


Idelic zeta functions 


In this chapter, we present an idelic formulation of zeta functions associated to central 
simple algebras over global fields and we prove that they have analytic continuation 
and functional equation. 


To read beyond the first two introductory sections, the additional prerequisite of 
real analysis at the level of measure theory is recommended; happily, the statements of 
the essential conclusions (for example, the functional equation of the zeta function of 
a quaternion algebra) can be understood without this background. 


29.1 > Poisson summation and the Riemann zeta function 


We begin with some essential motivation. Recall that the Riemann zeta function 
f(s) := De n defined in (25.2.2), Following Riemann, we complete to the function 
&(s) := f(s) Ip(s), where 
p(s) := n T(s/2) (29.1.1) 
and I°(s) is the complex I -function (Exercise 26.2). Then &(s) extends to a meromorphic 
function on C and satisfies the functional equation 


EC - s) = (s). (29.1.2) 


(More generally, we saw in 26.8.2 that the Dedekind zeta function of a number field 
can be completed in an analogous manner, again with functional equation.) 


In this introductory section, we sketch a proof of the functional equation and see 
it as a consequence of Poisson summation (arising naturally in Fourier analysis, still 
following Riemann). Then, following Tate we reinterpret this extra factor in a manner 
that realizes the zeta function as a zeta integral on an adelic space, giving a uniform 
description and making the whole setup more suitable for analysis. 


To begin, we the function &(s) itself as an integral. Looking at one term in &(s), we 
have 


fS pdx =a o du 

nPI = rn | etx? = | eT My s/2 (29.1.3) 
0 x 0 u 
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Accordingly, we define the theta function 


o0 


@(u) := F: ehu iy D (29.1.4) 
n=1 


n=—0 


studied by Jacobi, convergent absolutely to a holomorphic function on the right half- 
plane Re u > 0. Summing over n > 1 gives 


ir 5/2. du 
&(s)=5 T (O(u) — 1)u?!* — (29.1.5) 
2 Jo u 


valid for Res > 1, where the integral converges (so we may justify interchanging 
summation and integration). We will soon rewrite this integral so as to extend its 
definition to all s € C apart from s = 0, 1. 


Let f: R > C be a Schwartz function, so f is infinitely differentiable and every 
derivative decays rapidly (for all m,n > 0 we have |f™(x)| = O(x7™”) as |x| = œ). 
We define the Fourier transform f“ : R > C of f by 


fO) = if i e? f(x)dx. (29.1.6) 


o0 


Theorem 29.1.7 (Poisson summation). We have 


y f(m) = 5 f(n), (29.1.8) 


m=— 0 n=—co 


with both sums converging absolutely. 


Proof. The condition that f is Schwartz ensures good analytic behavior allowing 
the interchange of sum and integral, the details of which we elide. Let g(x) := 
oo f(x +m). Then g is periodic with period 1, so by Fourier expansion we 


m= 


have g(x) = 1. ane?7!"* where 


n=—oco 


1 1 œ% 
anaE i g(x)e27"™* dx = f > f(x +m)?” ™* dx 
0 On peas 


pa 1 fo) 
= X J f(x+ mye 2tin(x+m) dx = J fxe? deo F). 
m=—oo 0 m9. 


Thus s z 
DO OED fm). o 


n=—oco 


Now take f(x) = e* Then f is Schwartz. By contour integration and using 
Te er dx = 1, we conclude that f’(y) = f(y) for all y. For u > 0, let f(x) := 
f(ux); then fY) = u`! fiju(y) by change of variable. Applying Poisson summation 
to fy@(x) then gives 
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1 
O(u) = —O(1/u). 29.1.9 
(u) NF (1/u) ( ) 
The equation (29.1.9) implies the functional equation (29.1.2) for (s) as follows: 
we split up the integral (29.1.5) as 


eosi Power st [ew - pw? 


and apply the change of variable u < 1/u to obtain 


! co 
d d 2 pd 
f OwP = ie Ou - _=_ 4 I (@(u) — Iu 
0 u 1 u s—1l 1 u 


Putting these together, we have 


1 [7 a2 d 1 1 
&(s) = sf (@(u) — 1U? + uE _ a (29.1.10) 
2 Ji u s l-s 
which is sensible as a meromorphic function for all s € C, holomorphic except for 
s = 0, 1, with the right-hand side visibly invariant under s — 1 — s. This establishes 
(29. 1:2). 


This method extends to prove the functional equation for the L-series L(s, 7) where 
xX is a Dirichlet character (now involving a Gauss sum); Hecke extended this method 
(generalizing the appropriate theta functions) to prove the functional equation for a 
wider class of functions, including the Dedekind zeta functions. 


29.2 »Idelic zeta functions, after Tate 


Now convinced of the utility of integral representations, we seek to put the finite places 
on an equal footing. In this way, the inclusion Z C R in the Fourier analysis above is 
replaced by Q C Q. Recall that ¢(s) = [Ip ¢p(s) where 


Q) = Dip = (psy: 
e=0 


we recover these factors from an integral. First we need a measure to integrate against. 
We have Zp = lim | Z/p"Z as a projective limit with compatible projection maps 
: Zp > Z/p"Z. We define the measure on Zp as the projective limit of the 
o measures on each Z/p”Z with total measure 1, i.e., for a set E C Zp we 
define #n,(E) 

Tn 

E) := lim ——— 

Mp(E) ouy n 

when this limit exists. The measure extends additively to Q, = Zp[1/p] and is invariant 
under additive translation 


pla + E) = up(E) forall a € Zp; 
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accordingly, 4p is the standard Haar measure on Q, (see 29.3). We have normalized 
the measure so that 


Hp(Zp) = I dup (x) =1. 


p 


Since Z% = ZpNpZp and Zņ = yar: + pZp) we have p(pZ,,) = 1/p and 


Up(Z%)=1- 1/p. (29.2.1) 
Similarly, we have a standard Haar measure u% on Q% by 
iy 
du (x) ‘= (! _ z) Hp(x)_ 
P IXIp 


the measure is invariant under x + ax for a € QX as well as under the substitution 
x — x7!, and with our normalization we have 


1 -1 
wea = [1-5] Hp (ZX) = 1. 


For a complex number s € C, on our way to recover ¢,(s) we consider the 
(Lebesgue) integral 
i xl, dys (x). (29.2.2) 
Zp \{0} 
This looks a bit weird at first, because it is not over a subgroup of Q5 or anything. 
Nevertheless, it works! For every nonzero x € ZpN{0}, we may write x = p°xo with 
xo € Zs and e > 0, therefore the integral can be written as a sum over the level sets 


fgg ee 
is 


P 


S —es =S -1 
Les, duX(x) = D7 poe rar T g (29.2.3) 
{0} e=0 

It is more common to rewrite this as an integral over Q% by letting Y, be the charac- 
teristic function of Z, \{0}, so that 


f ll da% (x) = f lp Yp) dy% (x). (29.2.4) 
Zp N{0} Q% 

In asimilar fashion, we define the measure Ho on R by dx/|x|, and to match (29.1.1) 
we define ‘¥..(x) = ere Putting these together, on the idele group Q* we define the 
product measure u* = [], 4%, the function ¥(x) = [], %(x,), and absolute value 
lxil = [][ylav ly (trivial on Q* by the product formula). We have then repackaged the 
zeta function as an adelic integral 


gs) = f PEO: (29.2.5) 


This is really nice! 
Rewritten in this idelic way, Tate [Tate67] in his Ph.D. thesis elegantly proved the 
functional equation for a wide class of zeta functions (and L-functions): 
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[T]he role of Hecke’s complicated theta-formulas for theta functions 
formed over a lattice in the n-dimensional space of classical number the- 
ory can be played by a simple Poisson formula [...], the number theoretic 
analogue of the Riemann—Roch theorem” [Tate67, p. 305-306]. 


In this chapter, we use this method of Poisson summation and idelic integrals to 
prove the basic properties of the zeta function of a central simple algebra over a global 
field (Main Theorem 29.10.1). Translated back into classical language, we prove as 
a consequence the key result (the crux of which is Theorem 26.8.19) announced in 
section 26.8. 


Main Theorem 29.2.6. Let F be a number field with ring of integers R = Zr and let 
B be a quaternion algebra over F with maximal order O. Let 


nt) = ) NUYS 


IcO 


be the sum over nonzero right O-ideals, where N is the absolute (counting) norm 
(16.4.7) and let g(s) be its completion (26.8.13). Then Eg(s) has meromorphic con- 
tinuation to C, holomorphic away from {0, 1/2, 1} with simple poles at s = 0,1, and it 
Satisfies the functional equation 


ép(1 — s) = €p(s). (29.2.7) 
Moreover, if B is a division algebra, then p(s) is holomorphic at s = 1/2. 


Main Theorem 29.2.6 is an analytic result with key arithmetic consequences, 
including the classification of quaternion algebras over global fields, as we saw in 
section 26.8. More than that, the evaluation of the residue will further give rise to a 
volume formula (Main Theorem 39.1.8) that generalizes the Eichler mass formula (as 
in the proof of Proposition 26.5.10). 

The developments in this chapter have a rich history, and they generalize vastly 
beyond this text: see Remark 29.10.24. We restrict ourselves to the case of zeta functions 
both because this suffices for our main applications and because it is a good stepping 
stone to the more general theory. Although this chapter is quite technical, the reader’s 
forbearance will ultimately be rewarded! 


29.3 Measures 


In this section, we define the local measures we will use. As references for this 
section and the next, see Bekka—de la Harpe—Valette [BHV2008, Appendices A and 
B], Deitmar [Dei2005], Deitmar—Echterhoff [DE2009, Chapters 1 and 3, Appendix B], 
Loomis [Lo0053], and Ramakrishnan—Valenza [RM99, Chapters 1—4] as references on 
harmonic analysis, and Vignéras [Vig80a, $11.4] and Weil [Weil74, Chapter XI] for 
the present context. 

Let G be a Hausdorff, locally compact, second countable topological group. For 
example, we may take G = SL2(R) or G = SL,,(R) (or more generally a semisimple 
real Lie group). A Borel measure on G is a countably additive function, with values 
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in [0, co], defined on the o-algebra generated by open sets in G under complement and 
finite or countable unions. 


Definition 29.3.1. A Radon measure on G is a Borel measure that is finite on compact 
sets. A left Haar measure on G is a nonzero Radon measure y that is left translation- 
invariant, so u(gE) = u(E) for all Borel subsets EF € G and all g € G. 


The notion of right Haar measure is defined similarly. 


Remark 29.3.2. Since G is second countable and locally compact, every open subset 
of G is o-compact (the union of a countable collection of compact subspaces), so a 
Radon measure is necessarily regular, i.e., the measure of a Borel set is the infimum 
of the measures of its open supersets and the measure of an open set is the supremum 
of the measures of its compact subsets: see Rudin [Rud87, Theorem 2.18]. 


Proposition 29.3.3. G admits a left Haar measure that is unique up to scaling by an 
element of Ro. 


Proof. See e.g. Deitmar—Echterhoff [DE2009, Theorem 1.3.4] or Diestel—Spalsbury 
[DS2014, §7.2]. o 


We will construct Haar measures explicitly as we need them, so we need not appeal 
to the general result of Proposition 29.3.3 beyond the uniqueness statement which is 
itself straightforward to establish: see Exercise 29.4. 


Example 29.3.4. On G = R” under addition, a left (and right) Haar measure is given 
by the usual Lebesgue measure. 


Example 29.3.5. Suppose G has the discrete topology (all sets are open). Then the 
counting measure u(E) = #E € Zsg U {oo} is a left (and right) Haar measure. 


From now on, let u be a left Haar measure on G. 


29.3.6. In general, a left Haar measure need not also be right translation-invariant. For 
g € G, the measure defined by yg(E) := u(Eg) for a Borel set E is again a left Haar 
measure, so by Proposition 29.3.3, we have ug = Ag(g)u for some Ag(g) € Rso. 
Since u is unique up to scaling, the function Ag(g) does not depend on u. 


Definition 29.3.7. The map Ag: G — Rs» is the modular function of G. We say G 
is unimodular if AG is identically 1. 


By 29.3.6, G is unimodular if and only if every left Haar measure is a right Haar 
measure. 


Lemma 29.3.8. The modular function AG is a homomorphism. 
Proof. Exercise 29.5. oO 


Example 29.3.9. If G is abelian or discrete (see Example 29.3.5), then G is unimodular: 
every left Haar measure is a right Haar measure. (There are many other important 
classes of groups that are unimodular, including discrete (e.g., finite) groups, compact 
groups, and the others we will in what follows.) 
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29.3.10. In this paragraph, we briefly review the theory of integration we need; for 
further details, see Deitmar—Echterhoff [DE2009, Appendix B.1]. Let f: G > C bea 
complex-valued function on G. We say f is measurable if for all Borel sets E C C, the 
subset f—!(E) is measurable; we say f is real(-valued) if f(G) C R and is nonnegative 
if f(G) € Ryo. 

If E is a measurable set then the characteristic function 1g of E (equal to 1 on E 
and 0 outside E) is defined to have integral 


fre) dx) = Me) = f du(x). (29.3.11) 
G E 


A step function is a finite C-linear combination of characteristic functions of measur- 
able sets, and we define the integral of a step function by linearity using (29.3.11). 
If f is measurable and nonnegative, we define 


g anonnegative step function saat 


1 f(x) du(x) := sup n g0) du(x) : that g(x) < f(x) for all x € G 


and say f is integrable if Jo f(x) du(x) < œ. 
If f is real we say f is integrable if both f* := max(f,0) and f~ := — min( f, 0) 
are integrable. If f is real and integral, we then define 


1 VOLTOR: f f E) du(x) - f F dua). (29.3.12) 
G G G 


Finally, a general f is integrable if Re f and Im f are integrable, in which case 
f F(x) dua) = f Re f(x) du(x) +i I Im f(x) du(x). 
G G G 


29.3.13. Let H be another Hausdorff, locally compact, second countable topological 
group, and let ¢: G — H be continuous surjection with kernel N := ker ¢. Then we 
have an exact sequence 


(Wye Sara, 


We say that left Haar measures on G, N, H are compatible if for all integrable functions 
f[f:GoC, 


f F(x) dux) = f (/ fey) ay) du(p(y)). (29.3.14) 
G H\JN 

Given measures on two terms, there exists a unique compatible measure on the third— 
but note, this measure depends on the exact sequence (Exercise 29.7). 

29.4 Modulus and Fourier inversion 


We continue our background review with notation from the previous section; we now 
treat the modulus of an automorphism and present Fourier inversion. 


510 CHAPTER 29. IDELIC ZETA FUNCTIONS 


29.4.1. Let ¢ € Aut(G) be a continuous automorphism. Then the measure defined by 
Hg(E) := u(ġ(E)) is again a left Haar measure. By Proposition 29.3.3, there exists a 
unique ||¢]| € R>o such that 

HM(PE) = |IP|lM(E) (29.4.2) 


for all measurable sets £. In particular, for all integrable functions f on G we have 
I F(x) dua) = I FCG) du(b(x)) = liġi ni FOO) dul). (29.4.3) 


Definition 29.4.4. The modulus of ¢ € Aut(G) is |||] € R>o as defined in (29.4.2). 


The definition of the modulus is independent of the choice of Haar measure p, 
since Haar measure is unique up to scaling and such a scalar cancels in (29.4.2). 


Example 29.4.5. Let g € G; then the conjugation automorphism g(x) = g7!xg has 
llġgl| = Ag(g), because yp is left-invariant. In particular, G is unimodular if and only if 
all conjugation maps have trivial modulus. 


Now let A be a Hausdorff, locally compact, second countable topological ring 
and let u be a left Haar measure on the additive group of A. Since A is abelian, A is 
unimodular and u is also a right Haar measure. 


29.4.6. Let a € A*. Then the left multiplication map Aa : A > A by x & ax is 
a continuous automorphism of A as an additive abelian group, so we may define its 
modulus ||q|| := ||Ag|| by (29.4.2): symbolically, we write 


du(ax) 


llal| = ; (29.4.7) 
du(x) 
We have ||ab|| = |la\|||b|| for all a, b € A* (Exercise 29.6). 
29.4.8. The measure on A* defined by 
d 
d(x) := me (29.4.9) 


is a (multiplicative) Haar measure by 29.4.6. 
We conclude this introductory section with the Fourier inversion formula. 


29.4.10. Let C! := {z € C : |z| = 1} be the circle group. A unitary character 
of A is a continuous homomorphism y: A — C!, considering A as an additive 
group. Let AY := Hom(A, C!) be the unitary character group of A under pointwise 
multiplication, and equip AY with the compact-open topology (as a closed subset of 
the set of all continuous maps A > C!). 


Remark 29.4.11. We reserve the term character for continuous group homomorphisms 
A — K*, where K is a field (of values for the character); this notion makes sense for 
any field K. Some authors call unitary characters just characters, then calling our 
characters instead quasi-characters. 
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Now suppose that there exists y € AY such that the map 
A- AY 
xe (yh wry) 
is an isomorphism of topological groups. (This is a hypothesis on A, and may depend 


on a choice; in the cases we consider below, we will identify a standard such w.) 


Definition 29.4.12. For f : A — C continuous and integrable, the Fourier transform 
of f (relative to y, u) is the function 


fY: ASC 
ji ae I f(y Waxy) duty). 


Theorem 29.4.13 (Fourier inversion). There exists a unique Haar measure t on A 
(depending on y) such that for all f: A — C continuous and integrable with fY 
(defined relative to T) continuous and integrable, we have 


flx) = [ FY Way) drO). (29.4.14) 


Proof. The proof of this theorem is beyond the scope of this textbook, and we use it as 
a black box; see e.g. Deitmar—Echterhoff [DE2009, Theorem 3.5.8] or Folland [Fol95, 
Theorems 4.32-4.33]. o 


The normalized measure t in Theorem 29.4.13 is called the self-dual measure on 
A (with respect to y). 


29.5 Local measures and zeta functions: archimedean case 


Let B be a finite-dimensional simple algebra over the local field F = R. A good general 
reference for the next three sections is Weil [Weil82, Chapter IT]. 


29.5.1. The (additive) Haar measure u on B is the usual (Lebesgue) measure, normal- 
ized as follows: letting n = dimp B, we choose an R-basis e1, . . ., €n for B, so that we 
may write x = >); x;e; E€ B with x; are coordinates on B, and we define 


dx := |d(e,.. n epl!’ dxı +- dXn 


where d is the discriminant defined by (15.2.1) and the reduced trace is taken on B as 
an R-algebra. By Lemma 15.2.5, we see that this measure is independent of the choice 
of basis e;. 

Another application of Lemma 15.2.5 then gives the modulus 


lla|| = INmgir(@)| (29.5.2) 


for all a € B*. 
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Example 29.5.3. We compute: 


dx = dx, for x € R; 
dz = 2dx dy, for z = x + yV-1 € C; 
dæ = 4dt dx dy dz, for æ =t + xi + yj + zk € H; 
da = I] dxij, for æ = (xij); € Ma (R); 
ij 
da = 2" | | dxij dy, for a = (xij + yy V-D j € Mn(C). 
LJ 
And: 
Ix|l = Ixl, for x € R*; 
llzll = lz}, for z € C*; 
jall = nrd(a)’, for a € H*; 
lal] = |det(a)|", for a € GL, (R); 
la|| = |det(a)|?”, for a € GL,(C). 


The modulus for R is the usual absolute value, whereas the modulus for C is the 
square of the absolute value (cf. Remark 12.2.3), explaining conventions on normalized 
absolute values in the product formula 14.4.6. 


29.5.4. As in (29.4.9), the (multiplicative) Haar measure u% on B* is defined by 


du(@) 
lell ` 


du” (a) := (29.5.5) 


29.5.6. We define the standard unitary character 
y: BoC! 
a => exp(—2ri trd(a@)) 


with reduced trace taken on B as an R-algebra. We note that Y(@« 8) = W(Ga) for all 
a, p EB. 


A fundamental result in standard Fourier analysis (generalizing the case B = R, C 
and following from it in the same way) is the following proposition. 


Proposition 29.5.7. The standard unitary character induces an isomorphism B = BY 
of topological groups, and the measure u defined in 29.5.1 is self-dual (with respect 
to y). 


In light of Proposition 29.5.7, we will also write t = u for the measure defined 
above. 
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29.5.8. The exact sequence 


1> HE > E $ Ro > 1 


defines a measure t! on H! by compatibility, taking the normalized measures T* on 
H* and on Ryo < R*. 


Lemma 29.5.9. We have t! (H!) = 47°. 
Proof. Let p > 0 and let 
E = {a € H* : nrd(a) < p*} 


be the punctured ball of radius p. Let f be the function ||a|| on E, zero elsewhere. On 


the one hand, r 
I fla) dr*(a) = f i aE, 
a plela 


Recalling that t is 4 times Lebesgue measure, and that the Lebesgue measure of a 
sphere of radius p has volume 7?pt/2, we get T(E) = 27?p*. On the other hand, by 
compatibility, this integral is equal to 


P 2rd 
f f tenataneey= f f terie = 
Ryo JH! o JH r 


-aimi Lo Ag. wat 1 p* 
= 27 (H) Papa el 5 
0 


We conclude that t!(H!) = 47°. o 


We will need further functions to integrate, so we make the following definition. 
When we consider functions on B, we may think of choosing an R-basis e1, .. ., €n of 
B, identifying B ~ R” and writing x = >); xie; € B; our definitions will be independent 
of this choice of basis. 


Definition 29.5.10. A function f: B — C decays rapidly if for all n > 0 we have 
IFœD| = O((max;lx:|)7”) as max;|x;| > ov. 

A function f: B — C is Schwartz(-Bruhat) if f is infinitely differentiable and 
every partial derivative of f decays rapidly. 


29.5.11. Let *: B — B be the conjugate transpose involution (see 8.4.3) on B. Let 
Q(a) := trd(aa*) for a € B. We compute that 


wD =x, for x € R; 

Oz) = Az? =207? +y), forz =x + yV-1 eC; 

Q(a) = 2nrd(a), for a € H; 

O(a) = » Xs for a= (xij) j € M, (R); and (29.5.12) 


ij 
Qla) = 2 X lzy, for æ = (zj)j € Mx (©). 


ij 
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Definition 29.5.13. The standard function on B is 
~: BoC 
F(a) = exp(-7Q(a)). 


It is straightforward to verify that the standard function is Schwartz. 


(29.5.14) 


Definition 29.5.15. For a Schwartz function ®, we define the (local) zeta function 
Z2(s) = | ®(a)|la||*° dt*(a@) (29.5.16) 
BX 


wherever this integral converges. We abbreviate Z% (s) = Zp(s) for ® = Y the standard 
function. 


As in (26.8.3), we define 


.— „sS/2 
ees Tisa) (29.5.17) 
Lels) := 222) Tis). 


Lemma 29.5.18. We have 
Zg(s) = IrR(s), 
Ze(s) = aT c(s), 
Zyls) = n(2s — 1) p25) p(2s — 1) = 20° R(25)TR(2s + 1), 
Zmp(R)(S) = 70 R(25)Pa(25 — 1), 
Zmyc)(s) = 20° Tc(28) Mc (2s — 1). 


ia d 
Zp(s) = af em &, 
0 


x 


Proof. We have 


making the substitution x < mx? gives the result. A similar argument with polar 
coordinates gives Zc(s). The remaining integrals are pretty fun, so they are left as 
Exercise 29.8. o 


Lemma 29.5.18 explains the provenance of the definitions of IR(s), Mc(s) from 
(26.8.3), and ultimately their appearance in (26.8.14). (A quick check on the constant in 
frontis provided by Zg(1) = 1; these particular choices of constants follow convention.) 
By comparison, Lemma 29.5.18 then shows that in general Z?(s) has meromorphic 
continuation to C. 


29.6 Local measures: commutative nonarchimedean case 


Now let F be a nonarchimedean local field, with valuation v, valuation ring R, and 
maximal ideal p = 7R C R. Let q := #R/p. 

We begin by defining a Haar measure normalized so that R has measure 1; then we 
extend this to find the normalization in which the measure is self-dual with respect to 
a standard unitary character. 
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29.6.1. We have R = lim R/p” with projection maps z,: R —> R/p”. We define the 
(additive) measure u on R by 

#7), (E 

WE) := lim a 


for a subset E C R, when this limit exists. It is straightforward to check that E defines a 
Haar measure on R, and the measure extends to F by additivity. In this normalization, 
we have u(R) = 1, and more generally u(a + p”) = 1/q” fora € R and n = 0. 


Let | | be the preferred absolute value (see 14.4.12), with |z| = 1/q. 


Lemma 29.6.2. For all x e F*, we have 
Ill] = Ixl; (29.6.3) 


in particular, if x € R, then |x| = N(Rx)! where N is the counting norm (16.4.7). 


Proof. By multiplicativity, we may suppose x € R. Then [R : xR] = gq’ = N(Rx). 
Adding up cosets of xR in R, we obtain 


UCR) = [R : xR] (xR) = gq? (xR); 


so ||x|| = u(xR)/u(R) = ge) = |x|, as claimed. o 


Example 29.6.4. Since u(R*) = u(R) — u(p) and 


H) = |lzllu(R) = 1/4, 
we have i 
MCR*) = 1 - 7 (29.6.5) 


29.6.6. We normalize the (multiplicative) Haar measure u% on F* by defining 


1 d(x) ı d(x) 
H(R*) |ixll Ilx] 


du” (x) := =(1- 1/4) 
so that u*(R*) = 1. 
Next, we consider the Fourier transform in this context. 
29.6.7. We first define an additive homomorphism 
Or: F > R/Z 
as follows. 
(a) If F = Qp, we define (x)Q, € Q to be such that 0 < (x)g, < 1 and x - (xQ, € 


Zp- 
(b) If F = F,((¢)) and x = Di; a,t' then we take (x) p := Trp, /F, (4-1)/p. 
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(c) In general, if F 2 Fo is a finite separable extension of fields, then we define 
(x)r i= (TPF / Fy X) Fy. 
Definition 29.6.8. The standard unitary character of F is 
Wr: F> C! 
Wr(x) = exp(2ri(x})r). 
Proposition 29.6.9. The standard unitary character Yp defines an isomorphism 
F > FY 
xe (yh wary) 
of topological groups. 
Proof. Exercise 29.9. oO 
Proposition 29.6.10. The following statements hold. 


(a) If F is a local number field (so a finite extension of Qp for a prime p), then the 


measure 
T := |disc(R)|!/? u = N(dise RY? u 


is self-dual (with respect to y). 
(b) If F is a local function field, then u is self-dual with respect to y. 


Proof. First consider (a), and suppose F is a finite extension of Q,. We seek to satisfy 
(29.4.14); the equation holds up to a constant, so we may choose appropriate f and x. 
We choose f as the characteristic function of R and x = 0, so that f(0) = 1 and 


f | | Fwod) du(y) = | f) | i #02) du] du(z). (29.6.11) 
F F F F 


By character theory, 


[wo du(y) = wy € F : W(yz) = 1) = wy € F : Trr jQ, (yz) € Zp}) 


so (29.6.11) is equal to p(R#) where 
RË = codiff(R) = {x € F : Trr/g, (xR) € Zp}. 
# 


Let x; be a Zp-basis for R with x; the dual basis, giving a Zp-basis for RË. By Lemma 


15.6.17 we have 
disc(R) = [RË : R]z, 
so since u(R) = 1 by additivity we have 
u(RË) = |disc(R)[!. 
It follows then that 
T = (RI? u = |disc(R)| "u 


is self-dual. o 
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29.7 Local zeta functions: nonarchimedean case 


Continuing with F a nonarchimedean local field, let B be a finite-dimensional simple 
(not necessarily central) algebra over F, with maximal order O, and n = dimz(s) B. 


29.7.1. The (additive) Haar measure u on B is defined as in 29.6.1 as a projective limit, 
normalized so that w(O) = 1. 

We compute that ||a||z = |INmgjr(@)||r, so that if B is central over F with dimp B = 
n’, then |la||z = ||nrd(@)"||,-. In the remainder of this section, we work over B and drop 
the subscript g. 


29.7.2. For the (multiplicative) Haar measure u% on B*, we use the same normalization 
factor (29.6.5) as for F*, defining the normalized measure 


ı du(@) 


dp“ (a) = (1 - gy! 


(29.7.3) 


29.7.4. We extend the standard unitary character on F, defined in 29.6.7, to a standard 
unitary character on B by 


WB(a) := Yr (trd(a)) 
for a € B. Again, (af) = W(Ba) for all a, £ € B. 


If F is a local number field containing Qp, let Ro = Zp; if F is a local function 
field, let Ro = R. 


Definition 29.7.5. The absolute discriminant of B is 
D(B) := N(discr,(O)) = |discr,(O)|"! € Zso. 


The absolute discriminant is well-defined, independent of the choice of maximal 
order. 


Example 29.7.6. If B is a quaternion algebra over F, then 
D(B) = N(discr,(O)) = N(discr,(R))N(discrdr(O))’. 


Proposition 29.7.7. The standard unitary character w defines an isomorphism B œ 
BY of topological groups, and the measure 


T := D(B) "° u (29.7.8) 
is self-dual with respect to w. 


Proof. The same arguments as in Propositions 29.6.9 and 29.6.10 apply, with appro- 
priate modifications. o 


29.7.9. Having normalized the multiplicative measure u% and seeing the relevant 
modification in (29.7.8), just as in the case of B = F we define 


T” := D(B) P? u>. (29.7.10) 
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We may now define the local zeta function in the nonarchimedean context. 


Definition 29.7.11. A function f: B — C is Schwartz—Bruhat if f is locally constant 
(for every a € B, there exists an open neighborhood U > x such that f|y is constant) 
with compact support. A standard function Y¥ on B is the characteristic function of a 
maximal order. 


Taking an open cover of the support, we see that every Schwartz—Bruhat function 
can be expressed as a (finite) C-linear combination of characteristic functions of 
compact open subsets of B. 


Definition 29.7.12. For a Schwartz—Bruhat function ®, we define the (local) zeta 
function 
ZR(s) = I lal D(a) dr* (a). (29.7.13) 
BX 


We write 
Zg(s) := Z% (s) = f llall? dt* (a) (29.7.14) 
OnB* 


for the zeta function with respect to a standard function ¥; this is well-defined (inde- 
pendent of ¥) as any two maximal orders are conjugate, and ||a|| is well-defined on 
conjugacy classes by 29.7.1. 

We begin with a basic convergence estimate. 


Lemma 29.7.15. The function ZR(s) converges for Res > 1. 


Proof. We reduce to the case where © is the characteristic function of a compact open 
set U, since © is a (finite) C-linear combination of such. Since U is compact, ||a|| is 
bounded on U, so too is ila|[o-! for fixed Re s > 1; thus 


f lell" dt” (e) = | lal’! de(a) 
UnB* OnB* 


is bounded by a constant multiple of t(U) < ov, and therefore Za (8) is bounded by 
comparison. o 


This integral representation recovers the classical zeta function we studied earlier 
in the number field case (section 26.4). 


29.7.16. Since there is a unique ideal in R of absolute norm g° for each e > 0, we have 


o0 


1 F 
irs) =)" ao qy". 


e=0 


In like fashion, for e > 0, let ape = ape (O) be the number of (necessarily principal) 
right ideals of O of reduced norm pê and thereby absolute norm q”°. We then define 


1 2, dpe 
tots) d a = Qa ge 


IcO 


the sum as usual over all nonzero right ideals of O. 
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Lemma 29.7.17. The following statements hold. 


(a) We have 
Zg(s) = T*(O*)ER(5); 


in particular, the domain of convergence of Zp(s) is the same as €p(s). 


(b) If B is a quaternion algebra, then ¢g(s) converges for Re s > 1/2, and 


Cr(2s)ér(2s— 1), if B~ MF). 


(c) If B = M,(F), then €p(s) converges for Res > 1 —1/n, and 


ér(2s), if B is a division algebra; 
fa(s) = 


n-1 
éa(s) = | | ges -ò 


i=0 


Proof. First part (a). For e > 0, choose representatives for the dpe = ape (O) classes in 
O/O* with reduced norm p°. Every element a € O N B* can be written as the product 
of one of the representatives and an element of O%; since ||a|| = |nrd(a)”| we have 


dpe 


= T*(O%)ćg(s) (29.7.18) 


Zp(s) = / lal dr (a) = 1*0% $ 
OnB* eno 4 

as claimed. For (b) and (c), we gave a formula for dye for B = M, (F) in Exercise 26.12 

and for B a quaternion algebra in Corollary 26.4.7 (which extends to the function 

field case without change). Of course, these identities only hold within their respective 

domains of convergence, which for ¢-(s) = (1 —q~*)"! is Re s > 0, thereby giving the 

rest. o 


In the proof of the functional equation, we will need the following proposition, 
recalling duality (section 15.6). 


Proposition 29.7.19. For ¥ the characteristic function of O, we have 


ee i = (By, ifa € OF 
0, otherwise. 
Moreover, g 
Zg (8) = D(B) ZE (8). (29.7.20) 


Proof. For the first statement, by definition we have 
Wa) = | waparis) 
O 


If œ € OF then y (aß) = 1 for all B € O, and we obtain 
PY (a) = 1(O) = DBY? (O) = DBY’. 
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Otherwise, a ¢ O#, and by character theory PY (a) = 0. 
Since O is maximal, we have OË = O6 for some 6 € B* with ||6|| = D(B). Therefore 


Zp) = vB"? [altar = oar"? [asi ao) 
N x 


ng (29.7.21) 
pay? J lali? dr*(œ) = D(B) T"? Z} (5), 
OnB* 


proving the second statement. 


We conclude this section with some hopefully illustrative computations of measure. 


Lemma 29.7.22. Let B be a quaternion algebra over F. Then 


(0) = 1+ Md ie is a division ring; (29.7.23) 
1-1/q@, if B= Mo(F) 


Proof. If B is a division ring, then O is the valuation ring; let J C O be the maximal 
ideal, so O/J =~ Fg2 hence (J) = 1/q?, and then 


xX) — -1 x 1- 1/4? = 
K” (O*) = (1 - 1/q) (uO) — KY) = Teaya =1+1/q 
Similarly, if B ~ M2(F) then O =~ M2(R), and from the exact sequence 
1 > 1 +pM2(R) > GL2(R) > GL2(k) > 1 
where k = R/p, we compute u(1 + p M2(R)) = u(p Mo(R)) = 1/q* and 


w(GL2(R)) = (1 — 1/q) a(l + pO)# GL2(k) 
(q? - 1X? - q) 3 
= "~~ =]-] . 
qd — 1/4) 4 


29.7.24. Let B be a quaternion algebra. Then the reduced norm yields an exact sequence 
P30! 0823 
so we have an induced compatible measure t! on O!. Thus 
x O% D(B -1/2,,x OX 
Ro ) _ DBY 4p” (0%) 
T*(RX) D(R)-1/2 u” (R) 
(d$, diserd(O)?)~!/? u* (0%) 


-1/2 (29.7.25) 
dp 
E (q-1)!, ifB is a division algebra; 
= dp’ -1/0 N ý 


if B ~ Mo(F). 
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29.8 Idelic zeta functions 


In this section, we now define (global) zeta functions in an idelic context. Let F be a 
global field and let B be a central simple algebra over F. 

We first define the classical zeta function before generalizing to idelic zeta func- 
tions. We define 


én(s):=| | 2, (9), (29.8.1) 


vfoo 


the product over all nonarchimedean places of the local zeta functions ¢g,(s) defined 
in 29.7.16. We studied these (global) zeta functions over number fields in Chapters 25 
and 26. 


Example 29.8.2. By Lemma 29.7.17(c) we have 


n-1 


imnes) = | | ge(ns - ò. 


i=0 


With this in mind, we construct idelic zeta functions as products of local zeta 
functions, according to the product measures and characters, as follows. 


29.8.3. For v € P1 F, we have defined measures u, on B, in 29.5.1 for v archimedean 
and 29.7.1 for v nonarchimedean. We define the measure 


a= [] # 
vePI F 


on B as the product measure (on the restricted direct product). In particular, for every 
eligible set S c Pl F of places and every R(s)-order O in B, the measure yp restricts to 
the (convergent) product measure on [],<s5 By X [],gs Oy—and this uniquely defines 
the u as a Haar measure on B. 


Definition 29.8.4. The absolute discriminant D(B) of B is defined as the product of 
the local absolute discriminants over all nonarchimedean places: 


D(B) := I] D(B,). 


vfoo 


The absolute discriminant is well-defined because D(B,,) = 1 for all but finitely 
many v. 


Example 29.8.5. If F is number field, let dr € Z be the discriminant of F. Multiplying 
the factors 29.7.6 together, if B is a quaternion algebra we find that 


D(B) = al N(p)? ldr|*, if F is a number field; 
7 peRam(B) á 1, if F is a function field; 


the product taken over all nonarchimedean ramified places. 
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29.8.6. In light of Proposition 29.7.7, we define the Tamagawa measure on B by 


ene) ee] |: (29.8.7) 


with t = py for v archimedean and t, = D(B,)!/ 2 ily defined in (29.7.8) for v 
nonarchimedean. E 


Remark 29.8.8. The Tamagawa measure has an intrinsic definition, independent of 
local normalizations [Weil82, Chapter II]. This intrinsic definition is, among other 
things, important for generalizations (which are quite broad, see Remark 29.11.9). It 
is then a nontrivial effort to unravel definitions to verify that the general notion of 
Tamagawa measure reduces to the one defined above. We content ourselves with the 
characterization 29.8.12 below. 


29.8.9. If F is a number field, we define the product character 
wis I] Wy 


on B. If F is a function field, we align our local characters as follows: let w € QF [Fg 
be a nonzero meromorphic 1-form, and for a place v € Pl F let 


Yla) = exp(2ai(trd(a)yw/dty)r, ) 


where f, is a uniformizer at t, and we again define y := [[, Ww. In both cases, 
W(aP) = (Ba) for all æ, 8 € B, as this holds for y, for all v. 
~ Tn all cases, we define the function 


o:=|]o, 


where ®, is the characteristic function of a maximal order O, for v nonarchimedean 
and Q,, is the standard function (Definition 29.5.13) if v is archimedean. 


Proposition 29.8.10. The product character has the following properties. 
(a) The map 

(29.8.11) 
is an isomorphism of topological groups. 

(b) The image of B under (29.8.11) is the group of unitary characters that are trivial 


on B. 
(c) The product measure Tt is self-dual with respect to y. 


Proof sketch. In the number field case, the statement follows from the local versions 
and the product formula; in the function field case, it follows similarly and reduces to 
the fact that the sum of the residues is zero. o 
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29.8.12. The Tamagawa measure T is then also characterized as the unique Haar 
measure on B that is self-dual with respect to the Fourier transform associated to the 
standard character. 


29.8.13. Moving now from additive to multiplicative, we define the normalized Haar 
measure 
rs I] uc (29.8.14) 


on BX, where u% is defined in 29.5.5 for v archimedean and (29.7.3) for v nonar- 
chimedean. Finally, we define the Tamagawa measure on B* by 


1% := D(BY ux = I] 1%, (29.8.15) 


where T% = u% for v archimedean and T% = D(B,)~!/*uX as in (29.7.10) for v 
nonarchimedean. 


With measures finally (!) in hand, we can now idelically integrate functions against 
them. 


Definition 29.8.16. A Schwartz—Bruhat function on B is a finite linear combination 
of functions f: B —> C with f = [], f}, where each fy is Schwartz—Bruhat and 
fy = Y is a standard function for all but finitely many v. 


Theorem 29.8.17 (Poisson summation). For a Schwartz—Bruhat function ®, we have 


>) BB) = DOB). 


BEB BEB 
Proof. Symmetrize to obtain 
(2@)(a) := X De +). (29.8.18) 
BEB 


Using y to identify B with the dual of B/B via Proposition 29.8.10 (and attending 
carefully to relevant analytic concerns), the symmetrized function is equal to its Fourier 


series 
(Za) = > agy(—Ba) (29.8.19) 
BEB 


where 


ag = f (2®)(a) (Ba) dr(@) 
B/B 


= [Yaa + nue + yer) (9.8.20) 
B/B 


=!) yeB 


= T Day (Ba) dt(@) = D (B) 
since Wy) = | for all y € B. Therefore 
X &B) = CDO = X A). o 


BEB BEB 
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Remark 29.8.21. When F is a function field, the alignment in w implies the Riemann— 
Roch theorem for the curve with function field F: see Exercise 29.13. 


Definition 29.8.22. For a Schwartz—Bruhat function ®, we define the (idelic) zeta 
function 


D S 
750) := f olal ao; 
BX 
when ® = ¥ is a standard function, we write simply Zg(s). 


We have ö 
Z% (s) = I] Zg (8) (29.8.23) 
vePI F 


wherever the product is absolutely convergent; the local zeta functions Zg,(s) are 
defined in (29.5.16) for archimedean v and (29.7.13) for nonarchimedean v. 

We can make the comparison to the classical zeta function (29.8.1) explicit up to 
an idelic volume, as follows. 


Lemma 29.8.24. The following statements hold. 


(a) If F is a number field, then 
Zels) = P(OEa(8) | | Za, (9): 


yloo 


if F is a function field, then 
Zp(s) = T*(O*)éR(5). 


(b) Suppose B is a quaternion algebra. If F is anumber field with D := disc B, then 
POST! = dr iOO) (29.8.25) 
where ~(D) = [lpo Nm(p) - 1). If F is a function field, then 
rO =f) [| a- 1w). 


veRam(B) 


Proof. For part (a), we use Lemma 29.7.17 for the relationship between zeta functions. 
For part (b), we combine Lemmas 29.5.18 and 29.7.17(b) and we use Lemma 29.7.22 
for the local computation of measure and the equality 


| [0 - NOVO) = Ide PG) 
plD 


in the number field case. o 
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29.8.26. For the modulus || ||, we find that 


lall = | [llevihy. 
v 


In particular, recalling 27.6.11 we have the group 


B = {a € B* : lall = 1}, 


and by the product formula we have B* < B®., 
We now restrict the measure on B® to B*. Let n? = dimp B. Then 
lællg = |Inrd(@)" |p. 
We have an exact sequence 
1 > B® > B* > ||B*|| > 1; (29.8.27) 


we have ||B*I| = Ryo if F is a number field and ||B*|| < q”? if F is a function field 
with constant field F4. Noting this, we take the measure on ||B*|| defined by n`! dt/t 
in both cases. By compatibility (see 29.3.13), we obtain a measure t” on B®. 


29.9 Convergence and residue 


In this section, we establish convergence properties of idelic zeta functions, and com- 
pute their residue at s = 1. Throughout, F is a global field and B is a central simple 
algebra over F. Let n? = dimp B; we allow n = 1, so B = F. 


Proposition 29.9.1. Let ® be a Schwartz—Bruhat function on B. Then the following 
statements hold. 


(a) Z5(8) is absolutely convergent for Re(s) > 1. 
(b) Z5(s) has a simple pole at s = 1 with residue 


SOA) 


® 
ress=| Zg (S) = rs 


where ¢} (1) = ress=1 ¢r (1). 


Proof. In a nutshell, we prove this proposition by comparison to the classical zeta 
function ¢u,,(F)(s) of the matrix ring, for which we can compare explicitly with a 
Dirichlet series. 

We may suppose without loss of generality that ® = [],, ®,, since by definition ® 
is a linear combination of such. Let S$ C PI(F) be the (finite) set of places v of F such 
that one of the following holds: 


e v is archimedean (if F is a number field); 
- By # M,(F,); or 
°. D, #'P,, i.e., D, is not the standard function. 
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Then ö P 
z5(s)= F Z5, (8) I] ZM, (F,)(8). (29.9.2) 
veS vés 
where the first product is finite. By Lemma 29.7.15 for v € S archimedean and com- 
parison to Lemma 29.5.18 for v archimedean, the first product is absolutely convergent 
for Re s > 1. For the second (infinite) product, by Lemma 29.7.17(c), we have 


n-1 n-1 


éM, (F,)(S) = I] Cr, (ns — i) = [ Ja =g hy (29.9.3) 
i=0 i=0 


where q, is the size of the residue field of F,; note m,r, )(s)! = [];(1 - q1™™°) is 
holomorphic. Putting these together gives 


I] ZM,(Fy)(S) = P(O im (s) I] Zm, (F (S)! 


vés veS 


and so we reduce to showing that ¢m,(F)(s) is absolutely convergent for Re(s) > 1. But 
multiplying (29.9.3) gives 


n-1 
ém, (F)(S) = I] Cr (ns — i) 
i=0 
which is absolutely convergent (by comparison to the harmonic series) whenever 
n(Re s) - (n — 1) = n(Res)-n + 1 > 1,ie., when Res > 1. This proves part (a). 
To prove part (b), we dig deeper. Staring at (29.9.3), we identify the terms Cp, (ns — 
n + 1) that contribute to a pole at s = 1; by absolute convergence, for Re s > 1 we have 


n-2 
I] Cr, (ns — p) ; 
i=0 


(29.9.4) 
The first two products are finite and defined at s = 1, and the third (infinite) product 
is now absolutely convergent for Res > 1 — 1/n, so in a neighorhood of s = 1. 
Accordingly, to compute the desired residue, we may compute a limit. 
Let v be a nonarchimedean place of F and let q, be the size of its residue field. For 
Res > 1, we have 


Za (8) 


ZS) 
B = ®, r*(O* 
ér(us —n +1) | 12s rollen ÉF, (ns=n+1) p ty (Oy) 


D, f 
Zg (9) = f _ D )llavl t a 
X (29.9.5) 
= | | D, (a )llall da, | d -477 
B% 


coming from the normalization factors between additive and multiplicative Haar mea- 
sure at the nonarchimedean places, where q, is the size of the residue field of F,. 
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Therefore 
Za (s) j= a 
lim i X 


= lim 
sNI ér (ns-n+1) si 1-qg;! 


z f E da, = © (0). 
B, 


f D, (allal! day 
B% (29.9.6) 


In a similar way, if v is archimedean we have (without annoying normalization factors) 
M P, V 
lim Z} (s) = ®,(a,) da, = ®; (0). 
s\l ¥ s m 


Applying (29.9.6), we conclude 


l Z5(s) Zg” (5) 
lim osoari [es O [tet rasa 


=[ [eX = 00; 


(29.9.7) 


the interchange of the product and the limit is justified by absolute convergence of the 
product in a neighborhood of s = 1. Therefore 


oY (1 
Tess] Z5 (5) = (0) Tess=1 F (ns -n +1) = Se 


as claimed, finishing the proof of (b). oO 


29.10 Main theorem 


We now establish the main analytic properties of the idelic zeta function, including 
meromorphic continuation and evaluation of residues. Our basic reference is Weil 
[Weil82, Section III.1]; see Remark 29.10.24 for historical comments and further 
references. The proof follows the same strategy as in section 29.2, with a key role 
played by Poisson summation and conceptual clarity brought by idelic methods. 


Main Theorem 29.10.1. Let F be a global field, let B be a central division algebra over 
F with n? = dime B. Let ® be a Schwartz—Bruhat function on B. Then the following 
statements hold. 


® 
(a) The function Z;(s) has meromorphic continuation to C. Moreover: 


(i) If F is a number field, then Z5 (8) is holomorphic in CX{0, 1} with simple 
poles at s = 0,1 and residues 
SOA) 


$0) (1 
TeSs=1 Z5(8) = a eT TESs=0 z$ (s) = a 
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(ii) If F is a function field with field of constants Fg, then Z5(8) is holomorphic 
in C\{s : q5 = q°, q!} with simple poles when q° = q?, q! and residues 


D D~O) ® O(0) 7 (1) 
sugi Z3(s) = —————.,_ reS,,s_,0 Zp (s) = -= 
TeSys! Zp(S) a TeSgs =q? Zg($) ; 
(b) z (s) satisfies the functional equation 
Za- s) = Z2 (s). (29.10.2) 


(c) We have 
1 (B\B) = 0). 


The value of the residue ¢7(1) is given by the analytic class number formula 
(Theorem 26.2.3). Taking n = 1 gives the following important special case (without 
requiring this as input). 


Corollary 29.10.3. Let F be a global field and let p(s) := T1y4oo(1 — gy, the 
product over all nonarchimedean places of F and q, the cardinality of the residue field 
at v. Then the following statements hold. 


(a) If F is a number field with absolute discriminant dr, let £p (s) be the completed 
Dedekind zeta function as defined in (26.8.4). Then Er(s) is holomorphic in 
C\{0, 1}, with simple poles at s = 0,1, and satisfies the functional equation 
rC -= s) = čr (s). 

(b) If F is a function field of genus g with field of constants Fq, let Er(s) := 
g'8-)8 f(s). Then ¢r(s) is holomorphic in C\{s : q5 = q°,q'}, with simple 
poles when q5 = q°,q', and satisfies the functional equation ëF (1 — s) = ëp (s). 


Proof. Takingn = 1 gives B = F; we take ® = Y = [],, Y, to be the standard function. 

In the number field case, we have &(s) = ldp |S? Zr(s). By Lemma 29.8.24 and 
Lemma 29.5.18 for the archimedean contribution, we see that £F (s) = crldr|*/?Zr(s) 
where cpr is a constant depending on F. We accordingly conclude holomorphicity from 
Main Theorem 29.10.1(a). For the functional equation, by (29.7.20), we have Zr, (s) = 
D(F)" Zr, (s). Taking the product and reading off Main Theorem 29.10.1(b) gives 


EC - s) = cpldr |P ZEA - s) = dr |07 Zp (s) = crlde |? Zp = Els). 


The function field case in concluded in a similar but more direct manner. o 


Main Theorem 29.10.1 extends to the case of a matrix algebra (over a division 
algebra), with some additional complications because of the existence of zerodivisors: 
see Theorem 29.10.20 for the case B = M2(F). As we will see, the proof is close to 
uniform in the number field and function field cases; we have separated these two cases 
in the statement for clarity, so in particular the poles in the function field case occur at 
s € 2ni(log q)Z and s € 1 + 2xi(log q)Z. 
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Remark 29.10.4. Weil [Weil82] chooses a different normalization, giving a functional 
equation relating s to n — s and with residues SO) at s = 0 and —®(0)¢;,(1) at 
S=N. 


Proof of Main Theorem 29.10.1. We work with Tamagawa measure throughout, so to 
ease notation we abbreviate d* := dr*. To break up the integral, we define 


1, O0<t<l; 
A(t) := 41/2, t=1; (29.10.5) 
0, t>1. 


We break up B* by 4 according to the norm: for œ € B*, we define 


f(a) := Allal’, 
f-(@) := Allel). 


Then fila!) = f(a) and f, (œ) + f-(@) = 1 for all a € B*, and so defining 


(29.10.6) 


z (= f Fal Pa) da, (29.10.7) 


we obtain ö Pp oa 
ZR(8) = ZR (s)+ Ze (9). (29.10.8) 
We claim that the function z% (s) is holomorphic. Indeed, by Proposition 29.9.1, 
ZG (5) converges absolutely for Res > 1; thus the same is true for ZŠ (s). But if 
ZS" (s) converges absolutely at s = so, then it does so for all Re(s) < Re(so), so 
ZS (s) is holomorphic in C. 


a : ®,- 
For the remaining piece Z% (s), we have 


z (s) = i. a solar > (Ba) oe. 29107) 


‘Be BX 
Poisson summation (Theorem 29.8.17) gives 
>) 2B) = DOB). 
BEB BEB 


For a € B*, let alp) = (Ha) be the right translate. Then 


58) = i Oya (By) dey) 
1 (29.10.10) 
= Tall f D(ya(Bya')dt(y) = — (a!) 


ole lla 
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using that y is well-defined on conjugacy classes in the last equality. Plugging this into 
Theorem 29.8.17 gives 


"2 O(Ba) = = 3 D“ (a7! p). (29.10.11) 


4a lel £ 


At this point, we use the hypothesis that B is a division algebra over F. With this 
assumption in hand, 


$, 2a) = $ 2a) - 20). 


BEB* BEB 
From (29.10.11) applied to the sum in (29.10.9), we obtain 


Q,- a _ 
Za = | f-@lal ( X, D'e) + DO) - GO)|lall}@%a (29.10.12) 
BNE: BEB* 
valid for Re(s) > 1. 
Next, we make the substitution 
sel-s, aea'! 


in the definition of ZS HO (29.10.7). The Tamagawa measure d*æ is invariant under 


inversion œ — @7!, so 


oY, = -=s z 
Z% (l -s)= I Fallal OP & (a!) d*a 
BX 
= (29.10.13) 


= f F-a Da7) Fa. 
BX 
We break up this integral according to its dependence on B* as follows. We write 


ze" -s) = L me Lo py orep) da ea) 


BEB 


(replacing 67! + £ in the sum): in writing the integral this way, we integrate over 
any measurable set B* that maps injectively under the continuous quotient map BX > 
B*\B*. Combining (29.10.12) and (29.10.14), we obtain 


O-z -95 vlee (29.10.15) 
B*\B* 


where 
v(t) := (on 7 aor“) A(t). 


The function v only depends on lengths in B*. Recalling 29.8.26, in particular the 
exact sequence (29.8.27), we obtain 


f Mad aa = 108| f noe) (29.10.16) 
B*\B* IBxII 
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where as in the previous paragraph, 
tT™(B*\B®) = f dra 
> = BX\BO) E 


is the volume of any measurable set in B® that maps injectively under the quotient 
map. 
When F is a number field, 


@Y(0) f! dt 0) f! d 
v(t) d*t = =”) f pide N f i a 
BX] n 0 t n 0 t 


= (Fo =) 
=- + ; 


l-s sS 


(29.10.17) 


When F is a function field with constant field F4, 


DOJI X a 
vdt == ——" |e 5 gd] - 
ie n 2, 


(0) 
d=1 n 
1 1 s-1 1 -5 
=-— [oo 77 — + a0) 4 |: 
= q57 ]— q S 


“ 
=1 (29.10.18) 


We now finish the proof, beginning with the case that F is a number field. Com- 
bining (29.10.8), (29.10.15), and (29.10.17), we obtain 


Z5(s) = Z$) + Zz (1-5) 


_ Loggo |Z 
S 


The substitution s — 1 — s interchanges ® and ®Y, since (MY)’(s) = ®(s); from this 
we conclude the functional equation (b) and that 2 (s) has meromorphic continuation. 


Since O) and Zz. HO) are entire, we conclude that ZS (s) is holomorphic in C 
away from s = 0, 1. Proposition 29.9.1(b) gives 


® 
ress=1 ZR (s) = P 


which proves (a); on the other hand, (29.10.19) tells us 
T (B*\ BOD” (0) 


n 


© 
ress=| Zg(s) = 


We conclude that t” (B* \BY) = ¢;,(1), which proves (c). 
The case where F is a function field follows in a similar manner. o 


The above extends to the case of a matrix algebra over a division algebra, with 
additional complications. To keep our eye on the prize, we treat just the case B = M2 (F). 
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Theorem 29.10.20. Let F be a global field and let B = Mo(F). Let ® be a Schwartz— 
Bruhat function on B. Then the conclusions of Main Theorem 29.10.1 hold with the 


exception that Z5(s) may also fail to be holomorphic at s = 1/2 (if F is a number 
field) or q5 = q!’ (if F is a function field). 


Proof. Let By := (a, bı | F) and B2 = (a, b2 | F) be nonisomorphic division quaternion 
algebras with Ram(B,) N Ram(B2) = Ø, neither ramified at an archimedean place, and 
let B3 = (a,b,bz | F). Then by multiplicativity of the Hilbert symbol, we have 
Ram(B3) = Ram(B,) u Ram(B2). For i = 1,2,3 and for each v, let O;,, = ®, if 
v ¢ Ram(B;) and ®,,,, = Y, the standard function if v € Ram(B;), and let ®; = [],, ®:,y. 

Then ù ù 

o Zp (s)Z5, (9) 
Zz(s)= aT O. (29.10.21) 
3 
Zz, (s) 


® 
The meromorphic continuation and functional equation for Z} then follow from the 


corresponding properties of z“ (s) and (29.10.21); we conclude also that Zz, (s) has 
the claimed simple poles, and the values at the residues hold peeling off the factor 
;-(1)/2 because they hold at each v. 


To conclude parts (a) and (b), we show that ZS (s) is holomorphic away from s = 
0, 1/2, 1 Gif F is a number field) and g* = ql ? (if F is a function field). Decomposing 
the sum, we may suppose without loss of generality that ®, = Y, is a standard function 
for all but finitely many places v. Let B’ be a division quaternion algebra over F that 
is unramified at archimedean places and all places where ®, + ‘Y,,. Then 
; Z**(s) 
Z5()=Zs(s) [| 4 


s (29.10.22) 
veRam(B’) Zp" ($) 


By Lemma 29.7.17, we have 


= fp (28-1 = (1 - q1”)! 


which is holomorphic away from q$ = q}! 2. When further F is a number field, we vary 


B’ to conclude holomorphicity away from s = 1/2 (Exercise 29.10). 
To keep things tidy, we conclude the proof of part (c) in Theorem 29.11.3. m 


As an important special case, we have the following theorem, recalling the calcu- 
lation of D(B) (Example 29.8.5). 


Theorem 29.10.23. Let B be a quaternion algebra over F and let D(B) € Zso be the 
absolute discriminant of B. Then the zeta function Zg(s) has the following properties. 


(a) Zg(s) has meromorphic continuation to C. More precisely: 
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(i) If F is a number field, then Zg(s) is holomorphic away from s = 0, 1/2, 1 
with simple poles at s = 0,1 and residues 


* 1 , i 
ppan Tess=0 Zg(s) = = ), 


moreover, Zg(s) is holomorphic at s = 1/2 if and only if B is a division 
algebra. 


TeSs=1 Zg(s) = 


(ii) If F is a function field of a curve of genus g over Fg, then Zp(s) is 
holomorphic away from q° = q°, q! 
and residues 


,q!, with simple poles at q5 = q°,q' 
* 1 * 1 

BEE a TESgs -g0 Zg(s) = ei l, 

2g?) YDB) 17 


moreover, Zp(s) is holomorphic where q5 = q!’ if and only if B is a 
division algebra. 


TeSgs-gi ZB(S) = 


(b) Zg(s) satisfies a functional equation. More precisely: 
(i) If F is a number field, then 


Zg(1 —s) = D(B)!? Zg (s). 
(ii) If F is a function field, then 
Zp(1 - s) = (q4?®)D(B)) Ze). 


Proof. We apply Main Theorem 29.10.1 and Theorem 29.10.20 with ® as in 29.8.9 with 
@,, the characteristic function of a maximal order or the standard function according 
as v is nonarchimedean or archimedean, so ®(0) = 1. 

In the number field case, we have OY = ,, self-dual when v is archimedean, and 
by (29.7.20) we have 


Ze 0G,) 79 @) 


and ®Y(0) = D(B,)~!/* when v is nonarchimedean. Multiplying these together and 
applying Theorem 29.10.1(c) gives 


Z2(1 - s) = ZË (s) = D(B)/?-* 22 (8). 


In the function field case, a similar argument holds but with the character modified 
by a global differential as in 29.8.9: for the relevant additional factor, see Exercise 
29.13. 

To conclude, we note that when B ~ M2(F) then by Lemma 29.8.24 the nonar- 
chimedean part of Zg(s) is given by ¢r(2s)¢r (2s — 1), and this has a double pole at 
s = 1/2 (or accordingly g°~!/? = 1). o 


Remark 29.10.24. In her 1929 Ph.D. thesis, Hey [Hey29] defined the zeta function of a 
division algebra over Q, proving that it has an Euler product and functional equation— 
this was a tour de force in algebraic and analytic number theory, especially at the time! 


534 CHAPTER 29. IDELIC ZETA FUNCTIONS 


For more on Hey’s contribution and its role in the development of class field theory, 
see the perspective by Roquette [Roq2006, §9], including Zorn’s observation that the 
functional equation yields an analytic proof of the classification of division algebras. 
Hey’s thesis was never published (though the content appears in Deuring [Deu68, VII, 
§8]), and classical treatments of the zeta function for the most part gave way to the 
development of Chevalley’s adeles and ideles. 

Tate’s thesis [Tate67] (from 1950, published in 1967) is usually given as the 
standard reference for the adelic recasting of zeta and L-functions of global fields as 
above: it gives the general definition of a zeta function associated to a local field, an 
integrable function, and a quasi-character ['Tate67, §2]; see also the Bourbaki seminar 
by Weil [Weil66]. But already in 1946, Matchett (also a student of Emil Artin) wrote 
a Ph.D. thesis at Indiana University [Mat46] beginning the redevelopment of Hecke’s 
theory of zeta and L-functions in terms of adeles and ideles. At the time, Iwasawa also 
contributed to the development of the theory; see his more recently published letter to 
Dieudonné [Iwa92]. 

These results were generalized to central simple algebras over local fields by 
Godement [God58a, God58b], Fujisaki [Fuj58], and Weil [Weil82, Chapter III] (and 
again Weil [Weil74, Chapter XI]) in the same style as Iwasawa and Tate; and then they 
were further generalized (allowing representations) by Godement—Jacquet [GJ72], 
providing motivation for the Langlands program. 


As an example of the generalizations indicated by Remark 29.10.24, we conclude 
with a slightly more general statement. 


Theorem 29.10.25. Let F be a global field, let B be a central division algebra over F, 
let ® be a Schwartz—Bruhat function on B, and let y: B* — C be a unitary character 
such that y restricted to F* is trivial. Define 


15sx) := i _ 2a) x(a)|lall* dr*(a). 


® 
Then the function Lz (5, x) is absolutely convergent for Res > 1. If x is nontrivial, 


then E (s, x) has holomorphic continuation to C and satisfies the functional equation 
Ko oY = 
L36 X) = Ly (8, x7'). 


Proof. Proven in the same way as Main Theorem 29.10.1, just keeping track of the 
character y. The (possible) residues at s = 0, 1 (or more generally in the function field 


case q° = q°, q!) are multiplied by 


f x(a) dr (a) 


BO/EX 7 


which is zero when y is nontrivial by character theory and therefore the L-function is 
fact holomorphic at these points. o 
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29.11 Tamagawa numbers 


Building on Main Theorem 29.10.1, in this section we compute the measure of certain 
quotients with respect to the normalized idelic measure above. Let B be a quaternion 
algebra over the global field F. 


Lemma 29.11.1. Let Q be the set of real ramified places in B, and let 
FY := {x € F® : x, > O forall v € Q). 


Then the sequence 
ops BOS pO 1 
is exact, giving a compatible measure on B! denoted t!. Under this measure, we have 
TOBB) = T'(B'\BDTO PAE”). (29.11.2) 


Proof. The surjectivity of the reduced norm is locally established, and its kernel is 
B! by definition. The image of B* under the reduced norm is Ee by the Hasse— 
Schilling theorem of norms (Main Theorem 14.7.4), with kernel B!. Moreover, the 
natural inclusion 

FoF ao a ie 
is also surjective by weak approximation. Putting these together, we obtain (29.11.2), 
which we might be tempted to summarize in the exact sequence of pointed sets 


1 BB! > B\ BOS PAFO 5 1 
but we will resist the temptation. Oo 
We now prove the final result in this chapter. 
Theorem 29.11.3. Let B be a quaternion algebra over a global field F. Then 
w = fOUP\E®) = Ge(1) 


and 
7'(B'\B!) = 1. 


Proof. The equality eR \FO) = ¢;,(1)is the statement of Main Theorem 29.10.1(c) 
applied to B = F. Then B is a division algebra, the equality r“(B*\B)) = Gp (1) is 
again Main Theorem 29.10.1(c), and the equality 7!(B!\B') = | then follows from 
(29.11.2). 

To conclude, we make the appropriate modifications in the remaining case, and 
suppose that B = M2(F). Then B! = SLo(F) and similarly B! = SL2(F). By the exact 
sequence (29.11.2), we may show tT (SLo(F)\ SL2(F)) = t!(SL2(F)/ SL2(F)) = 1. 
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We will do Fourier analysis on F°, extended from F, with self-dual measure T and 
character y. Let ® be a Schwartz function on F°. The Fourier transform is 


o0) = ‘a POW" Dax 


and Poisson summation reads 


X w= Y” OG). (29.11.4) 


xeF? yer? 


The group SL2(F) acts on column vectors F7\ {0} by left multiplication, with the 
stabilizer of F*\{0} given by SLo(F). Thus 


(0) = | (x) dra) = i Dax) - 0(0)} dr'a). 29.115) 
F? SLo(F)/ SLoF) \ 4 
From (29.11.4), we derive 
Y aes = + Y y) (29.11.6) 
a leli 2, 
xE ye 


and llællstar) = 1. Plugging in (29.11.6) into (29.11.5), we have 


@(x) dt(x) = Í » Per» -00 dr'(a@).  (29.11.7) 
E? SL(F)/ SLF) \ eF? 


Replacing ® — ®Y and a — (at)! (preserving the measure) in (29.11.7) gives 


0(0)= | ®(x)d = 
(0) i: 2 Œ) drka) R > 


Dey) - so) dr' (æ). (29.11.8) 
yeF? 


Subtracting (29.11.8) from (29.11.5) gives 


wo-20-= | (©”(0) - ©©)) dr'(a) 
SL2(F)/ SL: (F) 
= T'(SL(F)/ SL2(F)(®* (0) - (0); 
choosing ® such that ®(0) + ®Y (0), we obtain T!(SL2(F)/ SL2(F)) = 1. o 


Remark 29.11.9. We return to Remark 29.8.8. In general, there is a natural, intrinsic 
measure on the adelic points of a semisimple algebraic group G over a number field F, 
called the Tamagawa measure. With respect to the Tamagawa measure, the volume 
vol(G(F)/G(F)) is finite, and the Tamagawa number of G (over F) is defined as 


T(G) := vol(G(F)/G(F)). 
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For example, in the above we computed the volume for the group G associated to the 
group B! of quaternions of reduced norm 1. 

In the late 1950s, Tamagawa defined the Tamagawa measure [Tam66]. Weil 
[Weil82] (based on notes from lectures at Princeton 1959-1960), computed that 
t(G) = 1 for G a classical semisimple and simply connected group; the conjecture 
that this holds in general was known as Weil’s conjecture on Tamagawa numbers. This 
difficult conjecture was proven by the efforts of many people: see Scharlau [Scha2009, 
§2] for a history. In particular, the calculation of the Tamagawa number of the orthog- 
onal group of a quadratic form recovers the Smith-Minkowski-Siegel mass formula 
that computes the mass of a genus of lattice. 


Exercises 


1. Let up be the standard Haar measure on Qp. 


(a) Let D(a, 6) := {x € Qp : |x — al < ô} be the open ball of radius 6 € Ryo 
around a € Qp. Compute the measure up(D(a, 6)), and repeat with the 


closed ball of radius 6 € R>ọ. 
(b) Show that 


log p 
log(lxlp) duip(x) = -—==. 
L E pol 


2. Let F := F,(7); then F is the function field of P!, a curve of genus g = 0 and 


1 
d -qX -q"5) 
Let B := M2(F). Verify directly that g(s) as defined in (29.8.1) satisfies 
fg - s) = q5 ¢3(s), and then compare this with Theorem 29.10.23. 


3. Let G be a locally compact, second countable topological group, and let H < G 
be a subgroup. 


ér(s) = 


(a) Show that G/H is locally compact. [Hint: Use Exercise 12.4. ] 
(b) If H is closed, show that G/H is Hausdorff (repeating Exercise 12.5). 


[We do not need G to be Hausdorff in this exercise.] 
4. Let G be a Hausdorff, locally compact, second countable topological group. 


(a) Let u, u’ be two Haar measures on G. Show that there exists x € R>ọo such 
that w = Kp. 

(b) Let u be a left Haar measure on G. Show that u(G) < œ if and only if G 
is compact. /Hint: Use Exercise 12.4.] 


> 5. Let G be a Hausdorff, locally compact, second countable topological group. Let 
ube a left Haar measure on G. Show that the modular function Ag: G > Rso 
is a homomorphism of groups. 
> 6. Let A be a Hausdorff, locally compact, second countable topological ring, and 
let a,b € A*. Show that 
I|ab|| = llallllbll. 
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T: 


8. 


> 9, 


> 10. 


> l1. 


12. 
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Consider the exact sequences 
1> {+1} > RX 5 Rš > 1 


where the map ¢ is given by either the quotient by +1 (equivalently, the absolute 
value) or by the map x +» x*. Equip R* and RX, with the standard Haar measure 
dx /|x|. Compute the unique compatible measures on {+1} for the two choices of 
ġ and show they differ by a factor 2. 


Finish the proof of Lemma 29.5.18. [Hint: It may help to use the Iwasawa 
decomposition (Proposition 33.4.2 and Lemma 36.2.7).] 


Prove Proposition 29.6.9, as follows. Let F be a nonarchimedean local field and 
lety: F > C! be a nontrivial unitary character of F, for example the standard 
unitary character. For each x € F, let Yx: F — C! be defined by y4 (y) = w(xy). 


(a) Show that yw, is again a unitary character of F and that w, is trivial if and 
only if x = 0. 
(b) Show that 
WY: F > FY 
xP Wy 


defines a continuous, injective group homomorphism. /Hint: recall that 
FY inherits a compact-open topology, so a basis of neighborhoods of the 
identity is given by {f € FY : f(K) € V} for K © F compact and V C FY 
an open neighborhood of 1. Given such K,V, show that there exists an 
open neighborhood U C F of 0 such that xK C w'V for all x € U.] 

(c) Show that ¥(F) is dense in FY. [Hint: if Yx(y) = 0 for all x € F, then 
y=0.] 

(d) Prove that Y~! is continuous (on its image). [Hint: work in the other 
direction in (b).] 

(e) Show that ¥(F) is complete, hence closed, subgroup of FY. Conclude that 
is an isomorphism of topological groups. 


Complete the proof of Theorem 29.10.20 for F a number field, as follows. By 
choice of B’, we showed that ZR (8) is holomorphic except for s = 0,1 and 
possibly when q$ = qui * for v € Ram(B’). Show (by varying the choice of B’) 


that Zh (8) is holomorphic away from s = 0, 1/2, 1. 


Let B be a quaternion algebra over a global field F. Show that there exists a 
compact E C B such that the map B — B\B is not injective on E. [Hint: 
accept the results of section 29.3 and take E with measure u(E) > u(B\B) and 
integrate. ] ~ ~ 


Following the proof of Theorem 29.11.3, show that if B = M, (F) that 
TO(BAB®) = O(PYEM) = G0 


and 
7'(B'\B') = 1. 
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13. This exercise assumes some background in algebraic curves, see e.g. Silverman 
[Sil2009, Chapter II]—for a wider survey, see Rosen [Ros2002]. Let F be the 
function field of a curve X over F4. The divisor group Div X is the free abelian 
group on the set of places v; it has a degree map deg: DivX — Z with 
deg v = [k, : F4], where k, is the residue field at v An element f € F* has a 
divisor div f = >), ord, (f)v. A nonzero meromorphic differential w on X has 
also a divisor K := },, a,v where a, = ord,(w/dt,) where t, is a uniformizer at 
v; we call K a canonical divisor. Given D = >), d,v € Div X, let 


L(D):= | |p” oF, 


and L(D) := L(D) NF = {f € K : ordy(f) = —d,} and finally ((D) := 
dimp, L(D). Define the genus of X or of F by g := (K) € Zo. 


(a) Show that the characteristic function ® of L(D) is q*°8? -deg K/2 times the 
characteristic function of L(K — D). 
(b) Apply Poisson summation to ® to obtain 


> i= gis D-deg K /2 >, 1 
feEL(D) fEL(K-D) 


1 
€(D) = deg D — z deg K + €(K - D). 


(c) Plug in D = 0 to (b) to get L(0) = F4 the constant functions and conclude 
deg K = 2g - 2. 
(d) Conclude the Riemann—Roch theorem 


€(D) — €(K — D) = deg D +1- g. (29.11.10) 
14. Prove Theorem 29.10.25. 
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Chapter 30 


Optimal embeddings 


To conclude our analytic part, we apply idelic methods to understand embeddings of 
quadratic orders into quaternion orders. 


30.1 > Representation numbers 


A subject of classical (and continuing) interest is the number of representations of an 
integer by an integral quadratic form, in particular as a sum of squares. Because of 
the subject of this text, we consider quadratic forms in three and four variables where 
quaternions provide insight. 

Lagrange famously proved that every positive integer is the sum of four squares. We 
proved Lagrange’s theorem (Theorem 11.4.3) by viewing the sum of four squares as 
the reduced norm on the Lipschitz order, and concluded the argument by comparison 
to the Hurwitz order, which is Euclidean. In Theorem 14.3.8, we proved Legendre’s 
three-square theorem: every integer n > 0 not of the form 47(8b + 7) is the sum of 
three squares. The proof is harder for three squares than for four (see Remark 11.4.4): 
our proof used the Hasse—Minkowski theorem (the local-global principle for quadratic 
forms over Q) and again the fact that the Hurwitz order has class number 1. (We gave 
a variant in Exercise 14.5, where we used the local-global principle for embeddings.) 

In each of these cases, we may also ask for a count of the number of such repre- 
sentations. For k € Zs; and n € Zso, let 


re(n) = HC, x2, ...,%4) EZE xt tg te +2 = nn} (30.1.1) 


be the number of ways of writing n as the sum of k squares; equivalently, this is 
the number of lattice points on the sphere of radius Vn in R*. The number r4(n) is 
computed in terms of the factorization of n in the Hurwitz (or Lipschitz) orders, and 
has a simple answer: we saw in Exercise 11.3 that for an odd prime p, 


r4(p) = 8(p + 1) 


and we upgraded this in Exercise 26.7 to a general formula for r4(n) in terms of the 
sum of (odd) divisors of n. 
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We may similarly ask for a formula for 73, but it is more difficult both to state and 
to prove. Define 


PPM (n) = HCx, y, z) € Z? : x? +y? +z? = n and ged(x, y, z) = 1} (30.1.2) 


as the number of primitive representations of n as the sum of three squares. Then 
ri ™(n) = r3(n) if n is squarefree, and more generally 
nm = X P™(n/d?). 


d\n 


Let h(d) = #Pic Sg be the class number of the quadratic order of discriminant d < 0; 
equivalently, h(d) is the number of reduced primitive integral positive definite binary 
quadratic forms of discriminant d. 

Gauss [Gau86, Section 291] showed that rE ™ (n) is a constant multiple of A(—4n) 
as follows. 


Theorem 30.1.3 (Gauss). We have r3(1) = 6, r3(3) = 8, and for n € Zso: 


0, ifn = 0,4,7 (mod 8); 
rg" (n) = 4 12h(-4n), ifn = 1,2,5,6 (mod 8) and n + 1; 
Sh(—4n) = 24h(-n), ifn = 23 (mod 8) and n + 3. 


(One can uniformly include the cases n = 1,3 by accounting for the extra roots of 
unity in Q(V-n).) 

Theorem 30.1.3 is a special case of Theorem 30.4.7—see Exercise 30.4—but for 
historical and motivational reasons, we also give in the next section an essentially 
self-contained proof for the case n = 1,2 (mod 4), following Venkov [Ven22, Ven29]. 

The main obstacle to generalizing Gauss’s theorem (Theorem 30.1.3) is that quater- 
nion orders need not have class number 1: a generalization with this hypothesis “follow- 
ing the general plan described by Venkov” is given by Shemanske [Shem86]. Another 
annoyance is the growing technicality of the local computations giving the explicit 
constants involved. Both of these issues are in some sense resolved by employing 
idelic methods (hence the placement of this chapter in this text) and even the proof of 
Gauss’s theorem itself is simplified by these methods (in the next section). The result 
is Theorem 30.4.7: representations are spread across the genus of an order, with the 
constants given by local factors (computed in this chapter for maximal orders and then 
Eichler orders). 


30.1.4. For indefinite quaternion orders, strong approximation applies, and we are 
almost always able to prove that the contribution to each order in the genus is equal, with 
one quite subtle issue known as selectivity: in certain rare circumstances, a quadratic 
order embeds in precisely half of the orders in a genus. We pursue selectivity in the next 
chapter (Chapter 31): technical and rather extraordinary, it is a subject that demands 
care. 

Happily, a locally norm-maximal order (such as an Eichler order) over Z is not 
selective! 
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An important application of this theory is a refinement of the mass formula to a 
class number formula. Recall the Eichler mass formula (Theorem 25.3.18): if B is a 
definite quaternion algebra over Q of discriminant D and O C B is an Eichler order of 
level M, then 

1 _ gD) 


tjeciso W7 12 


where wy = #O_(J)/{+1}. We can account for the necessary correction: 


#ClsO = > |- OWM) 5 ` (1-5) 
12 WJ 
[JJeCls O [J]eCls O 


wy>l 
The latter “error term” is accounted for by (finite) cyclic subgroups spread across repre- 
sentative left orders O, (J)—and this is precisely the contribution computed idelically 
above! 


Theorem 30.1.5 (Eichler class number formula). Let B be a definite quaternion algebra 
over Q of discriminant D, and let O C B be an Eichler order of level M. Let N = 
DM = discrd O. Then 


O= y(D)W(M) 12,8 


my 12 4° 3 
where n 
(FUT (+ (Feat 
Q= [If P II P 
0, if4| N; 
and 7 4 
BE o 
g= pi P I] P 
0, if9| N 


30.2 Sums of three squares 


In this section, we prove the theorem of Gauss showing that the number of represen- 
tations of an integer as a sum of three squares is a class number. For further reading, 
see also Grosswald [Gro85, Chapter 4] and the references therein. 

_A few parts are easy to establish. The count 73(1) = 6 is immediate. If 4 | n, then 
ri ™ (n) = 0 since x? + y? + z? = 0 (mod 4) implies x = y = z = 0 (mod 2). Similarly, 
if n = 7 (mod 8) then re ™ n) = 13(n) = 0 by the three-square theorem. 

The two remaining cases lie deeper. A proof using quaternions is due to Venkov 
[Ven22, Ven29]; alternate accounts are given by Hanlon [Hanlon81, Chapter 2] and 
Rehm [Reh76]. To accomplish the task of giving an argument that is as self-contained 
as possible and still previews the ideas and structure contained in this chapter, we give 
a proof in the case n = 1,2 (mod 4). 
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Proof of Theorem 30.1.3 for n = 1,2 (mod 4). Suppose n = 1,2 (mod 4). Let $ := 
Z[V—n] c K := Q(y=n). Then S is maximal and ramified at 2, i.e., S ® Zp is the ring 
of integers of the field K 8 Q2 = Q2(V-n). 

Let B = (—1,—1 | Q) be the rational Hamiltonians and O Cc B the Hurwitz order. 
We consider the set 


W = {a = xi + yj + zk € O: ais primitive and nrd(a) = n}; (30.2.1) 
then #W = rp imen), By the three-square theorem, W + 0, so let a e W. We embed 


K = B 
y-n > Q 


and for convenience identify K with its image. By the Skolem—Noether theorem (Corol- 
lary 7.1.5), every other element a’ € W is the form a’ = B~'aB with B € B* (but not 
necessarily conversely!). Let 


E := {8 € B* : p 'aß € W}. (30.2.2) 


The set E has a right action of the normalizer Ngx(O) (checking that primitivity is 
preserved). 

We relate E to the group of fractional ideals Idl S as follows. Let b C K bea 
fractional S-ideal. Since O is (right) Euclidean, bO = BO for some 8 € B* that is 
well-defined up to right multiplication by O*. The heart of the proof is the following 
claim: the map 


(30.2.3) 
b +> BNgx(O) 


is a well-defined, surjective map of sets. The most efficient (and clear) proof of this 
claim is idelic, and we prove it in two steps. 

First, the map (30.2.3) is well-defined: that is to say, 6 € E. Write bS = DS, so that 
BO = bO and £ = bf with f € O*. Then b commutes with a (in K c B), so 


a =B'ap=p'b'abp=m'ape @'Onc m'!OfNB=ONnB=O. 


To show that the map is surjective, we need to establish one other important point 
comparing the global and the idelic: we claim that there exists Y € Ng.(O) such that 


Vv lav = Bla. (30.2.4) 


The existence of ¥ may be established locally. We prove this in Proposition 30.5.3: for 
p # 2, it amounts to showing that two elements of M2(Z,,) with square —n are conjugate 
under GL2(Z,,), and for p = 2 it follows from the description of the valuation ring: it is 
here where we use the fact that S is maximal at 2, but this is only a technical issue. (The 
reader may accept (30.2.4) and proceed, or pause here and work this out, in Exercise 
30.3.) 

Given (30.2.4), we see that BY~! centralizes a in B* and so belongs to K™, whence 


B € K*Nz.(O). 


30.3. OPTIMAL EMBEDDINGS 545 


Finally, since 2 is ramified in K—using n = 1,2 (mod 4) again—and the nontrivial 
class in Ngx(O)/ O* = Z,/2Z is represented by an element of reduced norm 2, we have 


K*N5.(O) = K*O*, 


and therefore 6 = bf with b € K* and f € O*. Therefore 6O = BO and so BO = bO 
where b = 6S. 

From the claim, we conclude the theorem. We have 6~'a = a if and only if 8 
centralizes « if and only if 8 € K*, so the desired elements B~'!aB up to Ngx(O) are 
uniquely determined by the class [b] € Pic S = Idl $/ PIdl S. Finally, when n > 1 we 
have S* = {+1} so #O*/S* = 12, and the result follows. Oo 


Importantly, Venkov’s proof of Gauss’s theorem given above is explicit and con- 
structive, given at least one representation as a sum of three squares. 

The early observation made in the proof above is that the sum of three squares is 
the restriction of the reduced norm to the trace zero elements of the Hurwitz order. 
One then seeks a similar statement for quadratic forms Q = nrd |oo obtained more 
generally. (This is almost the same thing as the ternary quadratic form associated to O 
itself via the Clifford algebra construction in Chapter 22; the difference is that for the 
latter we take the dual of the order and scale, as in (22.1.3).) 

Just as in the proof above, for a representation Q(x, y, z) = n corresponding to œ € O 
with a? + n = 0, we obtain an embedding S = Z[V=n] © O of a quadratic order into 
the quaternion order; conversely, to such an embedding we find a representation. It is 
more convenient to work with embeddings, as they possess more structure. Viewed in 
this way, we may equivalently restrict the reduced norm from O to the order S itself, 
and then we are asking for the representation of a binary quadratic form by a quaternary 
quadratic form. 


30.3 Optimal embeddings 


We now begin in earnest. We start by considering quadratic embeddings into quater- 
nions, both rationally and integrally. Let R be a Dedekind domain with F = Frac R, 
and let B be a quaternion algebra over F. 

Let K be a separable quadratic F-algebra: then either K 2 F is a separable quadratic 
field extension or K = F x F. Suppose that K > B. 


30.3.1. The set of embeddings of K in B is identified with the set K*\B”, by 7.7.12: 
if @¢: K — B is another embedding, then by the Skolem—Noether theorem there 
exists B € B* such that 6(a) = B'af for all œ € K with 8 well-defined up to left 
multiplication by K*, the centralizer of K under conjugation by B*. 


We now turn to the integral theory. Let O C B be a quaternion R-order, and let 
S C K be a quadratic R-order; we will be interested in embeddings ¢: S —> O. Such 
an embedding gives an embedding ¢: K — B by extending scalars. We keep the 
embeddings for various suborders organized as follows. 
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Definition 30.3.2. An R-algebra embedding ¢: S — O is optimal if 
(K) NO = O(S). 
Let 
Embr(S, O) := {Optimal embeddings $ — O}. (30.3.3) 
When no confusion can result, we drop the subscript R and write simply Emb(S, O). 


30.3.4. If an embedding ø: S <> O is not optimal, then it is optimal for a larger order 
S’ D S. Accordingly, there is a natural decomposition 


{Embeddings S > O} = || Emb(S’, O). (30.3.5) 
S’2S 


Lemma 30.3.6. An R-algebra embedding ¢: S — O is optimal if and only if the 
induced Ry-algebra embeddings Sy — Oy are optimal for all primes p C R. 


Proof. Immediate from the local-global dictionary for lattices (Theorem 9.4.9). o 


Lemma 30.3.6 says that for an embedding ¢, the property of being optimal is a 
local property. 
We define 
E = Eso := {8 € B* : B ŻKB NO = BSB} 


2 (30.3.7) 
= {8 € BX: KN BOB! = S}. 


(The set E also depends on the fixed embedding K — B, but this ‘reference’ embedding 
will remain fixed throughout.) In equation 30.3.7, we see two different ways to think 
about embeddings: we either move S and see how it fits into O, or we fix K and 
move O. 


Lemma 30.3.8. The map 
K*\E — Emb(S, O) 


BY $p 
where $g(@) := Blap for a € S is a bijection. 


(30.3.9) 


Proof. Immediate from 30.3.1. o 


We further organize our optimal embeddings up to conjugation as follows. 


30.3.10. Let O! < T < Ngx(O). Then the image of I in Ngx(O)/F* has finite index. 
For example, we may take IT = O%. (The scalars do not play a role in the theory of 
embeddings: they act by the identity under conjugation.) 

For y € I, and an optimal embedding ¢ € Emb(S, O), we obtain a new embed- 
ding via œ œ> y '¢(a)y, i.e., T acts on the right on Emb(S, O) by conjugation, and 
correspondingly on the right on E by right multiplication. 

Let 


Emb(S, O; T) := {T-conjugacy classes of optimal S — O} (30.3.11) 
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and 
m(S, O;T) := #Emb(S, O; T). (30.3.12) 


The quantity m(S, O; T) only depends on the type (isomorphism class) of O (trans- 
porting I" under the isomorphism of orders, of course). 
By Lemma 30.3.8, there is a bijection 


Emb(S, O; T) > K*\E/T. (30.3.13) 


We conclude this section by a comparison: for groups T sitting between unit groups 
and norm 1 unit groups, we can compare embedding numbers as follows. 


Lemma 30.3.14. If O! < T < O%, then 
m(S, O; T) = m(S, O; O*)[nrd(O*) : nrd(T) nrd(S*)]. (30.3.15) 
Proof. We have a surjective map 
Emb(S, O; T) => Emb(S, O; O*) 
and the lemma amounts to looking at the fibers. From (30.3.13), we turn instead to 
K*\E/T > K*\E/O*. 

For £ € E, the fiber over K*BO% is 

K*\K*BO"/T e KP*\KP*O*T @ (K®¥ n O*%)\O*/T (30.3.16) 
where KË = B-'K 8. But by hypothesis on 8, we have K®* N O* = SP% , so the fiber is 
in bijection with 

K*\K*BO*/T © SPX\O*XT. 


Finally, the reduced norm gives a homomorphism O* — R* with kernel O!, so since 
O! <T < O%, we have 


#5P*\ OX/T = #nrd(O*)/ nrd(SP*T) = [nrd(O*) : nrd(L) nrd(S*)] 
independent of £, giving the result. o 


Remark 30.3.17. The term optimal goes back at least to Schilling [Schi35], but the 
notion was studied in the context of maximal orders as well by Chevalley [Chev34], 
Hasse [Hass34], and Noether [Noe34]. The theory of optimal embeddings was devel- 
oped thereupon by Eichler [Eic56a, §3]; many key ideas can be seen transparently in 
Eichler [Eic73, Chapter II, §§3-5]. For further history up to the present, see Remark 
30.6.18. 


30.4 Counting embeddings, idelically: the trace formula 


In this section, we give a formula the number of conjugacy classes of embeddings, 
using local-global (idelic) methods. We retain notation from the previous section, but 
now specialize to the case where R is a global ring with F = Frac R. We use adelic 
(mostly idelic) notation as in 27.6.4. 
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We began in the previous section with a first embedding K —> B. As a reminder, the 
existence of such an embedding is determined by a local-global principle as follows. 


30.4.1. By the local-global principle for embeddings (Proposition 14.6.7, and 14.6.8 
for the case K = F x F), there exists an F-algebra embedding K —> B if and only if 
there exist F,,-algebra embeddings K, — B, for all v € PIF if and only if K, is a field 
for all v € Ram B. 


30.4.2. The definitions in the previous section extend to each completion. Let K = 
K 8r F and similarly S = S 8p R. Let 


f = Ty) < NRO 


be a subgroup whose image in Nz. (O)/ F* has finite index. For example, we can take 
I the congruence closure of T. 


We then analogously define 
Emba(S, O; I) = {T-conjugacy classes of optimal So O} (30.4.3) 


and tee oe 
m(S, O; T) := # Embg(S, O; T). (30.4.4) 


30.4.5. As in Lemma 30.3.8, we define 
B= Ezo i= (Pe BF ' RAG =F 5p} 


and obtain a bijection 
Emb(S, O; T) > K*\E/T. (30.4.6) 


under conjugation. 


We now show that the number of global embeddings is counted by a class number 
times the number of local embeddings. In view of Lemma 30.3.14, we may focus on 
the case I = O*. As usual, we write Cls O for the right class set of O. 


Theorem 30.4.7 (Trace formula). Let h(S) := # Pic S. Then 


B m(S, OLD; OL(1)*) = h(S)m(S, O; 0%). (30.4.8) 
[I]€Cls O 


See Remark 30.4.24 for an explanation of the name trace formula. 
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Proof. We decompose the set K*\E/ OX in two different ways. 
First, there is a natural map 
K*\E/O* > K*\E/O* (30.4.9) 
which is a surjective map of pointed sets. The fiber of (30.4.9) over the identity element 
i K*\K*/S* = Pic S. (30.4.10) 
We claim that the fibers of (30.4.9) may be similarly identified. Indeed, the fiber over 
K*BO% consists of the double cosets K*vBO* with KX € K*\K*, and 
K*9BO* = K*BO* 
if and only if ae 
VB = pph 

with p € K* and fi € O%, if and only if 

p '¥ = pup! c KX n BOR! = (30.4.11) 


(since B € E), if and only if K*v c K* S*, as claimed. From the claim and (30.4.6), 
we conclude that o ee 
#(K*\ E/O*) = h(S)m(S, O; O%). (30.4.12) 


On the other hand, each BO* cE / O* defines a right O-ideal; intersecting with B 
and organizing these right ideals by their classes, we will now show that they give rise 
to optimal embeddings of the corresponding left order. For brevity, write Oz = O_(/). 
There is a map of pointed sets 


K*\E/O* > B*\B*/O*% > ClsO. (30.4.13) 
Choose representatives 
B*\ BX /O* = |_| B*@O* (30.4.14) 
[I]€Cls O 


so that J = aO N B. Then 
Or =01() = ONT N B. 
Let 
Er = {B € BX: KN BO;B"! = S). (30.4.15) 
Now if Eon Pore 
BO* € E/O%, 
then there exists a unique J such that 


B* BO* (a B*@10>, 
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and therefore PEN pa 
BO* = (B@r)O* 
for some £ € B*. If 6’ € B* is another, then 
BB € @r0*@7! N B = OF 
so the class BO* is well-defined. 
We claim that 6 € Ez, and conversely if 6 € Er then B = Ba;' € E. Indeed, 


K N BOIB = K A (Bar NEO 

= K n BOG! 
so K N BO, 6"! = S if and only if K BOB"! = §. Therefore there is a bijection 
K*\E/O* > |_| K*\E)/O% > | _|Emb(S,0;;07) 

7 7 


(30.4.16) 
K*BO* > K*BO%. 
Putting (30.4.12) and (30.4.16) together and counting, the theorem follows. o 
When O has class number 1, we hit the embedding number on the nose. 
Corollary 30.4.17. If #Cls O = 1, then 
m(S, O; O*) = h(S)m(S, O; O*). 
Proof. Immediate from Theorem 30.4.7. o 


We recall that the genus of O (Definition 17.4.8) is the set Gen O of R-orders 
O’ C B locally isomorphic to O. 


Corollary 30.4.18. Jf Emb(Sp, Op) + 0 for all primes p, then there exists an order 
O’ € Gen O such that Emb(S, O’) + 0. 


Proof. By the trace formula, we have Ip m(Sp, Op; 0%) 7 mS, Ô; Ox) > 0, so since 
all terms in the formula are nonnegative integers, we have m(S, OL (D); OL(1)*) > 0 for 
some [7] € Cls O, and O.U/) € Gen O. (m 


Remark 30.4.19. The possible failure of local optimal embeddings to ‘glue’ to a global 
optimal embedding into O is measured by the phenomenon of selectivity, studied in 
Chapter 31. 


30.4.20. More generally, an isomorphism ¢: O æ O’ of orders induces a bijection 
Embr(S, O) @ Embr(S, O’), and ¢(O*) = O’%, so we may group together the terms 
in (30.4.8) according to the type set Typ O. Let 


h(O) := [Id1 O : Pldl O] 
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be index of the subgroup principal two-sided fractional O-ideals inside the invert- 
ible ones, studied in section 18.5. By Proposition 18.5.10, the equation (30.4.8) then 
becomes bo 
> h(O')m(S, O’; O") = h(S)m(S, O; O*). (30.4.21) 
[O’]eTyp O 


Remark 30.4.22. The foundational formula (30.4.8) is proven by counting a set two 
different ways. As such, it admits a purely combinatorial proof: Brzezinski [Brz89] 
shows that it follows from looking at “transitive actions of groups on pairs of sets and 
on relations invariant with respect to these actions” [Brz89, p. 199]. 


We also record the following corollary (in the vein of Remark 30.4.22). 
Corollary 30.4.23. Left multiplication defines a group action 
K*\K*/S*OK*\E/O* 
that defines a free action of the group Pic S on the set 


|_| Emb(S, OL (J); OLD”). 
[]eCls O 


Proof. In the proof of the trace formula, we identified K” \K*/S* = Pic S and estab- 
lished an idelic to global bijection in (30.4.16); in (30.4.11), each fiber of the map 
(30.4.9) is identified with Pic S, describing the orbits of the action. (See also Exercise 
30.12, where a direct proof is requested.) o 


Remark 30.4.24. Theorem 30.4.7 is called a trace formula as it can be applied to 
compute the trace of certain matrices (called Brandt matrices) that encode the action 
of Hecke operators on a space of modular forms. We return to this important point of 
view in detail in Chapter 41, and apply the above formula to compute traces in section 
41.5. 


30.5 Local embedding numbers: maximal orders 


In view of Theorem 30.4.7, we see that up to a class number of the base ring, optimal 
embeddings are counted in purely local terms. In this section and the next, we compute 
the relevant local embedding numbers; after that, we will return to the global setting 
to put the results together. 

To this end, in this section and the next, we suppose R is local (as in 23.2.1): so R 
is a complete DVR with maximal ideal p = 7R and finite residue field k := R/p. 


30.5.1. Since we are now local, the R-order S is free and so S = R[y] for some (not 
unique) y € S. Let f(x) := x- t,x + n, be the minimal polynomial of y, and let 
dy := E — An, be the discriminant of fy, equal to the discriminant of S in R/ R*, An 
R-algebra embedding from S is then determined uniquely by the image of y. 
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We now compute (local) embedding numbers in the case of a maximal order; to 
do so, we introduce some notation. 


30.5.2. Recalling that K 2 F is a separable quadratic F-algebra and p is the maximal 
ideal of R, we define a symbol recording the splitting of the prime p in K (mirroring 
the Kronecker symbol): 
1, if K ~ F x F is split; 
(5) := 4 0, if K D F is a ramified field extension; 
j -1 if K 2 F is an unramified field extension. 


If q = #k is odd, and K = F[x]/(x? — d), then 


is the usual Legendre symbol. 
Proposition 30.5.3. The following statements hold. 


(a) We have m(S, M2(R); GL2(R)) = 1. 
(b) Suppose B is a division algebra and O C B its valuation ring. If K is a field and 
S = Rx is integrally closed, then 


mS, O; Npx(O)) =1 
K 
m(S, O; O*) = 1- (=). 
p 
If K is not a field or if S is not integrally closed, then Emb(S, O) = 0. 


We recall that No, (F)(M2(R)) = F* GL2(R), so (a) also includes the case of the 
normalizer. 


Proof. First, part (a). We have at least one embedding ¢: S — Endr(S) ~ M2(R) 
given by the regular representation of S on itself (in a basis): in the basis 1, y, we have 


yr h l (30.5.4) 


a matrix in rational canonical form. This embedding is optimal, because if x, y € F 
satisfy 
_ {x -ny 
$lx + yy) = p ae A € M2(R) 
then x, y € R already, so ¢(K) N M2(R) = A(S). 

To finish (a), we need to show that the embedding (30.5.4) is the unique one, up to 
conjugation by GL2(R). So let y: S — M2(R) be another optimal embedding. Then 
via w, the R-module M = R? (column vectors) has the structure of a left S-module; the 
condition that y is optimal translates into the condition that the (left) multiplicator ring 
of R? in K is precisely S, and therefore M is invertible as a left S-module by Exercise 
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16.13, and therefore principal, generated by x € M, so that M = Sx = Rx + Ryx. In 
the R-basis x, yx, the left regular representation has the form (30.5.4), completing the 
proof. 

Here is a second quick matrix proof (finding explicitly the cyclic basis). Let y œ> 


h 4 € M2(R) be an optimal embedding; then at least one of b,c,d —a € R*. 
Therefore there exist x1, x2 € R such that g(x}, x2) = cx? + (d — a)xı X2 — bx; e R”. 
Let x = (x1, x2)'; then yx = (axı + bx2, cxy + dx2)*. Let œ € M2(R) be the matrix with 


columns x, yx. Then det æ = q(x1, x2) € R*, so in fact œ € GL2(R). We then compute 
Ji É n- i —(ad — a) 
c d 1 a+d 
as claimed. (This matrix proof will be generalized in the next section, when we work 
more generally with Eichler orders.) 

Next, part (b). By Corollary 13.4.5, there exists an embedding K —> B if and 
only if K is a field, so suppose K is a field. By Proposition 13.3.4, the valuation 
ring O is the unique maximal R-order in B, consisting of all integral elements, so 
the embedding restricts to an embedding S <> O by uniqueness. Suppose S = Rx; 
then an embedding S — O extends to K —> B so is conjugate 30.3.1 to every other 
under the action of B*. But the valuation ring is unique, thus Ngx(O) = B*. Thus 
m(S, O; Ngx(O)) = 1. Finally, we have Ngx(O)/(F*%O*%) = Z/2Z, generated by any 
j € O with nrd(j) = ~. If the extension K 2 F is ramified, then K = F(j) so j 
centralizes K, and m(S,O; Ngx(O)) = m(S, O; O*) = 1. If instead K D F is inert, 
then B = (K, j | F), and conjugation by j normalizes but does not centralize K, and 
m(S, O; O*) = 2. o 


In all cases, the local embedding numbers are finite. 
Corollary 30.5.5. m(S, O; T) < œ. 


Proof. Recalling that the image of T in Ngx(O)/F* is a subgroup of finite index, we 
know that the natural surjective map 


Emb(S, O; T) > Emb(S, O; Ngx(O)) 


is finite-to-one; applying this argument twice we reduce to the case that IT = O*. 

Let O’ 2 O be a maximal R-order. Then O* < O’*. Moreover, each ġ € Emb(S, O) 
gives (by composing with O — O’) an embedding ¢: S — O’ that is optimal for 
some superorder S” 2 S. Thus 


m(S, 0; 0%) < $ [0% : O*]m(S', 0'; 0%); 
S'2S 
There are only finitely many superorders S’ in the sum, since the integral closure Rx 


contains all S’ so [Rx : S’]r | [Rx : S]r—or equally well, compare discriminants. 
Applying Proposition 30.5.3, we conclude m(S, O; O*) < oo. o 
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30.6 = Local embedding numbers: Eichler orders 


At this point, the presentation of local embedding numbers begins to run off the rails: 
for more general classes of orders, formulas for local embedding numbers are rarely 
as simple as in Proposition 30.5.3. To avoid tumbling too far, we present formulas for 
the case where O is a local Eichler (residually split) order following Hijikata [Hij74, 
Theorem 2.3]. See Remark 30.6.18 for further reference. 

In this section, we retain the assumption that R is local and the notation in 30.5.1. 
Further, we suppose O is an Eichler order of level p° with e > 0. 


0 1 f 
w= |5 0) £0: 


Npx(O)/F*O* = (w). (30.6.2) 


If e = 0, then w € O* and Ngx(O) = F*O%; if instead e > 1, then (©) = Z/2Z. 
For ¢ € Emb(S, O), let 67 be defined by 


30.6.1. Let 


then 


67: SOO 
$7 (a) = 7 '4(a)a. 
By (30.6.2), ¢, ¢’ € Emb(S, O) are equivalent in Emb(S, O; Ngx(O)) if and only if ¢’ 
is equivalent to either ¢ or 6” in Emb(S, O; O*). 
Lemma 30.6.3. Let ¢ € Emb(S, O). Then there exist x € R and v € Ngx(O) such that 
fix) =x? -tx+nm, =0 (mod p°) 
and 


ray =(_ h(x) = | (30.6.4) 


-xX 


Proof. We may suppose that O is the standard Eichler order. Let ø: S —> O be an 
embedding, with 


oy) = E seo 


so that a, b,c,d € R. We have t, = trd(y) = a + d and n, = nrd(y) = ad — ben®. We 
observe that ¢ is optimal if and only if at least one of b,c,a — d € R*: indeed, there 
exists z € R such that (y — z)/a € O if and only if all belong to p. 

Suppose now that ¢ is optimal. We have three cases. 


If b € R*, then take vy = i pa € O*; we compute 
fia) = a - (a + d)a + (ad — ben®) = —ben® 


and 


-1 _[{ a 1\ | a 1 
v by)v= i i} = E ps ) (30.6.5) 
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as desired. 
If c € R*, then we take v = w = P ar now 
nm 0 
d c 
=i 
v d(y)v = K c) (30.6.6) 


and we apply the previous case. 
Finally, if a — d € R*, we may suppose b € p as well as c € p if e = 0, and then we 


1 1 
take v = ( i) to find 


a— cr Eo (30.6.7) 


4 _ 
vay = cn cn’ +d 


to reduce again to the first case. o 


Definition 30.6.8. An optimal embedding ¢ € Emb(S, O) is normalized and associ- 


ated to x € Rif 
x 1 
Hy) E es ty _ 3 


(as in (30.6.4). 


The statement of Lemma 30.6.3 is that for all @ € Emb(S, O), either the class of ¢ 
or of ¢” in Emb(S, O; O%) is represented by a normalized embedding. To conclude our 
efforts, we need to check which of these are equivalent. We recall that d, = i, —Any. 


Lemma 30.6.9. Let ¢, ¢’ be normalized embeddings associated to x, x’ € R. Then the 
following statements hold. 


(a) 6, ¢' are conjugate by O* if and only if x = x’ (mod p°). 

(b) If dy € R* or e = 0, then ¢” is equivalent to ¢’ in Emb(S, O; O*) if and only if 
x’ = ty — x (mod pê). 

(c) If dy ¢ R* and e > 1, then 6” is equivalent to ¢’ in Emb(S, O; O”) if and only 
if x’ = t, — x (mod pê) and f,(x) # 0 (mod petty, 


Proof. First (a). If ¢, 6’ are conjugate by some u € O*, we reduce modulo p° M2(R) 
to obtain the ring of upper triangular matrices, and see that the diagonal entries of 
o(y), $’(y) are congruent modulo p° and in particular x = x’ (mod pf). Conversely, 


if x = x’ (mod p°), then let u = P a l we confirm that 
daomu En l J-0m 
H YH f(x’) t y). 


, 
yX 


In preparation for (b) and (c), we note that 


$7 (y) = e X =EN r) ; (30.6.10) 


X 
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We now prove (b). If e = 0, the statement is true: @ € O* and all embeddings are 
conjugate. Suppose e > 1. Since f(x) = 0 (mod pf) we have 


dy = (x - X} = (x - (t - x)? = (t - 2x)? (mod p°) 


soy = 2x € RY. Foru € R, letu = (4 7 


0 d and consider 


e ult- 2x) - une — m~ f(x) 
x +un® 


-10 tb, —X- Un 
K'o ow=[” - 


F =x u(t, — 2x) — f(x)/n° 


me x 


(30.6.11) 


Ill 


) (mod p°). 


Thus we may choose u so that the top-right entry of (30.6.11) is equal to 1. The result 
then follows from (a). 

We conclude with (c), and we are given e > 1 and dy ¢ R*. If n~® f(x) € R* then 
from (30.6.10) and looking back at (30.6.5), already 6” is conjugate under O* to the 
normalized embedding associated to t — x, so by (a), we then have 6” equivalent to ¢’ 
if (and only if) x = t — x (mod pê). To finish, suppose f(x) € p°*! and d € p. Reducing 
modulo p° M2(R), the ring O/(p* M2(R)) consists of upper-triangular matrices and its 
unit group is generated by diagonal matrices and units u as in the previous paragraph. 
But then by (30.6.11), the top-right entry of every O*-conjugate of 6” (y) belongs to 
p, and so it cannot be equal to ¢’(y). o 


The statements above give a way to compute the local embedding number in terms 
of arithmetic of R. 


Proposition 30.6.12. For s € Zs, let 


M(s) := {x € R/p* : fy(x) = 0 (mod p°)}. (30.6.13) 
Then for e > 1, 
m(S,0;0%) = a. PN 
#M(e)+#img(M(e + 1) — R/pf), otherwise. 


Proof. By Lemma 30.6.3, the set Emb(S, O; O*) is represented by the set of normal- 
ized embeddings and their conjugates under w. By Lemma 30.6.9(a), the normalized 
embeddings according to x € M(s) are distinct; by (b)-(c), the remaining conjugate 
embeddings are new when they lift to M(e + 1). o 


Example 30.6.14. Let R = Zp and S = Z2[V-1], so y = V-1 and f(x) = x? +1. We 
have 
M(s) = {1 mod 2}, ifs =1; 
0, ifs > 2. 
Therefore by Proposition 30.6.12, if O is an Eichler order of level 2° over Zz, then 
l, ife<1; 


m(Z2[V-1], O; O*) = é WE (30.6.15) 
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We record an important special case. 


Lemma 30.6.16. [fe = 1 and S is a maximal R-order in K (equivalently, S is integrally 
closed in K), then 


K 
m(S,O;O*) = 1+ (5) 
p 
Proof. Referring to Proposition 30.6.12, we first compute #M (1), the number of solu- 
tions to fy(x) = 0 (mod p), so over the finite field k = R/p: this number is 2, 1, 0 accord- 
K K 
ing as | — | = 1,0, —1. If d € R*, we are done; so suppose d ¢ R*, hence x) = 0. Let 


x € M(2). We claim that (y — x)/z is integral over R, contradicting that S is maximal. 
Indeed, 7° | fy) = x — t,x + n, = nrd(y — x); and p | dy = L - 4n, = (ty - 2x)*, so 
p | (t) — 2x) = trd(y — x). Thus M(2) = Ø and so the count remains the same. o 


To conclude, when q = #k is odd, we can make Proposition 30.6.12 completely 
explicit. 


Lemma 30.6.17. Suppose #k = q is odd, that e > 1, and let f := ordy (dy). 
(a) If f = 0, then k 
m(S, O; O*) = 1+ (5) 


(b) Ife < f, then 


m(S, O;O*) = age V2, if e is odd; 
_ 7 ge? (q +1),  ife is even. 


(c) Ife = f, then 
g f=, if f is odd; 
8,050") = K 
m( =) FP 4 gf PH! f + (5)} if f is even. 


(d) Ife > f >O, then 


0, if f is odd; 
m(S, O; O*) = q! (q +1) (! + (5): if f is even. 


Proof. Since the residue field k has odd characteristic, we can complete the square 
and without loss of generality we may suppose that trd(y) = 0, and 


M(s) = {x € R/p* : x? = d (mod p*)} . 


We will abbreviate m = m(S, O; O*) and repeatedly refer to Proposition 30.6. 12. 
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First suppose f = 0. Then dy € R*, so by Proposition 30.6.12, we have m = #M (e). 
But by Hensel’s lemma, #M (e) = 0 or 2 according as d is a square or not in R for all 
ezl. 

Next suppose that e < f. The solutions to the equation x? = 0 (mod p°) are 
those with x = 0 (mod pls/21), Thus #M(e) = ge fel = gle! and we see that 
#img (M(e + 1) > R/p®) = get), so m = 2g€-)/? if e is odd and m = q°° + 
q€! = q/q + 1) if e is even. 

Ife = f, then again #M (e) = q!¢/*!, To count the second contributing set, we must 
solve x? = d, (mod p°*!). If e = f is odd then this congruence has no solution. If 
instead e is even then we must solve y? = (x/nf!?)? = dy|nt (mod p) where z is a 
uniformizer at p. This latter congruence has zero or two solutions according as d is 
a square, and given such a solution y we have the solutions x = y (mod 2/?*!) to 
the original congruence, and hence there are 0 or 2gf-f/?+") = 2¢f/?-! solutions, as 
claimed. 

Finally, suppose e > f > 0. If f is odd, there are no solutions to x? = d, (mod p°). 
If f is even, there are no solutions if d, is not a square and otherwise the solutions are 
x = y (mod p*-/?) as above so they total 2g//* + 2gf/?-! = 2gf/?-\(q + 1). o 


On the other hand, when the residue field k has even characteristic, the computations 
become even more involved! 


Remark 30.6.18. Eichler studied optimal embeddings [Eic38b, §2] very early on, 
computing the contribution of units (coming from embeddings of Z[w] and Z[i] in a 
maximal order O) in the mass formula. He then [Eic56a, §3] studied more generally 
optimal embeddings of quadratic orders into his Eichler orders of squarefree level. 
Hijikata [Hij74, §2] studied optimal embeddings in the context of computing traces 
of Hecke operators on To(N) (general N), with embedding numbers given for certain 
orders. See also Eichler’s treatment [Eic73, §3] in the context of the basis problem for 
modular forms, as well as Pizer’s presentation [Piz76a, §3]. (See Remark 41.5.13 for 
further detail.) Brzezinski [Brz91, Corollary 1.16] (a typo has it appear as Corollary 
1.6) gives a general formula for Eichler orders (which is to say, a generalization of 
Lemma 30.6.17 to include q even)—the proof method is different than the method of 
Hijikata above, and the answer is organized a bit differently than Lemma 30.6.17. 

But these papers are just the beginning, and there is a cornucopia of further results. 
Many of these are obtained in pursuit of progressively more general forms of the trace 
formula (see for example the summary of results by Hashimoto [Hash77]) for Eichler 
orders. Shimizu [Shz63, §§26—27] considered embedding numbers over totally real 
fields in computing the contribution of elliptic elements to formulas for the dimension 
of spaces of cusp forms and later for the trace formula [Shz65, §3]. The contributions 
of elliptic elements over totally real fields was also pursued by Prestel [Pre68, §5] and 
more generally for embeddings by Schneider [Schn75] (and quite explicitly for real 
quadratic fields [Schn77]) and Vignéras [Vig76a, §4]. 

Pizer [Piz76b, §§3-5] considered optimal embeddings for residually split orders 
(see 24.3.7) over Q: these were then applied to further cases of the basis problem for 
modular forms [Piz76c, Piz80b]. Then Hijikata—Pizer-Shemanske [HPS89b, §§ 1-5] 
developed in a uniform manner the optimal embedding theory for basic orders (they 
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called them special, cf. Remark 24.5.7): the application to the trace formula is then 
contained in their monograph [HPS89a]. Brzezinski [Brz90, $3] also obtains recursive 
formulas for optimal embedding numbers of a local Bass (equivalently, basic) order 
(in characteristic not 2), using an effective description of the automorphism group of 
the order. 


30.7 Global embedding numbers 


We now combine the ingredients from the previous three sections to arrive at a formula 
for global embedding numbers. 
We return to the setting of section 30.4, with R a global ring. 


30.7.1. Our global ring R = Riz comes from an eligible set T C PIF. (This set is 
usually denoted S, but we do not want any confusion with the quadratic R-algebra 
SCK.) 

For all but finitely many places v ¢ T, we have O, ~ M2(R,) maximal and T, = 
O*. By Proposition 30.5.3(a), for such places v, we have #Embp, (Sy, Oy; O%) = 1. 
Therefore the number mS, Ô; D) = # Emb RG, O; D) of adelic embeddings is given by 
the (well-defined, finite) product 


m(8,0; D) = I] m(Sy, Oy; Ty) (30.7.2) 
véT 


(well-defined and finite). 
We arrive at the following first global result. 


Theorem 30.7.3. Let N = discrd(O). Then 
Dd) mS, OL); OLD”) = ACS) | | m(Sp, Op; OF). 


[[]eCls O plot 


Proof. For all p { N, we have Op ~ Mo(Ry), so the result follows by combining 
Theorem 30.4.7 with 30.7.1. o 


Working with the normalizer instead, we find the following result. 
Proposition 30.7.4. Suppose that O is a maximal order with # Cls O = 1. Then 
m(S, O; Ngx(O)) = h(S). 


Proof. As in the proof of Theorem 30.7.3, but with a further appeal to Proposition 
30.5.3 to show that all local embeddings numbers are equal to 1. o 


An important illustrative special case is the following. 


Example 30.7.5. Let D = discr B and suppose that O C B is an Eichler R-order of 
squarefree level Wt, so discrd O = ON. Suppose further that S is a maximal R-order 
in K. Then Theorem 30.7.3 reads 


g msounour=wf-() (6 


[T]eCls O pid pI 
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with the local embedding numbers computed in Proposition 30.5.3(b) for p | © and 
Lemma 30.6.16 for p | M. 

Suppose further that B is T-indefinite and that #Clo R = 1; then#ClsO = #Clo R 
by Corollary 28.5.17 (an application of strong approximation), so 


m(S,0;0%) = h(S)| | f = (=) II f 4 (=)). (30.7.6) 


pid pI 
Embeddings of cyclotomic orders are of particular interest. 


Example 30.7.7. Consider the case F = Q and R = Z. Let D = disc B and suppose 
that O is an Eichler order of level M, so N = DM = discrd O. We recall D is squarefree 
and gcd(D, M) = 1. Suppose B is indefinite. Then by Theorem 28.2.11 (an application 
of strong approximation), # Cls O = 1. 

If K 2 Qis a cyclotomic quadratic extension, then either K = Q(w) = Q(v-3) or 
K = Qi) = Q(V-4) with corresponding rings of integers Zg = Z[w] and Zg = Z[i], 
each with h(Zx) = 1. With local embedding numbers computed in Lemma 30.6.17, 
Theorem 30.7.3 then gives 


m(Z[w], O; OX) = MO -(S) 00+): if 9} M; 


Similarly, using (30.6.15), 


m(Z{i], O; O*) = ne-E) if4ł M; 


0, if4|M. 


Absent further hypothesis, it is difficult to tease apart the term m(S, O; O*) from 
the sum over left orders in Theorem 30.4.7. In the next chapter, we will show that the 
hypothesis that B is T-indefinite is almost enough. 


30.8 Class number formula 


In this section, we explain how the theory of optimal embeddings can be used to convert 
the mass formula into a class number formula, following Eichler. 

Suppose throughout this section that B is T-definite. By Lemma 26.5.1, the group 
O% /RÝ is finite; let wo = [O* : R*]. 


30.8.1. To a nontrivial cyclic subgroup of O*/R*, we associate the quadratic field K 
it generates over F; such a field is necessarily cyclotomic, with K ~ F(&q) for q the 
order of the cyclic subgroup. (Only certain F(¢2,) arise as quadratic extensions of F, 
and different values of q can give rise to the same quadratic field up to isomorphism.) 

Conversely, to a quadratic field K 2 F, we obtain a (possibly trivial) cyclic subgroup 
(K* N O*)/R*. 
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Lemma 30.8.2. Every nontrivial aR* € O*/R* belongs to a unique maximal cyclic 
subgroup. 


Proof. Since O*/R% is finite, œ belongs to at least one maximal cyclic subgroup; if it 
belonged to two, then the corresponding quadratic fields would both contain the field 
corresponding to a, hence by degrees would be equal, so by maximality the cyclic 
subgroups would have to be equal. o 


30.8.3. Recall that m(S, O; O*) counts optimal embeddings ¢: S — O up to conju- 
gation by O*. Since O*/R* is finite, the set Emb(S, O) is itself finite. Precisely two 
embeddings give rise to the same image ¢(S), differing by the (necessarily nontrivial) 
standard involution. The stabilizer of O* on ¢ € Emb(S, O) is OX NA(S) = (S)* = S*. 
Let ws = [S* : R*]. We have shown that 


m(S, O; OX) = #16(S) CO: 6 € Emb(S; os, (30.8.4) 
O 


Proposition 30.8.5. We have 


1 1 1 
Lae = (1-2) >, m(S,0; 0%). 
WO 2 52 q i 


S*:R*]=g 


Proof. We count off the group O*/R* by maximal cyclic subgroups, keeping track 
of the trivial class. By Lemma 30.8.2, every nontrivial aR* € O*/R* belongs to 
a unique maximal cyclic subgroup of some order q > 2: such a subgroup is of the 
form $(S)*/R* with (S) € O an optimally embedded order, and has q — 1 nontrivial 
elements. Therefore 


wo-1l= > a (q — DHEIS) E O : ġ € Emb(S;0)}. 
q22 SCKg 
[S*:R*]=q 


Plugging in (30.8.4), we obtain 
Z ; X - px Wo. 


q22 SCKg 
[S*:R*]=q 


dividing through by wo gives the result. o 


We now recall the Eichler mass formula (Main Theorem 26.1.5), giving an explicit 
formula for the weighted class number 


mass(Cls O) := ` LN 


teciso ™! 


where wy = [OL(D* : RX]. 
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Main Theorem 30.8.6 (Eichler class number formula). Let R be a global ring with 
eligible set T, let B be an T-definite quaternion algebra over F = Frac R, and let O C B 
be an R-order. Then 


1 1 an Ay 

#Cls O = mass(ClsO) + 5 ) (! - 5) X, A(sym(S, 0; 0%) 
q22 [S*:R*]=q 

where the inner sum is over all quadratic R-algebras S 2 R such that [S* : R*] = q € 


Z>2, and h(S) = #Pic S. 


Proof. We apply Proposition 30.8.5 to each order O, (7) for [I] € Cls O and sum. We 
obtain 


1 
(: — 2) = # Cls O — mass(Cls O) 
[]eCls O wI 


1 1 
=5 > f = =) > > m(S, OLD; OLD”) 
[S*:R*]=q [I] 


q22 IJeCls O 
1 1 an a 
-15 (1 5) Yn SymS,6;0%, 
q22 [S*:R* ]=¢ 
the last equality by Theorem 30.4.7. o 


The expressions in the Eichler class number formula (Main Theorem 30.8.6) are 
arithmetically involved but can be computed effectively. In special cases, they give 
reasonably simple formulas, as previously advertised. 


Example 30.8.7. When F = Q, Main Theorem 30.8.6 yields the formula given in 
Theorem 30.1.5, with the computation of local class numbers given in Example 30.7.7. 


Remark 30.8.8. The formula for the class number for Eichler orders of squarefree 
level (i.e., hereditary orders) was given by Eichler [Eic56a, Satz 10-11]; for Eichler 
orders over Q of arbitrary level (as in Example 30.8.7), it was given by Pizer [Piz76a, 
Theorem 16]. Main Theorem 30.8.6 is proven by Vignéras [Vig80a, Corollaire V.2.5] 
and Körner [Kör87, Theorem 2]. For further reference and discussion (in the context 
of computing local embedding numbers), see Remark 30.6.18. 


30.8.9. Let K > F be a quadratic field extension, let Rx be the integral closure of R in 
K, and let S C K be an R-order of conductor f. Then there is an exact sequence 


x 
1> S* > RR > (REARKY” L, Pic S > Pic Rg > 1 (30.8.10) 
(R/f)* 
giving rise to the formula of Dedekind 
me Olli- (s) 
h(S) = TRX: SN 1-|—]—~ 30.8.11 
(S) = o|] 1 (30.8.11) 


plf 
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K 
where N is the absolute norm and (5) is given (globally) as in 30.5.2: 


k —1, ifp is inert in K; 
(5) = 40, if p is ramified in K; 
1, ifp splits in K. 


30.9 Type number formula 


We continue with the hypotheses of the previous section. A further application of the 
strategy to compute the class number is to also compute the type number. The methods 
are indeed quite similar: rearranging Corollary 18.5.12, we have 


#ClsO 1 
PO $ f = ) (30.9.1) 
vit [O’]eTyp O ae 


#TypO = 


where zo = [Ngx(O’) : FXO’). But now the structure of the normalizer groups come 
into play, and one can give a type number formula similar to the class number formula 
30.8.6 in terms of certain embedding numbers at least for Eichler orders. Unfortunately, 
even over Q, these formulas quickly get very complicated! To give a sense of what can 
be proven, in this section we provide a type number formula in a special but interesting 
case due originally to Deuring [Deu51], and we refer to Remark 30.9.12 for further 
reference. 


Proposition 30.9.2 (Deuring). Let B be a definite quaternion algebra over Q with 
disc B = p prime and let O C B be a maximal order. Then # TypO = 1 for p = 2,3, 
and for p = 5, 


1 1 
#TypO = z* Cis O+ z CP) + h(-4p)) 
where [h(—p)] = h(—p) when p = 3 (mod 4) and is 0 otherwise. 


Proof. In light of (30.9.1), we begin by considering the Picard group Pic O (with 

R = Z): by 18.4.8, we have an isomorphism Pic(O) ~ Z/2Z generated by the unique 

right ideal J C O with nrd(J/) = p. The ideal J is automatically two-sided and contains 

all elements of reduced norm divisible by p (see 13.3.7); and thus J is principal if and 

only if there exists an element «œ € O with nrd(q) = p if and only if zo = 2. 
Therefore, (30.9.1) reads 


#TypO = ly ClsO + l > #{J’ c O’ principal right ideal : nrd(J’) = p}. 
2 [O’]eTyp O 
(30.9.3) 
We now compute this sum in terms of embedding numbers. First, the map a œ> «O 
gives 
#{J C O principal right ideal : nrd(J) = p} 
(30.9.4) 


= ee € O: nrd(a) = p} 
2wo 
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where wọ = [O* : Z*]. 

Next, we claim that if a € O has nrd(q) = p, then trd(@) = 0, i.e., a? + p=0. 
Indeed, if t = trd(@) then the field K = Q(q) has discriminant t? — 4p < 0 (since B is 
definite) so |t| < 2./p. If t + 0, then the polynomial x? — tx + p splits modulo p, so p 
splits in K; but K, —> Bp and B, is a division algebra, so Ķ, is a field, a contradiction. 
Thus ¢ = 0. 

With these results in hand, we can bring in the theory of embedding numbers. We 


have 
#a € O : nrd(a) = p} = Ha € O: a* +p =0} 


= J, #Emb(S,0). (30.9.5) 
S2Z[V-p] 
The group O*/{+1} acts by conjugation on Emb(S, O) without fixed points as in 30.8.3: 
since p + 2,3, we have S* = {+1}. Thus 
#Emb(S, O) = wom(S, O; O*). (30.9.6) 
Combining (30.9.4), (30.9.5), and (30.9.6), and plugging into (30.9.3), we have 


1 1 
#TypO = 5#ClsO + 7 > ` m(S, O'; 0”). (30.9.7) 
S2Z[y-p] [O’]eTyp O 


By (30.4.21) (rewriting Theorem 30.4.7), for $ 2 Z[./—p] we have 


>, h(O’)m(S, O'; O%) = h(S)m(S, O; O*) 
[O’JeTypO 


where A(O’) = [IdlO’ : PIdlO’]; but A(O’) = 1 whenever m(S, O’; O'%) + 0 by the 
first paragraph, so we may substitute into (30.9.7) to get 


1 1 este) ek 
#TypO = z“ Cls O + z >; h(S)m(S, O; O*). (30.9.8) 
S2Z[y-Pp] 


The order O is maximal, so the adelic embedding number is the product of local 
embedding numbers computed in Proposition 30.5.3: there is only a possible contribu- 
tion at p, since p + 2 the order S is maximal, and K is ramified so m(Sp, Op; OF) = 1, 
thus m(S,O; O*) = 1. 

Finally, the order Z[./—p] of discriminant —4p is maximal whenever p = 1 (mod 4) 
and the sum becomes simply h(—4p); when p = 3 (mod 4), this order is contained in 
the maximal order of discriminant —p, so the sum is h(—p) + h(—4p). The result is 
proven. o 


Remark 30.9.9. For an alternate direct proof of Proposition 30.9.2 working with elliptic 
curves, see Cox [Cox89, Theorem 14.18]. 
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30.9.10. The sum of class numbers in Proposition 30.9.2 can be rewritten uniformly 
in terms of the ring of integers as follows: 


1, ifp=1 (mod 4); 
[h(—p)] + h(—4p) = #C1Q(V-p) - 44, ifp = 3 (mod 8); 
2, if p =7 (mod 8). 


When p = | (mod 4), there is nothing to do. For p = 3 (mod 4), we have 


3h(—p), ifp = 3 (mod 8); 


= ae if p = 7 (mod 8); 


according as 2 is inert or split in K = Q(/—p). 


Remark 30.9.11. In section 42.1—42.2, we relate quaternion algebras to supersingular 
elliptic curves; in this language, Proposition 30.9.2 gives rise to a formula for the 
number of supersingular elliptic curves defined over F, up to isomorphism. 


Remark 30.9.12. Eichler [Eic56a, Satz 11] gave a type number formula for definite 
hereditary orders over a totally real field; this formula has an error which was corrected 
by Peters [Pet69, Satz 14, Satz 15] over fields of class number one and by Pizer [Piz73, 
Theorem A] in general. Pizer [Piz76a, Theorem 26] gives a formula for the type number 
for (general) Eichler orders over Q. Finally, Vignéras [ Vig80a, Corollaire V.2.6] gives 
a “structural” type number formula (without explicit evaluation of the sum) for Eichler 
orders, and Korner [K6r87, Theorem 3] gives a general type number formula. For a 
generalization to totally definite orders in central simple algebras of prime index over 
global fields, see Brzezinski [Brz97]. 


Exercises 


1. As in section 30.1, for k > 1 let 
re(n) := #{(x1, x2, ..-,XK) € ZÉ Ex bag te tax =n}. 


We gave formulas for r3(n), r4(n). For completeness, observe that rı(n) = 2,0 
according as n is a square or not, and give a formula for r2(n) in terms of the 
factorization of n in the ring Z[/]. 


> 2. Let B be a quaternion algebra over Q, let O C B be an order, let K be a quadratic 
field with an embedding K — B and suppose $ = K N O is the ring of integers 
of K. 


(a) Let b c K be an invertible fractional S-ideal. Show that bOn K = b. [Hint: 
since 1 € O, we have bO N K 2D b. For the other inclusion, consider 


(bON K)-6-'b c (bb 'ONK)- b= b.] 
(b) Rewrite the proof in (a) idelically. 
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> 3. Letn = 1,2 (mod 4). 


(a) Let p be an odd prime, and let œ, œ’ € M>(Z,) satisfy a? +n = 0. Show 
that there exists u € GL2(Z,) such that a’ = Wtan. 


1,-1 
(b) Let Bo = o) and O% its valuation ring. Show that if œ, œ’ satisfy 


the same reduced characteristic polynomial, then there exists v € N, BX (02) 
such that a’ = v™!æv. [Hint: Npx(O2) = BY.] 
(c) Put together (a) and (b) to conclude (30.2.4). 
4. Deduce the theorem of Gauss (Theorem 30.1.3) from Theorem 30.4.7. 


5. Letm,n € Zso be odd, coprime, with m < 24n. Show that there are 12h(m?—4n) 
solutions to the equations 


@ tbet n 
a+b+c+d=m 


with a, b,c, d € Z. [Hint: consider the unit (1—i— j — k)/2 of the Hurwitz order. ] 

6. Specialize Theorem 30.1.5 to the case D = p and N = 1, as follows. Let B be a 
definite quaternion algebra over Q of discriminant D = p prime and let O c B 
be a maximal order (Eichler order of level M = 1). Show that 


1, if p = 2,3; 

(p — 1)/12, if p = 1 (mod 12); 
#Cls O = 4 (p + 7)/12, if p = 5 (mod 12); 

(p + 5)/12, if p = 7 (mod 12); and 

(p + 13)/12, if p = 11 (mod 12). 


7. Let R be local and O be an Eichler R-order of level p (so O is hereditary, but 
not maximal). Let K be a quadratic F-algebra and $ C K an R-order. Show that 


K 
m(S, O; O*) = 0 if and only if S is maximal and (5) =-l. 


8. Let B be an indefinite quaternion algebra over Q of discriminant D and let O € B 
be an Eichler order. Let S C K be a quadratic order of discriminant ds = df?, 
where d is a fundamental discriminant. 


(a) Suppose O is maximal. Show that there exists an optimal embedding 
S — O if and only if for all p | D, we have (K | p) + 1 and p{ f. 

(b) Suppose O is Eichler of level M and (for simplicity) that N := DM is odd. 
Show that there exists an optimal embedding S$ — O if and only if the 
following conditions hold: 

e For all p | D, we have (K | p) # 1. 

¢ For all p° || M, either (K | p) = 1 or p° | ds. 

[Hint: recall that #Typ O = 1.] 

[It is possible to extend this to all N by analyzing local embeddings at 
p = 2 working over Q2.] 
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9. 


10. 


11. 


12. 


Let F = Q(vd) be a real quadratic field of discriminant d > 0. Show that there 
exists q + 2,3 such that [F (64) : F] = 2 if and only if d = 5, 8, 12. 

Let R be a global ring with field of fractions F, let B be a quaternion algebra over 
F, and let O, O’ C B be maximal R-orders. Let K C B be a quadratic F-algebra in 
B, and suppose that K NO = K NO’; let S be this common, optimally embedded 
quadratic R-order. Show that there exists an invertible fractional S-ideal a C K 
such that Oa = aO’. [Hint: the statement is local, so put the Eichler order ONO’ 
and generator y € S in standard form; show that there exists uy € O* ~ GL2(R) 
and &œy € Sy such that UpQy = Wy generates the normalizer. ] 

In this exercise, we prove a local-global principle for optimal embeddings in a 
self-contained manner. [For a much more precise result, read on to Chapter 31!] 
Let F be a number field with ring of integers R. Let B be a quaternion algebra 
over F and let O C B be an R-order. Let K be a quadratic F-algebra and let 
S C K be an R-order. Suppose that K — B; equivalently, by the local-global 
principle for embeddings (Proposition 14.6.7), every place v € Pl F does not 
split in K. 


(a) Suppose that for all primes p of R, there exists an optimal embedding 
Pp: Sp — Op. Show that there exists an order O’ € Gen(O) (so O’ is 
locally isomorphic to O) and an optimal embedding ¢: S > O’. [Hint: 
choose a maximal order containing S and apply finitely many local cor- 
rections. ] 

(b) Now suppose that #Clg(o) R = 1. Show that there exists an optimal embed- 
ding ¢: S > O. 

(c) Show that parts (a) and (b) follow from the trace formula (Theorem 30.4.7). 


Give a direct proof of Corollary 30.4.23. 


This chapter is licensed under the terms of the Creative Commons Attribution- 
NonCommercial 4.0 International License (http://creativecommons.org/licenses/by- 
nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and 
reproduction in any medium or format, as long as you give appropriate credit to the 
original author(s) and the source, provide a link to the Creative Commons license and 
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Chapter 31 


Selectivity 


In the previous chapter, we saw that (conjugacy classes of) embeddings of a quadratic 
order into a quaternion algebra are naturally distributed over the genus of a quaternion 
order; in applications, we want to compare the number of embeddings over orders in 
a genus. Such a comparison can be thought of as a strong integral refinement of the 
local-global principle for embeddings of quadratic fields (Proposition 14.6.7), which 
belongs to the more general framework of the Albert—Brauer—Hasse—Noether theorem. 

This chapter is quite technical, and it may be skipped on a first reading. To reward 
the reader who persists, we conclude this chapter with the construction of isospectral, 
nonisometric hyperbolic Riemannian manifolds, following Vignéras. 


31.1 Selective orders 


To get a preview of what selectivity is all about, right off the bat we give an example 
of the failure for a quadratic order to embed equitably in the genus of an order. 


Example 31.1.1. Let F := Q(V-5) and R := Zr = Z[V-5]. Then CIR = Z/2Z, and 
the nontrivial class is represented by the ideal p = (2, 1 + V—5) C Zr with p? = 2Zp. 
By class field theory, the Hilbert class field K 2 F is a quadratic extension, and the 
genus theory of Gauss gives K = F (V-1) = F (V5). The maximal order of K is 


Zg = Zp[w] where 
v=9e7(44) - V=1 + V-5 


2 2 


satisfies w? — V—S5w — 1 = 0, a polynomial of discriminant —5 + 4 = —1. 
We take B = M>(F) and the maximal order O = M>(Zp). By 17.3.7, there is a 
bijection Cl R æ Clsp O, with the nontrivial right ideal class represented by 


_(p 0 IP P}. 
i=(6 RJM = ( AF 


R p 
O’ = OL) = (E 4 s 
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its left order is 
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These two orders are not isomorphic and up to isomorphism represent the two types 
of maximal R-orders in Mo(F). 

We claim that there is an embedding Zg — O but no embedding Zg — O’. The 
first part of the claim is easy: taking the rational canonical form, we take the embedding 


wea=(t va): (31.1.2) 


The proof that Zg # O?’ is more difficult. (The embedding (31.1.2) does not 
extend to O’ because of the off-diagonal coefficients; and we cannot conjugate this 
embedding in an obvious way because the ideal p is not principal.) Such an embedding 
would be specified by a matrix 


,_ {a b R p 
” =le -a + V25] £ \po! R 
with 

-— det(a’) = a? — V-5a + bc = 1; (31.1.3) 


so the content in the second claim is that there is no solution to the quadratic equation 
GLI). 

Indeed, suppose there is a solution. Let f(x) = x? — V-5x -1€ Zr[x], so that 
f(a) + be = 0. We may factor bZr = pb with b C Zr and [b] € Cl Zp nontrivial; 
by parity, there exists a prime q | b with [q] nontrivial. Factoring cZp = p™!c with 
c C Zr, we have bcZr = be C q, so f(a) = —bc = 0 (mod q). But f(x) has trivial 
discriminant, and modulo a prime q C Zr, it either splits (into distinct linear factors) or 
remains irreducible. And by the Artin map, f(x) splits modulo q if and only if q splits 
in K if and only if the class [q] € Cl Zp is trivial. Putting these two pieces together, 
we have f(a) = 0 (mod q) and f(x) is irreducible modulo q. This is a contradiction, 
and there can be no solution. 


With this cautionary but illustrative example in hand, we state our main theorem. 
We return to the idelic notation of section 30.4. We will consider embeddings in the 
context of strong approximation (see Chapter 28). 

The following notation will be in use throughout this chapter. 


31.1.4. Let R = Rp be a global ring with eligible set T and let F = Frac R be its field 
of fractions. Let B be a quaternion algebra over F and suppose that B is T-indefinite. 
Let O C B be an R-order. 

Let K 2 F bea separable quadratic F-algebra and let $ © K be an R-order. Suppose 
that Emb(S: ; O) + Q, which is to say, for all primes p C R, the Ry-algebra Sp embeds 
optimally into Op. 


Our struggle will be to understand when the local optimal embeddings glue together 
to give a global optimal embedding. As a start, we know by Corollary 30.4.18 that 
there exists some order O’ € Gen O in the genus of O (i.e., locally isomorphic to O) 
such that Emb(S, O^) + @. 
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Definition 31.1.5. We say that Gen O is genial for S if Emb(S, O’) + @ for all O’ € 
Gen O. If Gen O is not genial, i.e., there exists O’ € Gen O such that 


Emb(S, O’) = 0, 
then we say that Gen O is optimally selective for S. 


By definition, Gen O is genial for S if and only if S embeds optimally in every 
order O’ that is locally isomorphic to O. 


31.1.6. We define the following condition, called the optimal selectivity condition: 
(OS) K is a subfield of the class field Hg ao) of F obtained from Clg n(o) R. 


In particular, if K is not a field, then (OS) does not hold. We now state our main 
theorem, with notation and hypotheses in 31.1.4. 


Main Theorem 31.1.7 (Optimal selectivity). Suppose that O is an Eichler order. Then 
the following statements hold. 


(a) Gen O is optimally selective for S if and only if the optimal selectivity condition 
(OS) holds. 
(b) If Gen O is optimally selective for S, then Emb(S, O’) + 0 for precisely half of 
the types [O’] € Typ O. 
(c) In all cases, 
m(S, O’; O) = m(S, O; O*) 


for all O’ € Gen O whenever both sides are nonzero. 


Since the optimal selectivity condition (OS) only depends on K, if GenO is 
optimally selective for S then it is optimally selective for all R-orders in K. 


Remark 31.1.8. It was first noted by Chevalley [Chev36] in the more general situation 
of matrix algebras that it was possible for a commutative order to embed into some, but 
not all, maximal orders. An approach to selectivity is sketched by Vignéras [Vig80a, 
Théorème III.5.15], but there are some glitches [CF99, Remark 3.4]. Maclachlan 
[Macl2008, Theorem 1.4] gives a proof of Main Theorem 31.1.7(a)—(b) for hereditary 
orders (Eichler orders of squarefree level). For a more detailed literature survey and 
further comments, see 31.7.7. 


31.1.9. When Gen O is optimally selective for S, then we can refine Main Theorem 
31.1.7(b) detecting the half of types of orders for which there is an optimal embedding 
of S. 

From our hypothesis Emb(S, O) + 0, we know that S embeds into some order in the 
genus of O; we might as well take this to be O itself, so we suppose that Emb(S, O) + 0. 
Let O’ € Gen O. Then O’ is connected to O, so O’ = O(I) for an invertible right 
O-ideal J C O. Let 

a=[O:ONO']=[O!:OnNO'. 
Then Emb(S, O’) + 0 if and only if Frobg is trivial in Gal(K | F). 

For example, if a = p is prime, then Froby is trivial in Gal(K | F) if and only if p is 
not inert in K. For maximal orders, we can equivalently formulate the index in terms 
of distance on the Bruhat—Tits tree (see section 23.5 and Exercise 23.9). 
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The core application of the optimal selectivity theorem is the following corollary. 


Corollary 31.1.10. Suppose that Gen O is genial for S. Then 


h(S) Bay ee Be 
OX) = . Ax 

m(S, O; O”) = yoo” ® 0; O”). 

Proof. We combine Main Theorem 31.1.7 and Theorem 30.4.7. o 


We conclude this introduction with a second application, a generalization of Corol- 
lary 28.6.4. Let Q be the set of real, ramified places of B, and let RX o := R* N F% o 
be the subgroup of R* of elements that are positive at all v € Q. 


Corollary 31.1.11. Let O C B be an Eichler R-order. Then 
nrd(O%) = RŽ 


>90" 


Proof. Let u € RZ 0° We repeat the argument of Corollary 28.6.4: we find y’ € O’ 
with nrd(y’) = u and O’ € Gen O. We may suppose further that Gen O is not selective 
for R[y’] by shrinking the open set to ensure that K = Frac R[y’] A Hgno)- Let 
S = KOO’. Then S € O’ is optimally embedded; and by Main Theorem 31.1.7(c), 
there exists an optimal embedding ¢ : S — O, hence ¢(y’) = y € O has nrd(y) = u 


as desired. oO 


31.2 Selectivity conditions 


In this brief section, we make the somewhat opaque optimal selectivity condition (OS) 
explicit for Eichler orders. 


Proposition 31.2.1. Let O be an Eichler order of level W. Then Condition (OS) holds 
if and only if all of the following four conditions hold: 


(a) The extension K 2 F and the quaternion algebra B are ramified at the same 
(possibly empty) set of archimedean places of F; 

(b) K and B are unramified at all nonarchimedean places v € P| F; 

(c) Every nonarchimedean place v € T splits in K; and 

(d) If p C R is a nonzero prime and ordy (W) is odd, then p splits in K. 


Proof. We determine the class field Hgjo) obtained from the group GN(O) = 
Fe 0 nrd(Np.(O)). 

Recall we have G(O) = FZ o nrd(O*) = FZ oR”, since O is an Eichler order 
and therefore locally norm-maximal, so Hg(o) is the maximal abelian extension of F 
unramified away from the real places in Ram(B) and such that the remaining places 
v € T split completely. 

The normalizer nrd(Ngx (O)) is the restricted direct product of local normalizers, 


computed in (23.2.8) for p | D and Corollary 23.3.14 for p { D: for the latter, 
F? RŠ, if ord, (MT) is even; 


rd(Npx(Op)) = 
nrd(Ngx (Op)) e if ordp (M) is odd. 
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Therefore, the quotient Clg(o) R > Clen oy R factors through the quotient by squares 
Cleo) R/(Cleo) R) and then the further quotient by the primes p | D = disc B and 
p | Mt with ord, (Mt) odd. 

We now conclude the proof. If K is not a field then all of the desired conclusions 
hold; so suppose K is a field. Since K — B, if v € PIF ramifies in B then v also 
ramifies in B. A containment K C Hgyno) is permitted at archimedean places if and 
only if the archimedean ramification in K 2 F is no bigger than this. In a similar 
way, the conditions in the previous paragraph establish (c)-(d), and K is unramified 
at all nonarchimedean places v. To conclude (b), if p | D then p splits in HG n(o) and 
therefore in K; but we are assuming that K —> B so Ky —> By, a contradiction since 
By is a division algebra, and there can be no such p. o 


31.3 = Selectivity setup 


We now embark on a proof of the selectivity theorem (Main Theorem 31.1.7); this goal 
will occupy us for the remainder of this chapter. In this section, we begin to isolate 
the problem: there is a group that is at worst Z/2Z and is usually trivial, and we pin it 
down using strong approximation, the reduced norm, and class field theory. Our basic 
reference is Vignéras [Vig80a, Théorème III.5.15], and the surrounding text. 

Our notation is as in 31.1.4. 


31.3.1. To establish the main theorem in the case where K ~ F x F is straightforward. 
We leave this case as an exercise (Exercise 31.1). 


We suppose throughout the rest of this chapter that K is a field. 
Let O! < T < Ngx(O) (as in 30.3.10). Recall that there is a bijection (30.3.13) 
Emb(S,O;T) e K*\E/T 
where 
E = {B € BX: KÊ NO = SÊ} 

and we abbreviate conjugation K? = B-'K for conciseness. Conjugating if necessary, 
we may suppose that | € E, i.e., we start with an order and an optimal embedding 
KNO=S. 


We employ idelic notation as in section 30.4. The inclusion B* — Bx gives an 


inclusion eee 
E/T => E/T (31.3.2) 


with 
E := {Be BX: RË n O = P}. 


The hypothesis of strong approximation allows us to identify precisely the image 
of the map (31.3.2) via the reduced norm in the following way. 


31.3.3. As in Theorem 28.5.5 (a motivating application of strong approximation), the 
reduced norm induces a bijection 


BX\BX/T > FX 9\F*/nrd(L) = Clea) R (31.3.4) 


where G(T) := FX 9 nrd(I). 
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Lemma 31.3.5. We have 

EJT = {8T € E/T: nrd(B) € GI} C E/T. 
That is to say, if B € E, then there exists p € E such that Br = br if and only if 
nrd(B) € G(T). 


Proof. We find a 8 € B* (without the condition that 8 € E) immediately from the 
bijection (31.3.4). But B = By € BI and T < Nax(O) gives 


KP aO = KP nO = 8% 
and intersecting with B we find £ € E. o 


Lemma 31.3.5 points the way more generally, at least to detect if there is an 
embedding in the first place in an order. First, we need to give representatives of the 
type set. 


31.3.6. Recalling 28.5.7, there is a bijection 
Typ O © B*\B*/Np.(O); 
explicitly, every isomorphism class of order in Typ O is of the form 
O' =707 'NB=O0" NB 

(yes, the choice of inverse is deliberate), with the class of Y € B* in BX\B*/N a (O) 
uniquely defined. 

In the presence of strong approximation (Corollary 28.5.10), we have a further 
bijection 

Typ O eo CIGNO) R 


where 


GN(O) = G(Ngx(O)) = FX o nrd(Nz.(O)). 


>Q 


Now we look back at embeddings and reduced norms. 


31.3.7. Let 


nrd(E) = {nrd(B) : B € E} c F>. (31.3.8) 


The set E does not obviously have a group structure, so nrd(E y= {nrd(B) : B cE } 
is a subset. But Ê is stable under K>, so nrd(E) i is a union of cosets of nrd(k™). 

The set nrd(E ) is quite large, because it contains reduced norms from K KX c E and 
Npx(O) CE. 


31.3.9. By the main theorem of class field theory (Theorem 27.5.10), the Artin map 
gives a bijection 


E*/F* Nmxip(K*) © Gal(K | F) = Z/2Z. (31.3.10) 
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Let S C Tbe the set of archimedean places of F. Recall the isomorphism (27.5.8) 
PMP’ FZ, oo? Fg lF eo: 
We have 
F* 


3 >0 £ NMgr (KZ) < Nmxir(K*) 


(the latter embedded at the infinite place with the other components 1), and by the same 
argument as in Lemma 27.5.6, the image of F* Nmgjr(K*%) under the isomorphism 
(27.5.8) is F% o Nmxir (Kg) where È C PI(F) is the set of places ramified in K (going 
from real to complex in the extension F C K) and 


F% o `= {a € F” : v(a) > 0 for all real v € Z}. 


> 


Therefore we have an isomorphism 
F*/F* Nmx\r(K*) > Fs |F% o Nmx\F (Kg). (31.3.11) 
We then further project from the target of (31.3.11) to F* = F. 7 to obtain the map 


E*/F* Nmgir(K*) > F” /F* o Nmxir(K*). (31.3.12) 


Lemma 31.3.13. We have P pi 
F% o Nmxir(K”) < F* 


with total index at most 2, and index equal to 2 if and only if every nonarchimedean 
place v € Tis split in K. 


Proof. In the projection (31.3.12), we start with a group of order 2; in order to keep 
it this size, the projection away from the nonarchimedean places in T must be an 
isomorphism, which holds if and only if for all nonarchimedean places v € T we must 
have v split in K. o 


We conclude this setup section with an overview. 


31.3.14. Selectivity arises from an examination of layers in the following selectivity 
sandwich: 
~, (OS) -~ ~ m os OS ra 
Foo nrd(K*) < F% 0 nrd(K”) nrd(Ng.(O)) < F% 0 nrd(E) < F* (31.3.15) 


In terms of the sandwich bread, the left-most group F% , nrd(K*) has index at most 


2 in the right-most group FX by Lemma 31.3.13. The set Ee nrd(E ) is stable under 
multiplication by F% o nrd(Ķ*) so is a union of its cosets in F*; therefore, it is actually 
a subgroup. 

Much ado about a (possible) group of order two! 


Again by Lemma 31.3.13, the sandwich collapses if there is a nonarchimedean 
place v € T that is inert in K 2 F, so there is only work to do when every v € Tis 
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split. Under this assumption, we will show in Lemma 31.4.1 that the first inequality 
labelled (OS) is an equality if and only if the optimal selectivity condition (OS) holds. 
In Propositions 31.5.1 and 31.5.7, we will show that the middle inequality labelled m 
is always an equality and that such an equality implies equality of embedding numbers 
(when they are nonzero). Last but not least, in Proposition 31.4.4 we will show that the 
final inequality labelled s is an equality if and only if there is no selectivity obstruction, 
i.e., Emb(S, O’) + @ for all O’ € Gen O. 


31.4 =x Outer selectivity inequalities 


In this section, we consider the outer ends of the selectivity sandwich 31.3.14. 
The left-most inequality is interpreted in the language of class field theory as 
follows. 


Lemma 31.4.1. We have 
FX o nrd(K*) < F% o nrd(K*) nrd(Np.(O)) (31.4.2) 


with index at most 2, and equality holds if and only if either the optimal selectivity 
condition (OS) holds or there exists a nonarchimedean place v € T inert or ramified 
in K. 


Proof. Recall that GN(O) = F% 0 nrd(N. 'zx(0)). In the side sandwich 


FX) nrd(K*) < F% o nrd(K*) nrd(Ngx(0)) < F* (31.4.3) 


we again have total index at most 2. By class field theory and the Galois correspondence 
relative to the corresponding tower of class groups, we have 


FX o nrd(K*”) = FX o nrd(K™) nrd(Ngx(0)) < F” 
(so first equality, then strict inequality in (31.4.3)) if and only if K C Hen) if and 
only if (OS) holds. The result then follows by Lemma 31.3.13. oO 


We next consider the right-most inequality, and we show that it contains the obstruc- 
tion to selectivity. 


Proposition 31.4.4. Let [O’] € Typ O be represented by the class BXVN 3x0). Then 
Emb(S, 0’) + 0 if and only if nrd(¥) € FX o nrd(E). 


Proof. Define 
E’ = {8' € BX: KË nO’ =S?} 
so that Embg(S, O’) — K*\E’, and similarly E. 
Suppose Emb(S, O’) + 0, represented by 8’ € E’. Then 


RP nO = RP naO = SF 
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so 8'7 = B € E. Therefore 
nrdO) = nra(8"') nrd(B) € FX o nrd(E). 
Conversely, suppose that nrd(v) € F% o nrd(E); then there exists a € F% 0 and 
B € E such that nrd(v) = a nrd(£). Since B € E, we have 
RÊ nÔ = 
thus if B’ = B7! we get 
RP nO = RË nÔ = (31.4.5) 
and 8’ €e E’. We have 
nrd(ĝ’) = nrd(Bv"!) = a7! € FX o 
So by Lemma 31.3.5, there exists 6’ € E’ mapping to B, and Emb(S, O’) + @ as 


claimed. o 


The following corollary indicates the significance of the preceding proposition. 


Corollary 31.4.6. If F> o nrd(E) = F*, thenEmb(S, O’) + Oforallorders O' € Gen O 


in the genus of O. Otherwise, F% o nrd(E) < F* has index 2 and Emb(S, O’) + 0 for 

precisely half of the types of orders in Typ O: we have Emb(S, O’) + 0 for 
O'=7OF'NB 

if and only if nrd(V) € FX, nrd(K*). 


>Q 


In particular, in the latter case we have # Typ O even. 


Proof. We apply Proposition 31.4.4, with indexing of the type set as in 31.3.6. o 


31.5 » Middle selectivity equality 


In this section, we pursue the middle inequality in the selectivity sandwich 31.3.14. 
First, we show that equality in this middle equality implies equality of embedding 
numbers, whenever they are nonzero. 


Proposition 31.5.1. We have 
FX o nrd(K™) nrd(Ng.(O)) < FX 


>QC 


ynrd(E) (31.5.2) 


with index at most 2. If equality holds in (31.5.2), then whenever O’ € Gen O and 
Emb(S, O’) is nonempty, we have 


m(S, O; O*) = m(S, O’; O”*). 
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Proof. The statement about index follows from the layering of the sandwich (31.3.15). 
For the second statement, suppose that Emb(S, O’) is nonempty; then by Proposition 
31.4.4, we have O’ = O" AB with nrd(y) € F% o nrd(£). If equality holds in (31.5.2), 


then there exists a € F% o @ € K*, and 7 E€ Nex (O) such that 
nrd(v) = anrd(@) nrd(7). (31.5.3) 


We restore notation from Proposition 31.4.4, and modify the argument in the 
converse. We define the map 


E> E 
B> =apir! 


We argue as in (31.4.5). From £ € E, we have K® n O = SP. We have @ € K*, so 
K? = K. And f € Ngx(O), so O7 = O. Therefore 


(31.5.4) 


RP naO = KP nO = (31.5.5) 
and indeed f’ € F’. Finally, by (31.5.3) 
nrd(A’) = nrd(@Bnv') = a7! nrd(B) € FX 9. 


By Lemma 31.3.5, there exists 6’ € E’ such that p'O% = BO”, well-defined up to 
O*. Therefore, (31.5.4) descends to a map E — E’/O’”, and it further descends to a 
map 

E/O* > E’/O” (31.5.6) 
BO% > p'o% T 


because 
aBunv 'O' = @BunOv! = @BuONT | = @BOny | = EGY O. 


This map works as well interchanging the roles of O and O’, and after a little chase, 
we verify that the map (31.5.6) is bijective. Taking orbits under K* on the left, we 
conclude the proof. o 


In fact, equality holds in the middle for Eichler orders. 
Proposition 31.5.7. If O is an Eichler order, then the inequality (31.5.2) is an equality. 


Proof. We will prove that 
nrd(K*) nrd(N.(O)) = nrd(E); (31.5.8) 


the inclusion (<) was direct, so we prove (=). The desired equality is now idelic, so 
we reduce to checking in the completion at a prime p. If By is a division algebra, then 
Oy, is maximal, and m(Sp, Op; Ngx (O,)) = 1 by Proposition 30.5.3(b) we have the 
stronger equality E, = Npx (Op) = By. 
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Otherwise, By ~ M2(Fp) is split. Without loss of generality, we may suppose 
that Op is a standard Eichler order. Further, by Lemma 30.6.3, after conjugating 
by a normalizing element if necessary, we may suppose that the reference optimal 
embedding Sp —> Oy is normalized. But then this embedding extends to an optimal 
embedding Sp <> M2(R,): the upper-right entry is 1. 

Now let p € Ep be arbitrary, with associated embedding gh : Sp — Op. We 
repeat the argument in the previous paragraph: by Lemma 30.6.3, replacing By by By Vp 
if necessary with vp € N BX (Op), we may suppose that gh P is normalized, and therefore 
extends to an optimal embedding into M2(Rp). But by Proposition 30.5.3(a), we have 
m(Sp, M2(Rp); GLo(Ry)) = 1—all optimal embeddings into M2(Rp) are conjugate 
under GL2(Rp )—so there exists up € GL2(Rp) such that ge = Øp- Therefore By € 
HpK so 


nrd(Bp) € nrd(K,)R; < nrd(K%)nrd(O%) < nrd(Ky) nrd(Npx (Op)) (31.5.9) 


as claimed. o 


31.6 =» Optimal selectivity conclusion 
We now officially complete the proof of the selectivity theorem for Eichler orders. 


Proof of Main Theorem 31.1.7. By 31.3.1, we may suppose K is a field. We refer to 
the selectivity sandwich (31.3.15), using Proposition 31.5.7 to simplify the middle 
equality: 


~ (OS) = ba ~ ss 
Feo nrd(K*) < F% o nrd(K*)nrd(N5x(0O)) = F% 0 nrd(E) < F*, 


with total index at most 2. 

By Proposition 31.4.4, we have that Gen O is optimally selective for S if and only 
if the right-most inequality (labelled s) is strict. Such an inequality is strict if and only 
if the total index is 2 and the left-most inequality is an equality. By Lemma 31.4.1, this 
happens if and only if the condition (OS) holds. This proves (a). 

Statement (b) is a restatement of Corollary 31.4.6, and statement (c) follows from 
the second statement in Proposition 31.5.1. Oo 


It has been a long and pretty technical road, so as refreshment we work through an 
example (cf. Maclachlan [Macl2008, §4, Example 1]). 


Example 31.6.1. Let F be the totally real cubic field Q(b) where b? — 4b — 1 = 0; then 
F has discriminant 229 and ring of integers R = Zr = Z[b]. The usual class group 
CIR is trivial, but the narrow class group is Cl* R ~ Z/2Z, represented by the ideal 
p = (b+ 1)Zp of norm 2—the ideal p is principal, but there is no generator that is 
totally positive. The narrow class field K = H* 2 F is quadratic, with Ht = F(Vb). 


Let B = | F . Then b is positive at precisely one real place and negative at the 


other two, and b € Z.. Computing the Hilbert symbol at the even primes, we conclude 
that Ram(B) is equal to two real places. In particular, B is indefinite. The class group 
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Clg) R with modulus equal to these two real places is equal to Cl* R, as we see by 
the real signs of b. 

Next, we compute representatives of the type set of maximal orders for B. By 
strong approximation (Corollary 28.5.10), we have Typ O in bijection with Cle no) R, 
so we need to compute the idelic normalizer: but B is unramified at all nonarchimedean 
places, and 

NR (O) = Nor PMR) = F*O*. 


Thus nrd(NVz.(O)) = P2R*, and 


GN(O) = FX _,F°?R* = FX „R* = G(O). 


>90 >90 
In other words, the quotient map Clg(q) R > Clen(o) R is an isomorphism, still a 
group of order 2. We conclude that # Typ O = 2. 
We compute a maximal order 
i+ j b +ij 
OZ : 
2 roa? 


We conjugate this order by an ideal of reduced norm p to get the second representative 


b 
O = O1 = Zr @® Zfi @ Zf 


(b? +b+1)+(b+1)i+j (b+1)+(b+b+1)i+ij 
—— = eo 


Therefore these orders represent the two types of maximal orders, and TypO = 
{[O1], [O2]}}. 

With all of these elements in place, we can observe selectivity (Main Theorem 
31.1.7). We saw that both K and B are ramified at no nonarchimedean places and 
exactly the same set of real places. In particular, the field K —> B embeds by the local- 
global principle. Let S = Zg = Zr[w] be the maximal order in K. Then w*—bw+1 = 0. 
Then Emb(S; O) + 0 (Proposition 30.5.3(a)). 

The optimal selective condition (OS) holds because we took it so, K = H*. It 
follows that S embeds in exactly one of O; or O2. We find that 


O2 = ZF OZri O ZF 


_ bP +ij 
~ 2b 


satisfies a7 — ba + 1 = 0 as desired; so S$ embeds in O; (and not O3). 


€ Oi 


Remark 31.6.2. Without the hypothesis of strong approximation, it is very difficult to 
tease apart the contributions from different orders in the genus: indeed, the generating 
series for representation numbers for a definite quaternion order give coefficients of 
modular forms, discussed in Chapter 41. 


31.7 Selectivity, without optimality 


To conclude this chapter, we compare the above with a weaker condition than optimal 
selectivity, and close with connections to the literature. We continue notation and 
hypotheses from 31.1.4. 


31.7. + SELECTIVITY, WITHOUT OPTIMALITY 581 


Definition 31.7.1. We say that Gen O is selective for S if there exists O’ € Gen O such 
that there is no embedding S — O’ of R-algebras. 


The difference between Definition 31.1.5 and Definition 31.7.1 is that in the latter, 
we do not insist that the embedding is optimal. It may happen that Gen O is selective 
for S, but Gen O is not optimally selective for S: such a situation arises exactly when 
there is an order O’ € Gen O such that S embeds in O’ but does not optimally embed 
in O’. 


Example 31.7.2. We return to Example 31.1.1. We saw that Gen O is optimally selec- 
tive for the maximal order Zg. By Main Theorem 31.1.7, Gen O is also optimally 
selective for S = Zr[V-1] € Zr. 

We claim that Gen O is not selective for S. ForO = M2(Zx), we take the normalized 
embedding 


The surprise is that we can also find an embedding S => O’ = (i A just not an 


optimal one: we take 


-vV-5 2 ) (31.7.3) 


wera’ = 2 vos)" 


(The argument given in Example 31.1.1 no longer applies, because the polynomial 
x? — 1 has nontrivial discriminant, giving just enough room for the prime p to sneak 
in.) 

For sanity (to show there is no contradiction with the main theorem of optimal selec- 
tivity), we confirm that the embedding (31.7.3) does not define an optimal embedding 
into O’. We have 2Zr = p?, so 


1- V-5 2 
2 1+ V-5 


so the order R + p-!(a’ + 1) 2 R + Ra’ = S embeds in O’. 


œ +1= € pO’ 


31.7.4. If Gen O is optimally selective for S but not selective for S, then S C O is 
optimal but there is an order O’ such that ¢’: S —> O’ is an embedding but not an 
optimal embedding. Let S’ = ¢’(K) MO’ 2D S. So there exists a prime p | [S’ : S]r, 
and in particular, S is not maximal at p. In particular, if S is integrally closed, then 
Gen O is selective for S if and only if Gen O is optimally selective for S. 


Theorem 31.7.5 (Chinburg—Friedman, Chan—Xu, Guo-Qin). Let O be an Eichler 
order of level IN and suppose that Gen O is optimally selective for S. Then Gen O is 
selective for S if and only if the following condition holds: 


(S) Jf p | discr S and ordp (W) # ordy(discr S), then p splits in K 2 F. 
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Proof. Our very setup (section 31.3) is designed to count optimal embeddings, so 
to avoid lengthening this chapter, we refer the reader to Chinburg—Friedman [CF99, 
Theorem 3.3] for the case of maximal orders, and Chan—Xu [CX2004, Theorem 4.7] 
and Guo-Qin [GQ2004, Theorem 2.5] (independently) for Eichler orders. o 


Remark 31.7.6. The condition Proposition 31.2.1(d) (one part of (OS) for Eichler 
orders) is not visible in the selectivity theorem for Eichler orders, but is implied by it: 
by (b), the extension K 2 F is unramified so p is unramified in K, thus ord, (discr(S)) 
is even and necessarily not equal to ordp(9) if the latter is odd. 


31.7.7. Chinburg—Friedman [CF99, Theorem 3.3] prove Theorem 31.7.5 for maximal 
orders, and they applied this theorem to embeddings in maximal arithmetic groups 
[CF99, Theorem 4.4]. Chinburg—Friedman proved their results in the language of the 
Bruhat-Tits tree of maximal orders. This selectivity theorem was then generalized to 
Eichler orders by Chan—Xu [CX2004, Theorem 4.7] and Guo—Qin [GQ2004, Theorem 
2.5] (independently). Interestingly, while Guo—Qin follow Chinburg—Friedman in their 
proof, Chan—Xu instead use results on exceptional spinor genera and their results are 
phrased and proven in the language of indefinite integral quadratic forms. (These results 
are given for number fields, but the proofs adapt to global fields as pursued here.) 

Some selectivity theorems beyond those for Eichler orders are also known. Arenas- 
Carmona [A-C2013, Theorem 1.2] considers more general intersections of maximal 
orders. Linowitz [Lin2012, Theorems 1.3—1.4] gives a selectivity theorem for (opti- 
mal) embeddings into arbitrary orders, subject to some additional technical (copri- 
mality) hypotheses. More generally, selectivity theorem have been pursued in the 
more context of central simple algebras: see e.g. Linowitz—Shemanske [LS2012] and 
Arenas-Carmona [A-C2012]. 

However, these selectivity results do not prove Main Theorem 31.1.7 on the nose, 
either because they deal with selectivity instead of optimal selectivity or do not prove 
the more powerful statement that the embedding numbers are in fact equal. On the 
latter point, a general setup to establish equality of embedding numbers can be found 
in work of Linowitz—Voight [LV2015, §2]. 


31.8 = Isospectral, nonisometric manifolds 


We conclude with an application to geometry. We need to borrow from the future, so 
the reader is invited to read this section lightly, using it as present and future motivation; 
and then to return to this section after a more careful reading of Chapter 38. 

In 1966, Kac [Kac66] famously asked: “Can one hear the shape of a drum?” Put 
another way, if you know the frequencies at which a drum vibrates, can you determine 
its shape? This beautiful question has led to an almost countless number of articles: 
see Giraud—Thas [GT2010] for a survey. 

To restate the question in a mathematical framework, let M be a connected, compact 
Riemannian manifold. Associated to M is the Laplace operator, defined by A(f) := 
— div(grad(f)) for f € L?(M) a square-integrable function on M. The eigenvalues of 
A on the space L*(M) form an infinite, discrete sequence of nonnegative real numbers 
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0 = Ap < 41 <A. <..., called the spectrum of M. By the study of the wave equation, 
the spectrum of M can essentially be thought of as the frequencies produced by a drum 
“in the shape of M”. Two Riemannian manifolds are said to be Laplace isospectral 
if they have the same spectra. We are led to ask: if two Riemannian manifolds are 
Laplace isospectral, are they isometric? 

A general algebraic method for constructing Laplace isospectral manifolds is due 
to Sunada [Sun85] (see the surveys by Gordon [Gor2000, Gor2009]), arising from 
almost conjugate subgroups of a finite group of isometries acting on a manifold— 
providing a negative answer to Kac’s question by Gordon—Webb—Wolpert [GWW92]. 
But preceding Sunada, in 1980 Vignéras [Vig80b] constructed such examples—indeed, 
one of the applications in her book on quaternion algebras [ Vig80a] was to explain this 
construction! 


Theorem 31.8.1 (Vignéras). For every m > 2, there exist Laplace isospectral and 
nonisometric manifolds of dimension m. 


We sketch a proof of this theorem in this section, with attention to the particular 
detail of selectivity. In 1994, Maclachlan—Rosenberger [MacRos94] claimed to have 
produced a pair of Laplace isospectral, nonisometric hyperbolic 2-orbifolds of genus 
0, but then Buser-Flach-Semmler [BFS2008] later showed that these examples were 
too good to be true! The subtle issue they found: the phenomenon of selectivity. 

Our construction is quaternionic, of course. We consider the situation of sections 
38.2-38.3, specifically the setup in 38.2.1 and 38.3.1. Let F be a number field with r 
real places and c complex places, so that [F : Q] = r + 2c =n. Let B be a quaternion 
algebra over F and suppose that B is split at t real places. We have an embedding 
(38.2.2) 1: B => M2(R)' x Mo(C)°. Letting H := (H*)! x (H3)° as in (38.2.9) and 


PBX, := BXo/F*, 


we see that uPBS,) < Isom*(H) acts on H by linear fractional transformations as 
orientation-preserving isometries. 

Let R = Zr be the ring of integers of F and let O C B be an R-order. Then the 
group T!(O) := (O! {+1} < PB*,, is a discrete subgroup acting properly on H by 
isometries. Suppose now that B is a division algebra, and further that the only elements 
of O! of finite order are +1. (We will soon see that this assumption can be satisfied; in 
some sense, it is quite typical.) Let X '(O) := T!(O)\H be the quotient, a Riemannian 
manifold of dimension m := 2t + 3c. 

The condition of Laplace isospectrality is implied by the following arithmetic 
condition in terms of conjugacy classes of embeddings. 


Theorem 31.8.2. Let O’ € Gen O. Suppose that for every quadratic field K 2 F and 
every quadratic R-order S C K, we have the equality of embedding numbers 


m(S, O; O') =m($,0';0"). (31.8.3) 
Then X! (O) and X! (O') are Laplace isospectral. 
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Proof. See Vignéras [Vig80b, Théorème 6]. The statement there is in terms of all 
embeddings, not just optimal embeddings—but the total count of conjugacy classes 
of embeddings of an quadratic order is a sum of the corresponding count of optimal 
embeddings of superorders (as in 31.7), so it is sufficient to have a genial order. 

The key ingredient in the proof is the Selberg trace formula, which allows us 
to show that the spectra of the Laplace operators agree by the stronger condition of 
representation equivalence: for a more general point of view on this deduction, see 
Deturck—Gordon [DG89]. o 


The rub is in the equality (31.8.3). The restriction of the equivalence classes to 
units of reduced norm 1 is harmless: by Lemma 30.3.14, we have m(S,O;0!) = 
m(S, O; O% )[nrd(O*) : nrd(S*)], and as a consequence of strong approximation we 
have [nrd(O*) : nrd(S%)] = [nrd(0* : nrd(S*)]. Thus if O’ € TypO, then (31.8.3) 
holds if and only if m(S, O; O*) = m(S, O’;O”). 

Finally, selectivity enters! We suppose that O is an Eichler order. By Main Theorem 
31.1.7(c), we have the desired equality when Gen O is genial (i.e., not optimally 
selective)—and by part (b), this equality may fail for some S. 


Corollary 31.8.4. Suppose that Gen O is genial and O’ € Gen O has O’ # O. Suppose 
further that o(Ram(B)) + Ram(B) for all © € Aut(F). Then X'(O) and X'(O’) are 
Laplace isospectral, nonisometric Riemannian manifolds of dimension m. 


Proof. Xsospectrality follows from Theorem 31.8.2 with Main Theorem 31.1.7. To 
show that X, X’ are not isometric, since O’ # O we know that PO’ is not conjugate 
to PO in B*, but we need this for the groups '!(O’),'!(O) in Isom*(H), which is 
slightly larger (see Remark 38.2.11). We leave the details to Exercise 31.3 (or see 
Linowitz—Voight [LV2015, Proposition 2.24]). o 


The remainder of the proof of Theorem 31.8.1 involves finding suitable data F, B, O. 
We exhibit a pair for m = 2 following Linowitz—Voight [LV2015, Example 5.2], giving 
a pair of compact hyperbolic 2-manifolds of genus 6 which are isospectral but not 
isometric. 


Example 31.8.5. Let F = Q(w) where w is a root of the polynomial x4—5x?—2x+1=0. 
Then F is a totally real quartic field with discriminant dr = 5744 = 24359, Galois 
group S4, class number #C1 R = 1 and narrow class number #CI* R = 2. Let B be 
the quaternion algebra over F which is ramified at the prime ideal pı3 of norm 13 
generated by b := w? — w? — 4w and three of the four real places, with split place 


b 
w > —0.751024...: then B = (=) where a = w? — w? — 3w — 1 is a root of 


x4 + 8x3 + 12x? - 1. 
A maximal order O C B is given by 
(w° + paid) Fe a Di +i p, 


O has type number 2, so there exists two isomorphism classes of maximal orders 
O; = O and Op. 


O=RORi® 
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We claim that O; and O> have no elements of finite order other than +1. Indeed, if 
we had such an element of order q then F(2,) is a cyclotomic quadratic extension of F, 
whence Q(é2,)* © F; but F is primitive, so the only cyclotomic quadratic extensions 
of F are K = F(V-1) and K = F(V-3). But as pı3 splits completely in F(W—1) and 
F(V-3), neither field embeds into B. We conclude that the groups T; are torsion free. 

Since B is ramified at a finite place, the genus of O is genial by Proposition 31.2.1, 
and since Aut(F) is trivial, the hypothesis of Corollary 31.8.4 are satisfied: X!, X} are 
Laplace isospectral, but not isometric. 

Finally, by Theorem 39.1.13, we have area(X;) = 207, so g(X;) = 6 for i = 1,2. 
Fundamental domains for these are given in Figure 31.8.6. 


Figure 31.8.6: Fundamental domain for the genus 6 manifolds X (Te (TC 1) 


We obtain a second example by choosing the split real place w œ> —1.9202..., and 
since F is not Galois, as in the case of the 2-orbifold pairs 2 and 3, these are pairwise 
nonisometric. 


For an example with m = 3, see Exercise 31.4. 


Exercises 


> 1. Prove Main Theorem 31.1.7 in the case K = F x F: to be precise, show that an 
R-order S C F x F embeds equally in all Eichler R-orders. /Hint: we must have 
B =~ M2 (F), so reduce to the case where S is embedded in the diagonal and then 
conjugate. ] 


2. The following exercise gives insight into the proof of Theorem 31.7.5 on selec- 
tivity. Let R be local, and let O be an Eichler order of level pê. 


Let ¢: S — O be an optimal embedding that is normalized and associated to 
x € R, so represented by 


z x 1 
NEG t-x 
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as in Definition 30.6.8. 


(a) Compute v-!av for the matrix 


a b 
v= (2 4): 
(b) Show that the off-diagonal entries are equal to 
(ad — bc)! Nmgjr (by — (bx + d)), 
(ad — be)! Nmxgır (ay — (ax + c)) 


so belong to (det v)~! Nmgjr (K*). 


(c) Suppose a’ = v-!ay gives a normalized, optimal embedding. Show that if 
p is inert in K 2 F, then det v has even valuation. 


> 3. With notation as in section 31.8, let T, I” < B*/F* andı: BX,/F* — Isom*(H). 


Then X(T), X(T”) are isometric if and only if there exists a permutation o of 
the factors of G := Isom*(H) and v € 1(B*) such that o (T) = vI’v7! and a 
Q-algebra automorphism T: B — B such that the diagram 


BX —___1_» B* 


G—"+G—=+G 


commutes, where v acts on G by conjugation. [Hint: use the Skolem—Noether 
theorem. | 


. In this exercise, we exhibit Laplace isospectral, nonisometric hyperbolic 3- 

manifolds, following Linowitz—Voight [LV2015, Example 6.3]. Let F = Q(w) 

be the quintic field with discriminant —43535 and defining polynomial x° — x* + 

3 
3x3 — 3x + 1. Let B := (>). 
F 

(a) Show that B is ramified at the three real places of F and the prime ideal 
p= (wt — w°? + 3w? — w — 2) of norm 13. 

(b) Let O be a maximal order in B. Show that the type number of O is equal 
to 2. 

(c) Let O = O; and Oy be representatives of Typ O. Show that the only 
elements of finite order in O} are +1. 

(d) Show that Gen O is genial. 

(e) Let x := [!(O,)\H°, with notation as in section 31.8. Show that x} and 
X; are Laplace isospectral, nonisometric 3-manifolds. 

(f) Show that vol(X/) = vol(X3) = 51.024566... 
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Chapter 32 


Unit groups 


Having moved from algebra and arithmetic to analysis, and in particular the study of 
class numbers, in this part we consider geometric aspects of quaternion algebras, and 
the unit group of a quaternion order acting by isometries on a homogeneous space. 


32.1 > Quaternion unit groups 


By way of analogy, we consider what happens for quadratic orders. In this case, just as 
with class groups, the behavior of unit groups is quite different depending on whether 
the asociated quadratic field K is real or imaginary. 

In the imaginary case, the unit group is finite, as the norm equation Nmxjq(y) = 1 
has only finitely many solutions for integral y: these are elements of a 2-dimensional 
lattice in C that lie on the unit circle. Such an element is a root of unity that satisfies 
a quadratic equation over Q, and so only two imaginary quadratic orders having units 
other than +1 are the Gaussian order Z[i] of discriminant —4 and the Eisenstein order 
Zip] with p := (-1+ v-3)/2 of discriminant —3: see Figure 32.1.1. 


. . P =p , 
. . . —1 1 . & l1 >» 
š è 2 a 


Figure 32.1.1: Units in imaginary quadratic orders 


Orders O in a definite quaternion algebra B over Q behave like orders in an 
imaginary quadratic field. The unit group of such an order is finite, as the solutions 
to nrd(@) = 1 with œ € O are elements of a 4-dimensional lattice in R* again with 
bounded size. In section 11.5, after a close investigation of the case of Hurwitz units, 
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we classified the possibilities, embedding O*/{+1} = HH! /{+1} ~ SO(3) as a finite 
rotation group: the groups O* that arise over Q are either cyclic of order 2, 4, 6, 
quaternion Qg of order 8, binary dihedral 2D¢ of order 12, or the binary tetrahedral 
group 2T of order 24. In this chapter, we take up this task in the context of a general 
definite quaternion order, and realize all finite rotation groups using quaternions. 

Now we turn to real quadratic fields and correspondingly indefinite quaternion 
algebras. For the real quadratic order Z[vVd ] with d > 0, the units are solutions to the 
Pell equation Nmgjo(x — yVd) = x? - dy? = +1 with x, y € Z. All solutions up to sign 
are given by powers of a fundamental solution which can be computed explicitly using 
continued fractions; consequently, Z[Vd]* = (-1,u) = Z/2Z x Z where u = x + yVd 
is the fundamental unit. The fundamental unit often (but not always) has large height 
(in the sense that x, y € Z are large in absolute value), being of exponential size in the 
discriminant, by theorems of Schur and Siegel. The unit group of the ring of integers of 
Q(Vd) for d = 1 (mod 4) is treated in a similar way, by considering the norm equation 
x = xy + cy? = +] where c = (1 — d)/4. 

For quaternions, we are led to consider units in the standard order 


„b 
O:=Ze@ZieZj eZij C B:= (3) 


in an indefinite quaternion algebra, with a, b € Z and say a > 0. The norm condition 
nrd(y) = +1 for y = t + xi + yj + zij then reads 


t? — ax? — by’ + abz? = +1 (32.1.2) 


witht, x, y,z € Z. Amusingly, this “quaternion Pell equation” includes the Pell equation 
for Z[-Va] by setting y = z = 0, and in fact by considering embeddings of quadratic 
orders (the subject of Chapter 30), we see that this equation combines all Pell equations 
satisfying certain congruence conditions. Combining these Pell equations, we see that 
the group of solutions is an infinite, noncommutative group. (The case of an order 
different from the standard one will give a different norm equation, but the same 
conclusions.) See Jahangiri [Jah2010] for a Diophantine interpretation of the structure 
of the unit group of a quaternion order as a quaternionic Pell equation. 

We will seek to understand the group O% by its action on a suitable space, and in 
this way we are led to consider groups acting discretely on symmetric spaces; we will 
discover that the group O% is finitely presented and in particular finitely generated, 
so we still can think of a set of fundamental solutions (given by generators) whose 
products generate all solutions to (32.1.2). For example, we may take O = M2(Z) € 
M2(Q), where O* = GL2(Z), generated by the elementary matrices ; i ; , 
Accordingly, our investigation is detailed but fruitful, involving the theory of Fuchsian 
and Kleinian groups. 

In this chapter, we begin by discussing the general structure of these groups. 


32.2 Structure of units 


Throughout this chapter, we use the following notation, recalling our notation for global 
fields from section 14.4. Let F be a global field, let S c PI(F) be an eligible set of 
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places of F, and let R = Rs be the global ring associated to S, the ring of S-integers 
of F. (As always, the reader may keep the case F = Q, S = {oo}, and R = Z in mind.) 
Further, let B be a quaternion algebra over F, and let O C B be an R-order. 

We are interested in the structure of the group O*. Since Z(B*) = F*%, we have 
R* < Z(O%) central. We understand the structure of R* by Dirichlet’s unit theorem, 
as follows. 


32.2.1. From Dirichlet’s unit theorem (and its extension to S-units and the function 
field case), the group R* of units is a finitely generated abelian group of rank #S — 1, 
so that 

R* = Z/wZ@Z**! (32.2.2) 


where w is the number of roots of unity in F. (The proof is briefly recalled in 32.3.1.) 


The group O% is (in general) noncommutative, so we should not expect a description 
like 32.2.1. But to get started, we consider the quotient O*/R*, and the reduced norm 
map which gets us back into R*. 


32.2.3. We recall the theorem on norms (see section 14.7): as before, let 
Q := {v € Ram B: v real} C PIF (32.2.4) 


be the set of real ramified places in B (recalling that complex places cannot be ramified), 
and 
FX, := {x € F* : v(x) > 0 for all v € Q} (32.2.5) 


>Q0 * 
the set of elements that are positive at the places v € Q. If F is a function field, then 
Q = 0, and RŽ o = R*. The Hasse-Schilling norm theorem (Main Theorem 14.7.4) 
says that nrd(B*) = F% o Letting 
RŽ o := R* A F% 

Q 


> >90’ 


(32.2.6) 


we conclude that nrd(O”) < RY 0° (Strictly speaking, we only needed the containment 
nrd(B*) < F% o which follows directly from local considerations; but we pursue finer 
questions below.) 


32.2.7. In light of 32.2.3, the reduced norm gives an exact sequence 


nrd 
1 > O! => O* = R 
>Q0 


(32.2.8) 


where 
O! := {y € O* : nrd(y) = 1} (32.2.9) 


is the subgroup of units of reduced norm 1. 
Since nrd(R*) = R”? by the squaring map, we have O! N R* = {+1}, so (32.2.8) 
yields 


o! O* nrd RŽ 0 
1 —> ESN = R — Re (32.2.10) 


Since the group R* is finitely generated, the group R% )/ R* is a finite, elementary 
abelian 2-group. (In general, the reduced norm (the last) map in (32.2.10) need not be 
surjective.) 
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32.2.11. In general, the exact sequence (32.2.10) does not split, so the group O*/R* 
will be a nontrivial extension of O!/{+1} by an elementary abelian 2-group. 


Example 32.2.12. If B = M2(F) and O = M2(R), then OX = GL2(R) and O*/R* = 
GL2(R)/R* =: PGL2(R). The reduced norm is the determinant, which is surjective, 
and so the exact sequence (32.2.10) can be extended to 


1 > PSL»(R) > PGL»(R) “> R*/R2 > 1. 


32.2.13. In light of Example 32.2.12, it is natural to write POX := O*/R* and PO! := 
O! {£1}. 


Remark 32.2.14. Some authors write GL; (O) = O* and SL\(O) = O!, and this 
notation suggests generalizations. In such situations, it is natural to write PGL; (O) = 
O*/R* and PSL(O) = O! /{+1}. 


32.2.15. Suppose F is a number field. Then the group RZ o /R*? is canonically iso- 
morphic to a quotient of class groups, as follows. Let Zp be the ring of integers of 
F, and let Clo Zr denotes the class group of F given by the quotient of the group of 
fractional ideals by the subgroup of principal ideals with a generator positive at all 
places in Q. Then RE o /R*? is isomorphic to the quotient of Clo Zr by Cl Zp and the 
group generated by the finite primes in S. 


32.3 Units in definite quaternion orders 


In this section, we show that for a definite quaternion order O, the scalar units R* are 
of finite index in OX—i.e., the group PO*% = O%/R* is a finite group. 


32.3.1. To build intuition, suppose F is anumber field with r real places and c complex 
places. Recall the proof of Dirichlet’s unit theorem: we define a map 


R* > Rs 


xb (m log|x|,),, - 


(32.3.2) 


The kernel of this map is the group of roots of unity (the torsion subgroup of R*). 
The image lies inside the trace zero hyperplane >\,<5 xy = 0 by the product formula 
(14.4.6), and it is discrete and cocompact inside this hyperplane, so it is isomorphic 
to Z*!. In particular, R* is finite if and only if #S = 1; since S always contains 
the set of archimedean places of size r + c, we see that R* is finite if and only if 
(r, c) = (1,0), (0, 1), so F = Q or F is an imaginary quadratic field. 


Remark 32.3.3. Informally, one might say that R* is finite only when the completions 
at the places in S provide “no room” for the unit group to become infinite. This is 
analogous to the informal case for strong approximation in 28.5.4: if there is a place 
v € S where B} is not compact, then there is enough room for B! to “spread out” and 
become dense. 
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32.3.4. Recall (Definition 28.5.1) that B is S-definite if S C Ram(B), i.e., every place 
in S is ramified in B. In particular, if F is a number field, then since a complex place 
is split and S contains the archimedean places, if B is S-definite then F is totally real; 
in this case, when S is exactly the set of archimedean places, we simply say that B is 
definite. 


32.3.5. Consider the setup in analogy with Dirichlet’s unit theorem 32.3.1. We consider 
the embedding of B into the completions at all places in S: 


B > B; = [ [2 


veS 


By Exercise 27.14, R is discrete in Fs = [],<5 Fy and O is discrete in Bs (the point 
being that in the number field case, S contains all archimedean places). Consequently, 
the injections 
O*/R* > (Bs) (Fs) := | | BS/R; 
veS 


O' > (Bs)! =] |z; 


ves 


(32.3.6) 


have discrete image. 

Depending on whether the place v is nonarchimedean (split or ramified) or 
archimedean (split real, ramified real, or complex), we have a different target com- 
ponent B*/F* or Bl. The major task of Part IV is to describe these possibilities in 
detail and look at the associated symmetric spaces. 


We begin with the simplest case, where the unit groups involved are finite. 


Proposition 32.3.7. The group O* /R* is finite if and only if O! is finite if and only if 
B is S-definite. 


Proof. By the exact sequence (32.2.10), the group O*/R* is finite if and only if the 
group O! is finite. 

First, suppose that B is S-definite. Then by definition, foreach v € S, the completion 
B, isa division algebra over F,. But each B! is compact, from the topological discussion 
in section 13.5. Therefore in (32.3.6), the group O! is a closed, discrete subgroup of a 
compact group—hence finite. 

Now suppose B is not S-definite. Then there is a place vo € S that is unramified; we 
will correspondingly find an element of infinite order (like solutions to the quaternion 
Pell equation coming from the original Pell’s equation (32.1.2)). We have Bẹ = 
M2(Fv,), so there exists œ € B be such that the reduced characteristic polynomial splits 
in Fy; we may suppose without loss of generality that K = F[qa] is a field. Let S be 
the integral closure of R in K (not to be confused with the set S). Then by the Dirichlet 
S-unit theorem (32.2.1), the rank of S*/R* is at least 1: the set of places w € PI(K) 
such that w lies above v € S contains at least one element from each v and two above 
vo, because it is split. So there is an element y € S*/R™ of infinite order. As R-lattices, 
the order $M O has finite R-index and hence finite index in S, so S*/(§ MN O)* is a finite 
group, and therefore a sufficiently high power of y lies in (S$ N O)* c O*, and O* is 
infinite. o 
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Example 32.3.8. Let B = (—1,—1 | Q) and let O be the Z-order generated by i, j, so 
that S = {oo}. Then B is S-definite, and O* = (i, 7) ~ Qg is the quaternion group of 
order 8. 

Now consider S = {2, co}; then B is still S-definite. We find 


O[1/2)* = (2, i, j, 1 +i) 
O[1/2]*/Z[1/2)* = (2,1, j, 1 +i)/(-1,2) = Og x Z/2Z 
(Exercise 32.2). 


Finally, if we take S = {5, co}, then B is no longer S-definite; and O[1/5]* contains 
the element 2 + i of norm 5 € Z[1/5]”* and infinite order. 


(32.3.9) 


32.3.10. Suppose B is S-definite. Then by Proposition 32.3.7, the group O*/R™ is 
finite. Since we have an embedding 


O*/R*® => B*/F* 


it follows that a O*%/R* is a finite subgroup of PB*, so a classification of finite 
subgroups of PB* gives a list of possible definite unit groups; we make this our task 
in the remainder of this chapter. 


32.4 Finite subgroups of quaternion unit groups 


We now embark on a classification of finite subgroups of PB* = B*/F* and PB! = 
B' /{+1}; this is akin to first getting acquainted with the roots of unity in a number 
field. Suppose throughout the rest of this chapter that F is a number field; we allow B 
to be definite or indefinite. 

We begin in this section with the classification of the possible groups up to iso- 
morphism that goes back at least to Klein [Kle56, Chapter II]: the original book dates 
back to 1884 and is undoubtedly one of the most influential books of 19th century 
mathematics. See also the descriptions by Coxeter [Coxtr40] and Lamotke [Lamo86, 
Chapters I-l] for a presentation of the regular solids, finite rotation groups, as well as 
finite subgroups of SL2(C). 


Proposition 32.4.1. Let T < PB* be a finite group. Then T is cyclic, dihedral, or an 
exceptional group Aa, Sa, As. 


We met these groups already in Proposition 11.5.2, and the proof is an extension 
of this result. 


Proof. Let v be an archimedean place of F. Then the natural map B* — B*/F* has 
kernel F% N BX = F*, so the group homomorphism B*/F* — B*/F* is injective. 
First suppose that v is a ramified (real) place, so B, ~ H and 


BX /F* = H*/R* = H!/{+1}. 


By Corollary 2.4.21, we have HH! /{+1} ~ SO(3) so T is a finite rotation group: these 
are Classified in Proposition 11.5.2. 
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In general, we seek to conjugate the group T in order to reduce to the case above. 
We may prove the lemma after making a base extension of F, so we may suppose 
that v is complex, with B, ~ Mo(C). Then B*/FX ~ PGLo(C), and via the injection 
B*/F* — PGL2(C) we obtain a finite subgroup IT © PGL»2(C). The natural map 
SL2(C) — PGL»(C) is surjective, as we may rescale every invertible matrix by a 
square root of its determinant to have determinant 1, and its kernel is {+1}, giving an 
isomorphism PSL2(C) ~ PGL2(C). We then lift I under the projection SL2(C) > 
PSL2(C) to a finite group (containing —1). We have 


H! =~ SU(2) = {A € SL2(C) : A*A = 1} > SLAC) (32.4.2) 


as in 2.4.2. If ( , ) denotes the canonical (Hermitian) inner product on C? defined by 
(z,w) = zřw (as column vectors), then SU(2) is precisely the group of matrices of 
determinant 1 preserving (, ), i.e., 


SU(2) = {A € SL2(C) : (Az, Aw) = (z, w) for all z, w € C? 


since (Az, Aw) = z*(A*A)w = z*wif and only if A*A = 1. We now define a T-invariant 
Hermitian inner product on C? by averaging: for z, w € C?, we define 


1 
(wr = z Dye YW). 
ye 


Choose an orthonormal basis for (, ) and let T € SL2(C) be the change of basis matrix 
relative to the standard basis. Then (z, w)r = (Tz, Tw) and therefore TTT! c SU(2). 
The result now follows from (32.4.2) and the previous case. o 


32.5 Cyclic subgroups 


In the next few sections, we discuss each of the possibilities in Proposition 32.4.1 in 
turn, following Chinburg—Friedman [CF2000]. We begin with cyclic subgroups. 

There are always many subgroups of PB™ of order 2: every nonscalar element 
a € BX with trace zero has a? € F*. 


Proposition 32.5.1. Let m > 2 and let Gn € F” be a primitive mth root of unity. Then 
PB* contains a cyclic subgroup of order m if and only if Gn + | € F and F (ĉn) splits 
B. Such a cyclic subgroup is unique up to conjugation in PB™. 


Proof. First we prove (€). Suppose ¢,,+¢,,! € F and F(&n) splits B. If in fact ¿n € F, 
then F splits B, i.e. B ~ M2(F); then y := ( : ) has order m in PB* ~ PGL2(F). 
Otherwise, since ém + a € F, we have [F(G,) : F] = 2, with G, a root of the 
polynomial T? — (Gn + G,!)T + 1. By Lemma 5.4.7, this implies there is an embedding 
F(én) => B; let ¢ be the image of ¢, under this embedding. If ¢ has order d in PB™, 
then Q(Gn + VAn ie) C F; since n ¢ F, we must have d = m if m is odd or d = m/2 
if m is even. Let y = 1+ £. Then y?@-! = 2 + +27! € F*, so y has order m in PB*. 
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Now we prove (=). Suppose that y € B* has image in PB* of order m > 2, so 
that y” = a € F*. We do calculations in the commutative F-algebra K := F[y]. Let 
ç := yy! € K*. Then 

ç” = y"(y”y! = aa! = 1 (32.5.2) 


so ç” =1.Ifç1=1ford | m then y@ = oty4 = yt so y? € F* and thus d = m; thus 
ç has order m in B*. Applying the standard involution again gives 


Y=H¥_¥s=Sv=scy: (32.5.3) 


thus ¢ = ç7!, so ç ¢ F and trd(c) = ç+ ç! € F. Taking an appropriate power to 


match up the root of unity, we conclude ¿mn + G,,! € F. Finally, either K is a quadratic 
field in B, in which case K splits B by Lemma 5.4.7, or K is not a field and B ~ M2(F), 
in which case F already splits B. 

We conclude with uniqueness. Continuing from the previous paragraph, we have 
shown that y + y = (1+ s)y € F*, so y and 1 + ç generate the same cyclic subgroup 
of PB*, where ç™ = 1. If K = F(¢) is a field, then all embeddings F({,) — B are 
conjugate in B* by the Skolem—Noether theorem (Corollary 7.1.5), and consequently 
every two cyclic subgroups of order m are conjugate. Otherwise, the reduced charac- 
teristic polynomial of ç factors, so B ~ Mo(F), and its roots (the eigenvalues of ç) 
belong to F. If the eigenvalues are repeated, then up to conjugation in GL2(F), ¢ is 


a scalar multiple of i i with b € F, and therefore has infinite order, impossible. 


0 a 
primitive mth root of unity and the cyclic subgroup is unique up to conjugation. o 


Thus the roots are distinct, and ç is conjugate to a multiple of ( o) and so Ais a 


32.5.4. The splitting condition in Proposition 32.5.1 can alternatively be phrased in 
local-to-global terms (Proposition 14.6.7): K = F(&n) splits B if and only if every 
place v € RamB is not split in K. Since the field F(Z) is totally complex, every 
archimedean place splits, and so when K + F we have K — B if and only if no prime 
p € RamB splits in K. 


32.5.5. The proof of Proposition 32.5.1 describes the cyclic subgroup explicitly, up to 
conjugation (still with m > 2): 


(i) If Gn € F, then B ~ M>(F) and every cyclic subgroup of PGL2(F) of order m is 
0 


conjugate to the subgroup generated by Ym = i : ; 

(ii) Otherwise, K = F(én) is a quadratic extension of F with K —> B, and every 
subgroup of PB” of order m is conjugate to the subgroup generated by the image 
of Ym = 1 + Sn. 

The F-algebra Km = F [Ym] is separable and uniquely determined up to isomorphism. 


In contrast to Proposition 32.5.1, there are a great many cyclic subgroups of order 
m = 2 in PB*, described as follows. 
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32.5.6. If y € PB* has order m = 2, then y? = a € F* and y ¢ F*. Therefore, either 
a ¢ F*, equivalently K = F[y] ~ F(ja) is a field, and the embedding K © B is 
unique up to conjugation in B* by the Skolem—Noether theorem; or a € F*?, in which 
case after rescaling y? = 1 so B ~ M2(F) and y is conjugate to G E ; 

The following corollary shows that we can often reduce to the case of an even order 
subgroup. 


Corollary 32.5.7. Let m > 1 be odd. Then PB* contains a cyclic subgroup of order m 
if and only if PB* contains a cyclic subgroup of order 2m. 


Proof. If m = 1, then we are all set by 32.5.6. If m > 3, then Proposition 32.5.1 
applies, and we see that the hypotheses hold for m if and only if they hold for 2m, since 


lam = =m . Oo 


Corollary 32.5.8. PB* contains a cyclic subgroup of order 2m if and only if PB! 
contains a cyclic subgroup of order m. 


Proof. The corollary follows from 32.5.5: the subgroup of PB* of order 2m generated 
by y2m yields the subgroup of PB! of order m generated by fs, and vice versa. o 


32.6 =» Dihedral subgroups 


We now turn to the dihedral case, where we show that every cyclic subgroup extends 
(in general, in many ways) to a dihedral subgroup, continuing to follow Chinburg— 
Friedman [CF2000, Lemma 2.3]. 


Lemma 32.6.1. Let m > 2. Then the following statements hold. 


(a) Every cyclic subgroup of PB” of order m is contained in a dihedral subgroup of 
order 2m; in particular, PB™ contains a dihedral subgroup of order 2m if and 
only if it contains a cyclic subgroup of order m. 

(b) Let y € B* have order m in PB*, and let K = Fly]. For j € B*, we have 


K,b 
(y, j} © PB* dihedral if and only if j? = b € F and B =K + Kj = (=) 


Proof. First (a). The implication (=) is immediate, so we prove (©). Let m > 2 and 
suppose that PB” contains a cyclic subgroup of order m, generated by the image of 
y € BX, and let K = F[y]. Let j € B be orthogonal to K under nrd. Then j? = b € F*, 
and ja = @j = a7! j € PBX for all œ € K, so the subgroup <y, j} is dihedral of order 


b 
2m in PB*, and B = T , as in Exercise 6.2. 


Now (b). We just showed (=) in the previous part, so we show (€). Let Ir c PB” 
be a dihedral subgroup of order 2m, where T = (y, j} has y € B* generating a cyclic 
subgroup of order m in PB% and j € B* satisfies 


j yj =y" € B*/F*. 
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F 

First we show j~!yj = y. This follows from a direct argument using reduced norm 
and trace (see Exercise 32.4), but we have also the following argument. Since K = F[y] 
is semisimple (see 32.5.5) and conjugation by j acts as an F-algebra automorphism of 
K = F[y], it is either the identity or the standard involution, and thus j~'yj = y, y. But 
we cannot have j~!yj = y, because then K[j] € B would be a commutative subalgebra 
of dimension > 3, a contradiction. 

Now by (4.2.16), expanding the trace gives 


K,b 
Let K = F[y]. We claim that B= K + Kj = (=). 


rd(jy) jy+yj ITEY 9) - J) (32.6.2) 
= jY -yj + trd(j)y = trd(j)y. 


Since 1, y are linearly independent we conclude trd(j) = trd(jy) = 0, i.e., j is orthog- 


2 x K,b 
onal to K under nrd, so j^ = b € F” and B = F | o 


The dihedral subgroups of order 2m for m > 2 are classified as follows. 


Lemma 32.6.3. Let m > 2. Then the set of dihedral subgroups of order 2m up to 
conjugation in PB* are in bijection with the group 


Nmx,, /F (Kn) 
(5) F*2 


where Km is as in 32.5.5 and 6 =2+ ln + Gnl. 


(32.6.4) 


Proof. Let T = (y,j) and I” = (y,j’) be two dihedral subgroups as in Lemma 
32.6.1(b) with j? = b and (j^)? = b’. Then j’ € K+ = Kj so j’ = Bj with B € K*, and 
nrd(j’) = b’ = nrd()b, so bb’ € Nmx jr (K%) (as in Exercise 6.4). Then we claim that 
T, T” are conjugate in PB* if and only if 

bb! € F**(6). 


If I’ = a-'Ta with œ € BX, then conjugation by a normalizes the unique cyclic 
subgroup on both sides, so y” j’ = a7! ja for some r, and therefore 
-6' b’ = nrd(y’ j’) = nrd(j) = -b 


as desired. Conversely, if bb’ € F**(6) then so too for nrd(B) = b’b~!; rescaling B 
by F* and replacing £ by y£ if necessary, we may suppose nrd(8) = Nmxjr(B) = 1 
(without changing T”); by Hilbert’s theorem 90, there exists œ € K* such that B = wa“!, 


and conjugation by œ again normalizes the cyclic subgroup and satisfies 
an! ja = Ga! j = Bj 


as desired. oO 
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Remark 32.6.5. One can rephrase Lemma 32.6.3 in terms of a global equivalence 
relation, further encompassing the case m = 2: see Chinburg—Friedman [CF2000, 
Lemma 2.4]. 


Corollary 32.6.6. The group PB! contains a dihedral group of order 2m > 4 if and 
Kom, -1 
ly if B ~ | —— |]. 
won 
Proof. We first prove (>). If PB! contains a dihedral group I = <y, j} of order 2m 
then it contains a cyclic subgroup of order m so by Corollary 32.5.8 the group PB” 
contains a cyclic subgroup of order 2m, which we may take to be generated by yam as 
in 32.5.5 with Kom = F [yam] = F[y]; by hypothesis we have a = —nrd(j) = —1, and 
K m» =l 
so B = | 27 F ) as in the classification in Lemma 32.6.1. 
Next we prove the converse implication (4). We refer to 32.5.5. In case (i) where 
loam € F, we have nrd l Ym) = on = | so we may take I = (Ym: in case 
(ii), where ém ¢ F, we take I = (Lom, j). oO 


32.7 »* Exceptional subgroups 


Finally we treat exceptional groups (cf. Gehring—Maclachlan—Martin—Reid [GMMR97, 
p. 3635]). We found quaternionic realizations of the exceptional groups in section 11.5 
when B = (—1,-1| F) (and V5 € F for As). 


Proposition 32.7.1. The following statements hold. 


(a) PB* contains a subgroup isomorphic to A4 if and only if PB! contains a subgroup 
isomorphic to A4 if and only if B ~ (—1,—-1 | F). 

(b) PB* contains a subgroup isomorphic to S4 if and only if it contains a subgroup 
isomorphic to A4; and PB! contains a subgroup isomorphic to S4 if and only if 
B = (-1,-1| F) and V2 € F. 

(c) PB* contains a subgroup isomorphic to As ifand only if PB! contains a subgroup 
isomorphic to As if and only if B = (—1,-1| F) and V5 € F. 


Any two such exceptional subgroups of PB* (or PB!) are conjugate by an element of 
B* if and only if they are isomorphic as groups. 


Proof. First we prove (a); let T © PB* be a subgroup with T ~ A4. The reduced 
norm gives a homomorphism F — nrd(T) © F*/F x2 but A4 has no nontrivial 
homomorphic image of exponent 2, so nrd(T) C F**. Therefore, there is a unique 
lift of IT to B'/{+1}, and the map B!/{+1} > B*/F* is an isomorphism from this 
lift to H. This shows the first implication; its converse follows from the injection 
PB! > PB*. For the second implication, let i, j € B! generate the V4-subgroup (the 
normal subgroup of index 3 isomorphic to the Klein 4 group) of Aq in PB'. Then 
i,j ¢ F*, andi? = —nrd(i) = —1 = j*; and similarly (ij)? = —1 implies ji = -ij. By 
Lemma 2.2.5, we conclude B = (—1,—1 | F). The converse follows from the Hurwitz 
unit group 11.2.4. 
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For part (b), the implication (=) is immediate; the implication (4) follows by 
taking the Hurwitz units and adjoining the element 1 + i, as in 11.5.4 but working 
modulo scalars. For the second statement, an element of order 4 in B!/{+1} lifts to 
an element of order 8 in B! and therefore has reduced trace +V2 € F ; the converse 
follows again from the explicit construction in 11.5.4. 

For part (c), we argue similarly. Since As is generated by its subgroups isomorphic 
to A4, we may apply (a) to get a lift, and by the reduced trace we get V5 € F. 

The uniqueness statement is requested in Exercise 32.5. oO 


Exercises 


Unless otherwise indicated, let F be a number field with ring of integers R, let B be a 
quaternion algebra over F and let O C B be an R-order. 


1. Let F be a totally real field, let K 3 F be a totally imaginary quadratic extension 
of F, so K = F(Vd) with d totally negative. Let S be the ring of integers of K. 
Consider the group homomorphism 


ġ: S* > S* 
ur u/u 


where ~ is the nontrivial F-involution of K. 


(a) Show that if u € u(S)R* then u € ker ¢. 

(b) Show that ¢(u) is a root of unity for all u € S*. [Hint: It is an algebraic 
integer of absolute value 1 under all complex embeddings. ] 

(c) Let u(S) be the subgroup of roots of unity of S*, and let y: S* > 
u(S)/ (S)? be the map induced by ¢. Show that if u € ker y, so @(u) = £7 
with ¢ € u(S), then ¿71u € R*. Conclude that ker y = u(S)R*. 

(d) Show that [S* : u(S)R*] < [S* : S'R*] < 2. 


[The index [S* : u(S)R*] is known as the Hasse unit index.] 


-1,-1 
2. Let B = oO and let O be the Z-order generated by i,j. Prove that 
O[1/2)* = (2,i,j,1 + i) and describe O[1/2]*/Z[1/2]* as an extension of 
Qs by Z/2Z. 
3. Show that 
[RX R*?] 
O* : O!R*] = [nrd(O*) : R] = —_ 2 
l TEE l [RÖ : nrd(O*)] 
where RX „ is the subgroup of units positive at ramified infinite places of B, 
>Q0 
defined in (32.2.6). 


4. Let B be a quaternion algebra over a field F with char F + 2, and let y, j € B* 
be such that j~'yj = y~!. Show by looking at the reduced norm and trace that 
i 'yi = 7 (cf. Lemma 32.6.1). 
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> 5. Prove the uniqueness statement in Proposition 32.7.1: Show that every two 
isomorphic exceptional subgroups of PB* are conjugate by an element of B”, 
and the same for B!. 


6. Let < O! be a maximal finite subgroup. Combining results from sections 11.2 
and 11.5 and this chapter, prove the following. 


(a) T is isomorphic to one of the following groups: 


cyclic of order 2m with m > 1, 

binary dihedral (dicyclic) 2D2,, of order 4m with m > 1, 
binary tetrahedral 27 of order 24, 

binary octahedral 20 of order 48, or 

binary icosahedral 27 of order 120. 


In the latter three cases, we call T exceptional. 
(b) IFT = 20 then F 2 Q(V2) and ifT = 2/ then F 2 Q(V5). 


-1,-1 
(c) If I is exceptional, then B = (==) 


7. Continuing with the previous exercise, suppose that B is totally definite, so F is 
totally real. Prove the following statements. 


(a) If O! does not contain an element of order 4, then O! is cyclic. 

(b) If O! is quaternion Qg ~ 2D, or exceptional, then R{V-1] © O and 
discrd(O) is only divisible by primes dividing 2. 

(c) If O! ~ 2D, with m > 3, then R[V-1], R[42m] — O and discrd(O) is 
only divisible by primes dividing AZn — 4, where A2m := Lam + &, l : 


8. Continuing further with the previous exercise, we compare O* and O!. 


(a) Show that [O* : O'] = 1, 2,4. [Hint: Use Exercise 32.1.] 

(b) Show that if O! = 20, 2/, then OX = O!. 

(c) Show that if O! = 2T, then [O* : O'] < 2, and equality holds if and only 
if there exists y € (1 + i)F* N O% such that nrd(y) ¢ F*?. 


[For a complete account covering all cases, see Vignéras-Guého [VG74].] 


This chapter is licensed under the terms of the Creative Commons Attribution- 
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nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and 
reproduction in any medium or format, as long as you give appropriate credit to the 
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The images or other third party material in this chapter are included in the chapter’s 
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material is not included in the chapter’s Creative Commons license and your intended 
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Chapter 33 


Hyperbolic plane 


In this chapter, we give background on the geometry of the hyperbolic plane. 


33.1 > The beginnings of hyperbolic geometry 


We have seen that the group of unit Hamiltonians H! acts by rotations of Euclidean 
space and therefore by isometries of the unit sphere, and that in spherical geometry the 
discrete subgroups are beautifully realized as classical finite groups: cyclic, dihedral, 
and the symmetry groups of the Platonic solids. 

Replacing H with M2(R), the group SL2(R) of determinant 1 matrices possesses a 
much richer supply of discrete subgroups. In fact, PSL2(R) can be naturally identified 
with a circle bundle over the hyperbolic plane, and so the structure of quaternionic 
unit groups is naturally phrased in the language of hyperbolic geometry. Indeed, it 
was work on automorphic functions and differential equations invariant under discrete 
subgroups of PSL2(R) that provided additional early original motivation to study 
hyperbolic space: their study provides an incredibly rich interplay between number 
theory, algebra, geometry, and topology, with quaternionic applications. This interplay 
is the subject of the final parts of the text. 

In this chapter, we provide a rapid introduction to the hyperbolic plane. Hyperbolic 
geometry has its roots preceding the quaternions, in efforts during the early 1800s to 
understand Euclid’s axioms for geometry. Since the time of Euclid, there had been 
attempts to prove the quite puzzling parallel postulate (given a line and a point not 
on the line, there is a unique line through the point parallel to the given line) from 
the other four simpler, self-evident axioms for geometry. In hyperbolic geometry, the 
parallel postulate fails to hold—there are always infinitely many distinct lines through 
a point that do not intersect a given line—and so it is a non-Euclidean geometry. 

The underpinnings of what became hyperbolic geometry can be found in work by 
Euler and Gauss in their study of curved surfaces (the differential geometry of surfaces). 
It was then Lobachevsky and Bolyai who suggested that curved surfaces of constant 
negative curvature could be used in non-Euclidean geometry, and finally Riemann who 
generalized this to what are now called Riemannian manifolds. Klein coined the term 
“hyperbolic” for this geometry because its formulae can be obtained from spherical 
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geometry by replacing trigonometric functions by their hyperbolic counterparts. See 
[Sco83, $1] for a nice overview of the 2-dimensional geometries. 

Hyperbolic geometry, and in particular the hyperbolic plane, remains an important 
prototype for understanding negatively-curved spaces in general. Milnor [Milno82] 
gives a comprehensive early history of hyperbolic geometry; see also the survey by 
Cannon-Floyd—Kenyon—Parry [CFKP97], which includes an exposition of five models 
for hyperbolic geometry. (It is also possible to work out hyperbolic geometry in a 
manner akin to what Euclid did for his geometry without a particular model, following 
Lobachevsky [LP2010].) 

For further references on hyperbolic plane geometry, see Jones—Singerman [JS87, 
Chapter 5], Anderson [And2005], Ford [For72], Katok [Kat92, Chapter 1], Iversen 
[Ive92, Chapter III], and Beardon [Bea95, Chapter 7]. There are a wealth of geometric 
results and formulas from Euclidean geometry that one can try to reformulate in the 
world of hyperbolic plane geometry, and the interested reader is encouraged to pursue 
these further. 


33.2 Geodesic spaces 


In geometry, we need notions of length, distance, and the straightness of a path. These 
notions are defined for a certain kind of metric space, as follows. 

Let X be a metric space with distance p. An isometry g : X — X is a bijective 
map that preserves distance, i.e., p(x, y) = p(e(x), g(y)) for all x, y € X. (Any distance- 
preserving map is automatically injective and so becomes an isometry onto its image.) 
The set of isometries Isom(X) forms a group under composition. 


33.2.1. A path from x to y, denoted v: x — y, is a continuous map v: [0,1] > X 
where u(0) = x and v(1) = y. (More generally, we can take the domain to be any 
compact real interval.) The length (v) of a path v is the supremum of sums of 
distances between successive points over all finite subdivisions of the path (the path 
is rectifiable if this supremum is finite). Conversely, if X is a set with a notion of 
(nonnegative) length of path, then one recovers a candidate (intrinsic) metric as 


p(x,y) = inf £w), (33.2.2) 
Uix7y 


a metric when this infimum exists (i.e., there exists a path of finite length x — y) for 
all x, y € X. If the distance on X is of the form (33.2.2), we call X a length metric 
space or a path metric space, and by construction (gv) = €(v) for all paths v and 
g € Isom(X). 


Example 33.2.3. The space X = R” with the ordinary Euclidean metric is a path 
metric space; it is sometimes denoted E” as Euclidean space (to emphasize the role of 
the metric). 


33.2.4. If X is a path metric space and v achieves the infimum in (33.2.2), then we say 
v is a geodesic segment in X. A geodesic is a continuous map (—œ%, œ) —> X such 
that the restriction to every compact interval defines a geodesic segment. If X is a path 
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metric space such that every two points in X are joined by a geodesic segment, we say 
X is a geodesic space, and if this geodesic is unique we call X a uniquely geodesic 
space. 


33.2.5. If X is a geodesic space, then an isometry of X maps geodesic segments 
to geodesic segments, and hence geodesics to geodesics: i.e., if g € Isom(X) and 
v: x —> y is a geodesic segment, then gu: gx — gy is a geodesic segment. After all, 
g maps the set of paths x — y bijectively to the set of paths gx — gy, preserving 
distance. 


33.2.6. In the context of differential geometry (our primary concern), these notions 
can be made concrete with coordinates. Suppose U c R” is an open subset. Then 
a convenient way to specify the length of a path in U is with a length element in 
real-valued coordinates. To illustrate, the ordinary metric on R” is given by the length 
element 


ds := dx? +--+ + dx 


n»? 


so if v: [0,1] — U is a piecewise continuously differentiable function written as 
v(t) = (x1 (t), ..., Xn(t)), then 


dan \ 
$ dt (33.2.7) 


as usual. 

More generally, if 4: U — Rso is a positive continuous function, then the length 
element A(x) ds defines a metric (33.2.2) on U, as follows. The associated length 
(33.2.7) is symmetric, nonnegative, and satisfies the triangle inequality. To show that 
p(x, y) > 0 when x # y, by continuity 4 is bounded below by some ņ > 0 on a suitably 
small e ball neighborhood of x not containing y, so every path v: x — y has £(v) > en 
and p(x, y) > 0. 

In this context, we also have a notion of orientation, and we may restrict to isome- 
tries that preserve this orientation. We return to this in section 33.8, rephrasing this in 
terms of Riemannian geometry. 


Remark 33.2.8. The more general study of geometry based on the notion of length in a 
topological space (the very beginnings of which are presented here) is the area of metric 
geometry. Metric geometry has seen significant recent applications in group theory and 
dynamical systems, as well as many other areas of mathematics. For further reading, 
see the texts by Burago—Burago-—Ivanov [BBI2001] and Papadopoulous [Pap2014]. 

In particular, geodesic spaces are quite common in mathematics, including com- 
plete Riemannian manifolds; Busemann devotes an entire book to the geometry of 
geodesics [Bus55]. Uniquely geodesic spaces are less common; examples include sim- 
ply connected Riemannian manifolds without conjugate points, CAT(Q) spaces, and 
Busemann convex spaces. 


The following theorem nearly characterizes geodesic spaces. 
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Theorem 33.2.9 (Hopf—Rinow). Let X be a complete and locally compact length 
metric space. Then X is a geodesic space and every bounded closed set in X is 
compact. 


Proof. See e.g. Bridson—Haefliger [BH99, Proposition 3.7]). oO 


33.3 Upper half-plane 


We now present the first model of two-dimensional hyperbolic space (see Figure 
332): 


Definition 33.3.1. The upper half-plane is the set 
H? := {z = x + iy € C : Im(z) = y > 0}. 


H =H? 


Figure 33.3.2: Upper half-plane H? 


Definition 33.3.3. The hyperbolic length element on H? is defined by 


ydx? +dy? — |dz|_ 


ds := = : 
j y Imz 


(33.3.4) 


As described in 33.2.6, the hyperbolic length element induces a metric on H?, and 
this provides it with the structure of a path metric space. 


Definition 33.3.5. The set H? equipped with the hyperbolic metric is (a model for) the 
hyperbolic plane. 


Remark 33.3.6. The space H°? can be intrinsically characterized as the unique two- 
dimensional (connected and) simply connected Riemannian manifold with constant 
sectional curvature —1. 


The hyperbolic metric and the Euclidean metric on H? are equivalent, inducing 
the same topology (Exercise 33.1). However, lengths and geodesics are different under 
these two metrics, as we will soon see. 


33.3.7. The group GL2(R) acts on C via linear fractional transformations: 


az+b a b 
z= zya for g = (° 4 € SL2(R) and z € C; 
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since 
_ (azt+ byczt+d) _ ac\z + adz + bcz + bd (33.3.8) 
lez + d|? Icz + d|? 
we have F 
etg 
I = i 33.3.9 
mer = ezra PRA 


and so if Imz > 0, then Im gz > 0 if and only if det g > 0. Therefore, the subgroup 
GL; (R) = {g € GL2(R) : det(g) > 0} 


preserves the upper half-plane H?. Moreover, because the action of GL} (R) is holo- 
morphic, it is orientation-preserving. 

The kernel of this action, those matrices acting by the identity as linear fractional 
transformations, are the scalar matrices, since (az + b)/(cz + d) = z identically if 
and only if c = b = 0 and a = d. Taking the quotient we get a faithful action of 
PGL; (R) = GLZ (R)/R* on H°. There is a canonical isomorphism 


PGL; (R) => PSL2(R) 


= : g. 
vaet(g) 


with the same action on the upper half-plane. 


33.3.10. The determinant det : PGL2(R) — R*/IR*? ~ {+1} has the inverse image 
of +1 equal to PGL; (R) both open and closed in PGL,(R); therefore, any g with 
det(g) < 0 together with PGL; (R) generates PGL2(R): for example, we may take 


pE G i) (33.3.11) 


In view of (33.3.9), we extend the action of PGL2(R) on H? by defining for g € PGL2(R) 


and z € H? 
if detg > 0; 
‘Z= 


te (33.3.12) 
gz=ez, if detg <0. 


The elements g € PGL»2(R) with det g < 0 act anti-holomorphically and so are 
orientation-reversing. The matrix g in (33.3.11) then acts by g(z) = —z. 
This action also arises naturally from another point of view. Let 


H?” = {z eC: Imz <0} 
be the lower half-plane, let H?* = H?, and let 
H2+ = H? UH? ={zeC:Imz#0}=C-R. 


Then PGL»(R) acts on H7* (it preserves R hence also its complement in C). Complex 
conjugation z +> Z interchanges H?* and H?~, and there is a canonical identification 


HIG aay H2 
from which we obtain the action (33.3.12) of PGL; (R) on H?. 
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Remark 33.3.13. The fact that PSL2(R) has elements g € PSL2(R) that are matrices up 
to sign means that whenever we do a computation with a choice of matrix, implicitly 
we are also checking that the computation goes through with the other choice of sign. 
Most of the time, this is harmless—but in certain situations this sign plays an important 
role! 


Let Isom*(H?) < Isom(H7) be the subgroup of isometries of H? that preserve 


orientation. 


Theorem 33.3.14. The group PSL2(R) acts on H? via orientation-preserving isome- 
tries, i.e., PSL:(R)  Isom*(H’). 


Proof. Because the metric is defined by a length element ds, we want to show that 
d(gs) = ds for all g € PSL2(R), i.e., 


ld(ge)l _ Ide 
Im(gz) Imz 
for all g € PSL2(R). Since |d(gz)| = |dg(z)/dz||dz|, it is equivalent to show that 
d d 1 
ldg@idz| _ 1 (33.3.15) 
Im gz Imz 
for all g € PSL2(R). 
Let g € PSL2(R) act by 
ae az+b 
cna cz+d 
with ad — bc = 1. Then 
dg (cz + d)a — (az + b)c 1 
= = 3 33.3.16 
dz | (cz + dy? lez + dÊ ( ) 
by (33.3.9), 
Im gz = w 
87 = Tez + dP 


so taking the ratio, the two factors |cz + d 2 exactly cancel, establishing (33.3.15). 
Finally, the action is holomorphic so (by the Cauchy—Riemann equations) it lands 
in the orientation-preserving subgroup. o 


33.3.17. The action of PSL2(R) extends to the boundary as follows. We define the 
circle at infinity to be the boundary 


bd H? := RU {oo} C CU {oo}. 


(The name comes from viewing H? in stereographic projection as a half-sphere with 
circular boundary.) The group PSL»(R) still acts on bd H? by linear fractional trans- 
formations. We define the completed upper half-plane to be 


H” := H? U bd H?. 
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The topology on H” is the same as the Euclidean topology on H?, and we take a 
fundamental system of neighborhoods of the point co to be sets of the form 


{z € H? : Imz > M} U {oo} 
for M > 0 and a system of neighborhoods of the point zo to be 
{zo} U {|z — (Re zo + mi)| < m} 


i.e. open disks tangent to the real axis at zo, together with Zo. 


Remark 33.3.18. Although the hyperbolic plane cannot be embedded in R°, it can be 
locally embedded. One way to visualize plane hyperbolic geometry (locally) is by the 
pseudosphere, the surface of revolution generated by a tractrix: it is a surface with con- 
stant negative curvature and so locally models the hyperbolic plane (Figure 33.3.19). 


Figure 33.3.19: Pseudosphere 
Remark 33.3.20. We will compactify quotients of H? in other ways below. In that 
context, we will add only a subset of the boundary of H?*; this overloading should 
cause no confusion. 


33.4 Classification of isometries 


On our way to classifying isometries, we pause to identify three natural subgroups of 
SL2(R): 


x = Soa) = {{ o st eR} = R/2Z 
—sint cost 
A= Ile o zae Ro) = Rž) =R (33.4.1) 


vel() isen] =n 
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ait+b 


We have K = Stabs, œ)(i) since = iif and only if d = a and c = —b, and 


then the determinant condition implies a? + b? = 1. An element l A acts by 


z+ a’z, fixing the origin and stretching along lines through the origin. An element 


G i acts by the translation z > z + b. 


Proposition 33.4.2 (Iwasawa decomposition). The multiplication map gives a home- 
omorphism 
NXAxK > SLR). 


In particular, for all g € SL2(R), we can write uniquely g = ngagkg with ng € N, 
dg € A, and kg € K in a way continuously varying in g. 


Proof. The multiplication map N x A x K — SL2(R) is continuous and open, so we 
need to show it is bijective. It is injective, because checking directly we see that 


NAN K={lI}=NNA. 
This map is surjective as follows. Let g € SL(R), and let z = g(i). Let ng = 


( a 7) € N, so that (ngg)(i) = yi. Let ag = ka A € A; then (agng8)(i) = 


i, SO AgNgg € Stabs, R) (i) = K, and peeling back we get g € NAK, proving surjectiv- 
ity. o 


Remark 33.4.3. We have AN = NA, and we showed in the proof of Proposition 33.4.2 
that NA acts transitively on H? (by z + az + b). In section 34.6, we reinterpret this 
as providing a direct link between H? and SL2(R). 


Lemma 33.4.4. The group SL2(R) is generated by the subgroups A, N, and the element 


h a) which acts on H? by z > -1/z. 


Proof. Let ‘ i € SL2(R). The lemma follows by performing row reduction on the 


d 
matrix using the given generators. We find that if c + 0, then 


e ado AE oO allo 4) 
(0 aa) =o allo 1") 


The subgroups N, A, K can be characterized by their traces; with a view to working 
on PSL2(R), we consider the absolute traces: 


and if c = 0 then 


Oo 


[Tr(K)| = [0,2], [Tr(A)| = [2, œ), and |Tr(V)| = {2}. 
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Definition 33.4.5. An element g € PSL2(R) with g + +1 is called elliptic, hyperbolic, 
or parabolic according to whether |Tr(g)| < 2, |Tr(g)| > 2, or |Tr(g)| = 2. 


Every nonidentity element g € PSL2(R) belongs to one of these three types (even 
though g need not belong to one of the subgroups N, A, K individually). 


Lemma 33.4.6. An element g € PSL2(R) is 
elliptic a unique fixed point in H?, 
hyperbolic} if and only if g has 4 two fixed points on bd H?, 
parabolic a unique fixed point on bd H?. 


Proof. Let g = É i have det(g) = ad — bc = 1. We look to solve the equation 


d 


az+b 
=z 
cz+d 


or equivalently cz? +(d—a)z—b = 0. The discriminant is (d-a) +4bc = (a+d)*-4 = 
Tr(g)? — 4. Therefore g is elliptic if and only if this discriminant is negative if and only 
if there is a unique root in H?; g is parabolic if and only if this discriminant is zero 
if and only if there is a unique root in bd H?; and g is hyperbolic if and only if this 
discriminant is positive if and only if there are two roots in bd H°. oO 


33.4.7. Let g € PSL2(R). If g is elliptic, then g acts by hyperbolic rotation in a neigh- 
borhood around its fixed point, as in Figure 33.4.8; every such element is conjugate 
to an element of K, fixing i. (Indeed, in the unit disc model with its fixed point as the 
center, an elliptic element acts literally by rotation in the disc; see section 33.7.) 


elliptic 


Figure 33.4.8: Action on H? by an elliptic element 


A hyperbolic element can be thought of as a translation along the geodesic between 
the two fixed points on bd H’, as in Figure 33.4.9. 

Moving these fixed points to 0, co, every such element is conjugate to an element 
of A, acting by z+ a?z with a + 1, as in Figure 33.4.10. 

Finally, a parabolic element should be thought of as a limit of the other two types, 
where correspondingly the fixed point tends to the boundary or the two fixed points 
move together; every such element is conjugate to an element of N, acting by translation 
z} z +n for somen E R. 
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hyperbolic 


Figure 33.4.9: Action on H? by a hyperbolic element 


+ 


Figure 33.4.10: Action on H? by a hyperbolic element with fixed points 0, 1 


Lemma 33.4.11. For all z, z’ € HŽ, there exists g € PSL2(R) such that gz, gz’ € Rsoi 
are pure imaginary. 


Proof. The proof follows directly by using translations and scaling (33.4.1), requested 
in Exercise 33.5. m 


33.5 Geodesics 


In this section, we prove two important theorems: we describe geodesics, giving a 
formula for the distance, and we classify isometries. 


Theorem 33.5.1. The hyperbolic plane H? is a uniquely geodesic space. The unique 
geodesic passing through two distinct points z, z! € H? is a semicircle orthogonal to 
R or a vertical line, and 
; z—z|+lz—-2’ 
p(z, z ) = log ada (33.5.2) 
lz- zl- lem" 
z-z 


h plzz’) =l + 57 a: 
cosh p(z, z’) j 2 Im(z) Im(z’) 


(33.5.3) 


Proof. We first prove the imaginary axis is a geodesic with z,z’ € Roi. Let u(t) = 
x(t) + iy(t) : z > z’ be a path; then 
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ev) 


1 1 
V(dx/dr)? + (dy/dty m f dy/dt y 


ni 


= log y(1) — log y(0) = log 


Z 
x 


with equality if and only if x(t) = 0 identically and dy/dt > O for all ¢ € [0, 1]. This is 
achieved for the path 

u(t) = (zA -H + [zis 
so p(z, 2’) = log |z’/z|, and the imaginary axis is the unique geodesic. 

For arbitrary points z,z’ € H?, we apply Lemma 33.4.11. The statement on 
geodesics follows from the fact that the image of R„oi under an element of PSL2(R) is 
either a semicircle orthogonal to R or a vertical line (Exercise 33.6). 

The formula (33.5.2) for the case z, z’ € Roi follows from (33.5.4) and plugging 
in along the imaginary axis; the general case then follows from the invariance of both 
p(z, z’) and 

Iz-zZ|+lz-2’ 

log ——S 

A= 2 =e! 
under g € PSL»(R), checked on the generators in Lemma 33.4.4 (Exercise 33.10). 
Finally, the formula (33.5.3) follows directly from formulas for hyperbolic cosine, 
requested in Exercise 33.11. o 


Theorem 33.5.5. We have 
Isom(H?) ~ PGL2(R) 


and 


Isom*(H?) = PGLi(R) = PSL2(R). 


Proof. Let Z = {ti : t > O} be the positive part of the imaginary axis. By Theorem 
33.5.1, Z is the unique geodesic through any two points of Z. 

Let fd € Isom(H’). Then ¢(Z) is a geodesic (33.2.5), so by Exercise 33.7, there 
exists g € PSL2(R) such that g¢ fixes Z pointwise. Replacing ¢ by g¢, we may suppose 
ġ fixes Z pointwise. 

Let z=x+iy € H? and z’ = x’ +iy’ = A(z). For all t > 0, 


plz it) = p(Gz, Pit) = plz’, it). 
Plugging this into the formula (33.5.3) for the distance, we obtain 
+O- =? + -1))y. 


Dividing both sides by £? and taking the limit as t — œ, we find that y = y’, and 
consequently that x? = x’? and x = +x’. The choice of sign + varies continuously over 
the connected set H? and so must be constant. Therefore ġ(z) = zor (z) = —Z for all 
z € HŽ. The latter generates PGL2(R) over PSL2(R) (33.3.10), and both statements in 
the theorem follow. Oo 
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33.6 Hyperbolic area and the Gauss—Bonnet formula 


In this section, we consider hyperbolic area. We measure hyperbolic area by considering 
a small Euclidean rectangle whose sides are parallel to the axes at the point (x, y); the 
hyperbolic length of the sides are dx/y and dy/y, and we obtain the hyperbolic area 
form from the product. 


Definition 33.6.1. We define the hyperbolic area form by 


dx dy 


dA = 
y2 


For a subset S C H?, we define the hyperbolic area of S by 


wis) = f faa 
Ss 


Proposition 33.6.2. The hyperbolic area is invariant under Isom(H?). 


when this integral is defined. 


Proof. We verify that the hyperbolic area form is invariant. We first check this for the 
orientation-reversing isometry 


g(z) = g(x tiy) = =x + iy = -Z; 


visibly d(gA) = dA in this case. 
It suffices then to consider g € PSL,(R). Let z = x + iy and let 


(2) az +b 43 
= = =urtiyv, 
BS ROE ag a 
with ad — be = 1. By (33.3.9), v = ——~. We compute that 
|cz + d|? 
d 1 
5 (z) = . (33.6.3) 


dz (cz +d) 


Now g is holomorphic; so by the a equations, its Jacobian is given by 


Olu, v)| _ nic ‘i 
A(x, y)| ~ lez + = 
therefore 
dudy  d(u,v) dxdy 1 lez + d|* 
d(gA) = = = dxd dA. 
(84) y2 O(x,y) v2 Icz+d|* y? r= O 


A major role will be played in what follows by hyperbolic polygons, defined 
formally as follows. 


33.6.4. Let z, z’ € H?” be distinct. Then there is a unique geodesic in H? whose closure 
in H** passes through z, z’, and the segment from z to z’ is denoted [z, z’]. 
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Definition 33.6.5. A hyperbolic polygon is a connected, closed subset of H™ whose 
boundary consists of finitely many geodesic sides of the form [z, z’] with z, z’ vertices. 
A hyperbolic triangle is a hyperbolic polygon with three sides. 


Definition 33.6.6. A subset A C H? is convex if the geodesic segment between any 
two points in A lies inside A. 


An example of a (convex) hyperbolic polygon is given in Figure 33.6.7. 


Figure 33.6.7: A hyperbolic polygon 


A hyperbolic triangle is visibly convex; for more on convexity, see Exercises 33.8- 
33.9. The following key formula expresses the hyperbolic area of a triangle in terms of 
its angles. 


Theorem 33.6.8 (Gauss—Bonnet formula). Let T be a hyperbolic triangle with angles 
a, P, y. Then 
MT) =x-(a+Br+y). 


Proof. See e.g. Katok [Kat92, Theorem 1.4.2]. o 


33.6.9. Let P be a convex hyperbolic polygon with n sides. By convexity, each side 
meets at each vertex a unique side, so P has n vertices with angles 81, .. ., 0n. Trian- 
gulating P and applying the Gauss—Bonnet formula, we conclude that 


MP) = (n— 2)m — (01 +--+ + Oy). 
Remark 33.6.10. Theorem 33.6.8 is called the Gauss—Bonnet formula because it is 


closely related to the more general formula relating curvature to Euler characteristic. 
The simplest formula of this kind is 


f KdA =2rx(X) (33.6.11) 
X 


for a compact Riemann surface X. The expression (33.6.11) is quite remarkable: it 
says that the total curvature of X is determined by its topology; if you flatten out a 
surface in one place, the curvature is forced to rise somewhere else. If instead one has 
a surface X with geodesic boundary, then the formula (33.6.11) becomes 


K dA — 0i) =2. X 
f. + Den eves 
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where 6; are the angles at the vertices. For a triangle X with constant curvature —1 
and angles a, 8, y, we have k KdA = —p(X) and y(X) = V- E +F = 1 (as for any 
polygon), so we find 

—uU(X) +37 -(a+B+y)=20 


and we recover Theorem 33.6.8. 


33.7 Unit disc and Lorentz models 


In this section, we consider two other models for the hyperbolic plane. 
First, we consider the unit disc model. 
Definition 33.7.1. The hyperbolic unit disc is the (open) unit disc 
D? ={weC:|wi<]} 
equipped with the hyperbolic metric 


_ 2idw] 
—1-|w 


The hyperbolic unit disc D? is also called the Poincaré model of planar hyperbolic 
geometry. The circle at infinity is the boundary 
bd D? = {w € C : |w] = 1}. 
33.7.2. For all zo € HŽ, the maps 
Pa : H? > D? pz :D? S H? 
Z — zo ZOW — Zo (33.7.3) 


w ez= 
Z— Zo w-l1 


define a conformal equivalence between H? and D? with zo + (zo) = 0. A particularly 
nice choice is zp = i, giving 


wt+l 


= wes (33.74) 
Z+1 


w-l1 

The hyperbolic metric on D? is the pushforward of (induced from) the hyperbolic 
metric on H? via the identification (33.7.4) (Exercise 33.12). Ordinarily, one would 
decorate the pushforward metric, but because we will frequently move between the 
upper half-plane and unit disc as each has its advantage, we find it notationally simpler 
to avoid this extra decoration. The distance on D? is 


[1 — ww’|+ |w —-w’| 


p(w, w’) = log = 
i= a I- 8 WA (33.7.5) 
w-w 
cosh p(w, w’) = 1 +2 
d -Iw - Iw) 
so that i 
+ 
dee e eo aie hal (33.7.6) 


1-|w| 
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The map ¢ (33.7.3) maps the geodesics in H? to geodesics in D?, and as a Möbius 
transformation, maps circles and lines to circles and lines, preserves angles, and maps 
the real axis to the unit circle; therefore the geodesics in D? are diameters through the 
origin and semicircles orthogonal to the unit circle, as in Figure 33.7.7. 


Figure 33.7.7: Hyperbolic geodesics 


Accordingly, triangles from the upper half-plane map to triangles in the unit disc. 


33.7.8. Via the map ¢, the group PSL»(R) acts on D? as 


b 
$ PSL2(R)¢ ! = PSU(1, 1) = (( n) € M2(©) : la? - |b}? = i H=1}. 
Explicitly, an isometry of D? is a map of the form 


1 (W-a 
wr el? ( — ) 
1-aw 


for a € C with |a| < 1 and 6 € R. (A direct substitution can be used to give an alternate 
verification that these transformations are isometries of D with the hyperbolic metric.) 
The orientation reversing isometry g(z) = —Z on H? acts by g(w) = w on D? with 
the choice p = i (Exercise 33.13). 
The induced area on D? is given by 
4dxd 
Aig ees 
(1 — x2 -= y2)2 
for w = x + yi. 
Second, we present the Lorentz model. 
Definition 33.7.9. The Lorentz metric on R? is the indefinite metric 
ds? = -dt? + dx? + dy’. 
33.7.10. The indefinite Lorentz metric is associated to the quadratic form 
q(t, x, y) = t tty 


in the natural way. Lengths in this metric can be positive or nonpositive. However, on 
the hyperboloid 
-x-y =1, 
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the metric becomes positive definite: a nonzero tangent vector to the hyperboloid has 
positive length (Exercise 33.16). The hyperboloid can be thought of as the sphere of 
radius i about the origin with respect to q, taking an imaginary radius shows that 
hyperbolic geometry is dual in some sense to spherical geometry, where S? c R? has 
real radius 1. 


Definition 33.7.11. The Lorentz hyperboloid is the set 
L? = {(t,x,y) ER? : q(t, x, y) = —1,t > 0} 


equipped with the Lorentz metric. 


The Lorentz hyperboloid is the upper sheet of the (two-sheeted) hyperboloid; it 
is also called the hyperboloid model or the Lorentz model of planar hyperbolic 
geometry, and it can be visualized as in Figure 33.7.13. (The choice of signs has to do 
with the physics of spacetime.) 

The map 

L? > D’ 


; (33.7.12) 
(t,x, y) = (x +iy)/(t +1) 


is bijective and identifies the metrics on L? and D? (Exercise 33.15). Moreover, the 
map (33.7.12) maps geodesics in D? to intersections of the hyperboloid with planes 
through the origin. 


x 


N 


cone 
=r? +y? 


Figure 33.7.13: The hyperboloid model L? 


By pullback, Isom*(L”) ~ PSL2(R). However, other isometries are also apparent: 
a linear change of variables that preserves the quadratic form q also preserves the 
Lorentz metric. Let 


-1 0 0 
O(2, 1) = {g € GL3(R): g'mg =m}, wherem=| 0 1 OJ. 
0 0 1 


Then 
ds? = vmv, where v = (dt, dx, dy)’, 
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so if g € O(2, 1) then immediately dgs? = ds?. 
Next, if g € O(2, 1), then g’mg = m implies det(g) = +1. The elements of O(2, 1) 
that map the hyperboloid to itself comprise the subgroup 
SO(2, 1) = {g € OQ, 1) : det(g) = 1}; 
let SO*(2, 1) < SO(2, 1) be the further subgroup that maps the upper sheet of the 
hyperboloid to itself, the connected component of the identity. 


Remark 33.7.14. We have proven that there is an isomorphism of Lie groups 
SO*(2, 1) = PSL2(R); (33.7.15) 


it corresponds to the isomorphism of Lie algebras $02 = $l. 


33.8 Riemannian geometry 


The hyperbolic metric (33.3.4) is induced from a Riemannian metric as follows. 


33.8.1. A Riemannian metric ds? on an open set U C R” is a function that assigns to 
each point x € U a (symmetric, positive definite) inner product on the tangent space 
T,(U) at x € U, varying differentiably. Such an inner product defines the length of 
a tangent vector || ||, the angle between two tangent vectors, and the length element 
ds = Vds2. In coordinates, we write 


ds? = ` Nij dx; dx; 
i,j 
for standard coordinates x; on R”, and the matrix (77;;) is symmetric, positive definite, 
and differentiable. The metric determines a volume formula as 


dV = ydet ņ dx, --- dxn. 


A Riemannian metric gives U C R” the structure of a path metric space, as 
explained in 33.2.6: if v: [0, 1] — U is continuously differentiable, then we define its 


length to be 
1 
w= fas= f lv Oll dt. 
v 0 


If ¢: RE — R” is continuously differentiable, the pullback metric ¢*(ds7) is 
defined by 
$*(ds°)(v, w) = d? D fO), Df Ow) 


where v, w € T,(U) and D is the derivative map. 


The language of 33.8.1 gives another way to interpret the hyperbolic metric on H?. 
This point of view extends to provide a description of the full isometry group PSL2(R) 
as the unit tangent bundle of H?, as follows. 


33.8.2. The tangent space to H? at a point z € H? is T,H* = C and the tangent bundle 
T(H2) := {(z v) : z € HŽ, v € T-H?} 
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is trivial (parallelizable), with T(H?) = H? xC. The Riemannian metric on H? is then 
defined by the metric on T; H? given by 

o vw 

~ (imz? 


(v, w) 


for v,w € T}(H°), a rescaling of the usual inner product on C over R. In particular, 
\[v|| = |v|/dim z) for v € T;(H°). The angle between two geodesics at an intersection 
point z € T,H? is then defined to be the angle between their tangent vectors in T, H?; 
this notion of an angle coincides with the Euclidean angle measure. 

The action of PSL2(R) on H? extends to an action on T(H”) in the expected way: 


dg(z) az+b 1 
vj = ; vj. 
dz cz +d’ (cz +d}? 


(ZV) = [e 


Since isometries of H? are differentiable, they act on the tangent bundle by differentials 
preserving the norm and angle, and therefore Isom(H?) acts conformally or anti- 
conformally on H?. 

If we restrict to the unit tangent bundle 


UT(H?) := {(z, v) € T(H”) : lvl? = 1} 
then we obtain a bijection 


PSL2(R) ~ UT(H?) 
d 
gH [e oyi) 
(Exercise 33.17). 


Remark 33.8.3. The natural generalization of Euclid’s geometry is performed on a 
Riemannian manifold X that is homogeneous, i.e., the isometry group Isom(X) acts 
transitively on X, as well as isotropic, i.e., Isom(X) acts transitively on frames (a basis 
of orthonormal tangent vectors) at a point. In this way, homogeneous says that every 
point “looks the same”, and isotropic says that the geometry “looks the same in every 
direction” at a point. Taken together, these natural conditions are quite strong, and 
there are only three essentially distinct simply connected homogeneous and isotropic 
geometries in any dimension, corresponding to constant sectional curvatures zero, 
positive, or negative: these are Euclidean, spherical, and hyperbolic geometry, respec- 
tively. Put this way, the hyperbolic plane is the unique complete, simply connected 
Riemann surface with constant sectional curvature —1. For more on geometries in this 
sense, we encourage the reader to consult Thurston [Thu97]. 


To conclude this section, we briefly review a few facts from the theory of Rieman- 
nian manifolds. 


33.8.4. A (topological) n-manifold is a (second-countable) Hausdorff topological 
space X locally homeomorphic to R”, i.e., for every x € X, there exists an open 
neighborhood U > x and a continuous map ¢: U —> R” that is a homeomorphism 
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onto an open subset; the map ¢: U > (U) € R” is called a chart (at x € X), and an 
open cover of charts is called an atlas. 
Two charts ¢,: U; —> R” and ¢2: U2 — R” are (C~-)smoothly compatible if the 
transition map 
$12 = $26)! : 1(U1 N Ur) > $2(U1 N U2) 
(see Figure 33.8.5) is C°-smooth (i.e., has continuous partial derivatives of all orders). 


An atlas on a manifold is smooth on a manifold if all charts are smoothly compatible. 
A smooth manifold is a manifold equipped with a smooth atlas. 


2 (U2) 
R” R” 
Figure 33.8.5: A manifold, by its atlas 


A morphism of smooth manifolds is a continuous map f: Y — X such that for 
the atlases {(¢;, U;)}; of X and {(W;, V;)}; of Y, each map 


pift: WIV A FUD) > bi FV) NA Ui) 


is smooth. An isomorphism (diffeomorphism) of smooth manifolds is a bijective 
morphism f: Y æ X such that f and f~! are (C®-)smooth. 

By the same definition as 33.8.1, we define a Riemannian metric on a smooth 
n-manifold. 


One could similarly define C-smooth manifolds for any 1 < k < œ. 


33.8.6. We similarly define a complex n-manifold, and morphisms between them, by 
replacing R by C and smooth by holomorphic in the definition of a smooth manifold. A 
Riemann surface is a complex 1-manifold. For further reference, see e.g. Donaldson 
[Don2011] or Miranda [Mir95]. 

A complex 1-manifold (Riemann surface) defines a smooth, orientable Rieman- 
nian 2-manifold by choosing the standard Euclidean metric on the complex plane; 
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conversely, a conformal structure on a smooth, oriented Riemannian 2-manifold deter- 
mines a complex |-manifold. In other words, the category of Riemann surfaces is 
equivalent to the category of smooth, orientable Riemannian 2-manifolds with confor- 
mal transition maps and with conformal morphisms. 


Example 33.8.7. The field C of complex numbers is the “original” Riemann surface, 
and every open subset of C is a Riemann surface. 

The simplest nonplanar example of a Riemann surface is the Riemann sphere 
P!(C) = CU {ov}. The atlas on P!(C) is given by the open sets 

U; = P!(C)\{co} = C and Up = P'(C)\{0} 

and charts 6; : U; — C by ¢,(z) = z and ¢2 : U2 —> C by ¢2(z) = 1/z; the map 
626; '(z) = 1/z is analytic on ¢;(U,; N U2) = Cx{0}. Topologically, the Riemann 
sphere is the one-point compactification of C, and becomes a sphere by stereographic 


projection. 


Example 33.8.8. The inverse function theorem implies that if X is a smooth projective 
algebraic variety over C, then X(C) has the canonical structure of a compact, complex 
manifold. 


Exercises 


1. Show that the hyperbolic metric has the same topology as the Euclidean metric 
in two ways. 


(a) Show directly that open balls nest: for all z € H? and all € > 0, there exist 
™1,72 > 0 such that 


p(z.w) <m > kz- w| < € = p(z,w) < m 


for all w € H?. 


(b) Show that the collection of Euclidean balls coincides with the collection of 
hyperbolic balls. /Hint: applying an isometry, reduce to the case of balls 
around i and check this directly; it is perhaps even clearer moving to the 
unit disc model. ] 


2. Check that in R”, the metric specified in (33.2.7) 


lw) = | yi? tet xl (DPdt 


has lines as geodesics. 
3. From differential geometry, the curvature of a Riemann surface with metric 


ds = [f(x y) dx? + g(x, y)dy? 
is given by the formula 
= 1 [4 [1 ove Pee ôf 
Vfg Ox Vf Ox Oy | Vg Oy 
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> 5. 


> 6. 


> 7. 


> 8. 


> 10. 


for suitably nice functions f, g. Using this formula, verify that the curvature of 
H? and D? is -1. 


. Consider C with the standard metric. Let 


Isom” (C) = {g € Isom(C) : g is holomorphic} < Isom(C). 
Exhibit an isomorphism of groups 


a 


Isom” (C) = (k i € GL2(C) : Jal = i 


and an isometry of C that is not holomorphic. /Hint: An invertible holomorphic 
map C > C is of the form z > az + b.] 


Show that for every z,z’ € H?, there exists g € PSL2(R) such that Re gz = 
Re gz’ = 0. [Hint: Work in D7. 


Show that the image of R>oi under an element of PSL2(R) is either a semicircle 
orthogonal to R or a vertical line. /Hint: Look at the endpoints. ] 


We consider the action of PSL2(R) on geodesics in H?. 


(a) Show that PSL2(R) acts transitively on the set of geodesics in H?. 


(b) Show that given z, z’ € H?, there exists g € PSL»(R) such that gz = z’ and 
such that g that maps the geodesic through z and z’ to itself. 


(c) Show that every isometry of H? that maps a geodesic to itself and fixes 
two points on this geodesic is the identity. 


Let z1, z2 € HĈ be distinct. Let 
H(z1, z2) = {z € H? : p(z, z1) < p(z z2)} 


be the locus of points as close to z; as to z2, and let L(z1, z2) = bd A(z, z2). 
Show that H(z1, z2) is a convex (Definition 33.6.6) half-plane, and that 


L(z1, z2) = {z € H? : p(z, z1) = p(z, z2)} 


is geodesic and equal to the perpendicular bisector of the geodesic segment from 
Z1 tO Zo. 


. Show that a hyperbolic polygon is convex if and only if it is the intersection of 


finitely many half-planes H(z), z2) as in Exercise 33.8. 


Show that the expression 

Iz-z’|+|z-2'| 

Iz-z'l-lz-z'l 
with z, z’ € H? is invariant under g € PSL)(R). [Hint: check this on a convenient 
set of generators. ] 
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> 11. 


> 12. 


13. 


14. 


15. 


> 16. 


> 17. 
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Show that = 
k-z'l+iz-zl _ z-z 


cosh lo — z c 
: Iz-z’|-|z-2’| 2 Im(z) Im(z’) 


for all z, z’ € H’. 


Verify that the hyperbolic metric on D? is induced from the hyperbolic metric 
on H? from the identification (33.7.4), as follows. 


(a) Show that 
21¢’(z)| 1 


1= 16)? Imz’ 
(b) Let w = (z), and using part (a) conclude that 
2\dw| _ ldz| 


1-|wl? Imz 
Show that the orientation-reversing isometry g(z) = —z induces the map 
(p8p w) = 
on D? via the conformal transformation ġ: H? > D? in (33.7.4). 
Show that the Iwasawa decomposition (Proposition 33.4.2) can be given explic- 


itly as 


2 — 
k Je pe ale 0) f ur NAK = SLAR) 


where r = Vc? + d?, s = d/r,t = cfr. 
Show that the map 
L? — D? 


x+iy 
t+1 


(t, x, y) => 
identifies the metrics on L? and D?, via pullback. 


Show that the Lorentz metric restricted to the hyperboloid is an honest (Rie- 
mannian) metric. /Hint: Show that a tangent vector v at a point p satisfies 
b(p,v) = 0, where b is the bilinear form associated to q; then show that the 
orthogonal complement to p has signature +2.] 


Show that there is a bijection PSL: (R) = UT(H’) defined by the action of g 
on a fixed base point in UT(H”). [Hint: Observe that elliptic elements rotate the 
tangent vector. ] 
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Chapter 34 


Discrete group actions 


Our ongoing goal in this part of the text is to understand quotient spaces that locally 
look like (products of) hyperbolic spaces. In order to get off the ground, here we put 
the previous two chapters in a more general context, seeking to understand nice group 
actions on topological spaces and indicating how these fit in with more general notions 
in topology. Pathologies exist! Our goal in this chapter is to provide basic context (for 
further references see 34.5.4) before turning to the central case of interest: a discrete 
group acting properly on a locally compact, Hausdorff topological space. 


34.1 Topological group actions 


Group actions will figure prominently in what follows, so we set a bit of notation. 
There are many references for topological groups, including Arhangel’skii-Tkachenko 
[AT2008, Chapter 1] and McCarty [McC2011, Chapter V]. 

Let G be a group and let X be a set. Recall that a left action of G on X is a map 


Gxx ~xX 


—— (34.1.1) 


satisfying 1x = x and (gg’)x = g(g’x) for all x € X and all g, g’ € G. A right action is 
instead a map X x G > X by (x, g) > xg satisfying x1 = x and x(g’g) = (xg’)g for 
all x € X and g, g’ € g. We will need to consider actions both on the left and the right; 
if not specified, a left action is assumed. 

We will also sometimes write GUX for an action of G on X. 


Example 34.1.2. A group G acts on itself by left multiplication, the (left) regular 
group action of G. If H < Gis asubgroup, then H also acts on G by left multiplication. 
For example, if V is an R-vector space with dimg V = n, and A C V is a (full) Z-lattice 
in V, then A ~ Z” is a group and A acts on V by translation. 

Another important and related example is the left action of G on the set of right 
cosets X = G/H again by multiplication, namely 


g(xH)=gxH forg €G and xH e€ G/H. 
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Let G act on X. The G-orbit of x € X is Gx = {gx : g € G}. The set of G-orbits 
forms the quotient set G\X = {Gx : x € X}, with a natural surjective quotient map 
m: X > G\X. 

Remark 34.1.3. We write G\X for the quotient, as G acts on the left; for a right action, 
we write X/G, etc. 


Example 34.1.4. A group G acts transitively on a nonempty set X if and only if G\X 
is a single point; in this case, we call X homogeneous under G. In particular, if H < G, 
then the action of G on G/H is transitive. 

If H < G, then the quotient set H\G is the set of left cosets of H in G. For example, 
if A = Z” < R” = V, then A\V = (R/Z)". 


For x € X, we define the stabilizer of x by Stabg(x) = {g € G : gx = x}. 


Definition 34.1.5. The action of G on X is free (and we say G acts freely on X) if 
Stabg(x) = {1} for all x € X, i.e., gx = x implies g = 1 forall x € X. 


Definition 34.1.6. Let X’, X be sets with an action of G. A map f: X’ —> X is G- 
equivariant if f(gx’) = g(f(x’)) for all x’ € X’ and g € G, i.e., the following diagram 
commutes: 

f 


X —+X 
hf 
yey 
If f: X’ — X is G-equivariant, then f induces a map 
G\X' > G\X 
Gx’ |> Gf(x’), 
well-defined by the G-equivariance of f, and the following diagram commutes: 


f 


X ——X 


|. |. (34.1.7) 
Ox’ — Gx 


Now topology enters. Let G be a topological group (Definition 12.2.1), a group 
with a topology in which the multiplication and inversion maps are continuous. Let X 
be a topological space, and let G act on X. We want to consider only those actions in 
which the topology on G and on X are compatible. 


Definition 34.1.8. The action of G on X is continuous if the map G x X —> X is 
continuous. 


Example 34.1.9. The left regular action of a group on itself is continuous—indeed, 
combined with continuity of inversion (and existence of the identity), this is the very 
definition of a topological group. 


34.1. TOPOLOGICAL GROUP ACTIONS 631 


Lemma 34.1.10. Suppose G has the discrete topology. Then an action of G on X is 
continuous if and only if for all g € G the left-multiplication map 
Ag: X 2X 


X= gx 


is continuous; and when this holds, each Ag is a homeomorphism. 


Proof. Exercise 34.2. o 


From now on, suppose G acts continuously on X; more generally, whenever G is a 
topological group acting on a topological space X, we will implicitly suppose that the 
action is continuous. 


34.1.11. The quotient G\X is equipped with the quotient topology, so that the quotient 
map m: X — G\X is continuous: a subset V C G\X is open if and only if a !(V)cX 
is open. 

The projection x is an open map, which is to say if U C X is open then 2(U) = 
GU C G\X is open: if U is open then m~! (a(U)) = Useg gU is open, so (U) is open 
by definition of the topology. 


34.1.12. If G acts continuously on X, then the topologies on G and X are related by 
this action. In particular, for all x € X, the natural map 


G->GxCcXx 

E&I gx 
is continuous (it is the restriction of the action map to G x {x}). Let K = Stabg(x). 
Then this map factors naturally as 


G/K — Gx (34.1.13) 


where we give G/K the quotient topology; then (34.1.13) then a bijective continuous 
map, a topological upgrade of the orbit—stabilizer theorem. The map (34.1.13) need not 
always be a homeomorphism (Exercise 34.5), but we will see below that it becomes a 
homeomorphism under further nice hypotheses (Exercise 34.6, Proposition 34.4. 11). 


In order to work concretely with the quotient G\X, it is convenient to choose 
representatives of each orbit as follows. We write cl for topological closure and int for 
topological interior. 


Definition 34.1.14. A fundamental set for GOX is a subset x C X such that: 


Gi) cl@nt()) = X; 
(ii) GH = X; and 
(iii) int) Nint(gH) = @ for all 1 + g € G. 
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The condition (i) ensures our basic intuition about tilings (and avoids fundamental 
sets that contain an extraneous number of isolated points); condition (ii) says that = 
tiles X; and condition (iii) shows that the tiles only overlap along the boundary, and 
there is no redundancy in the interior. If there is a fundamental set for GOX, then the 
action is faithful. 


34.1.15. Let x C X be a fundamental set for GOX. Then G induces an equivalence 
relation on x, and G\x > G\X is a bijection. 


Remark 34.1.16. In chapter 37, we place further restrictions on a fundamental set 
to ensure that they retain good properties, calling such a set a fundamental domain 
(Definition 37.1.11). 


34.2 > Summary of results 


We pause to provide a quick summary of the results in this chapter for the special case 
of discrete subgroups of PSL2(R) (proven in Theorem 34.5.1 and Proposition 34.7.2). 
The reader who is willing to accept the theorem below, and will stick to this case, can 
profitably skip the rest of this chapter. 

The group PSL2(R) has a natural topology from the metric on SL2(R) € M2(R) 
(see 34.6.1): intuitively, two matrices in PSL2(R) are close if after a choice of sign all 
of their entries are close. 


Theorem 34.2.1. Let T < PSL2(R) be a subgroup and equip F with the subspace 
topology. Then the following are equivalent: 


(i) T is discrete; 
(ii) For all z € H?, we have # Stabr(z) < œ and there exists an open neigborhood 
U > z such that yU N U + 0 implies y € Stabr(z); 
Gii) For all compact subsets K C H?, we have K N yK + 0 for only finitely many 
y €T; and 
(iv) For all z € W?, the orbit Tz C H? is discrete and # Stabp(z) < o. 


Moreover, if these equivalent conditions hold, then the quotient T\H? is Hausdorff, 
and the quotient map n: H? > T\H? is a local isometry at all points z € H? with 
Stabr(z) = {1}. 


A discrete subgroup I < PSL2(R) is called a Fuchsian group. 


34.3 Covering space and wandering actions 


Throughout the remainder of this chapter, let X be a Hausdorff topological space with 
a (continuous) action of a Hausdorff topological group G. 


Definition 34.3.1. We say the action of G on X is a covering space action if for all 
x € X, there exists an open neighborhood U > x such that gU N U = 0 forall g € G 
with g # 1. 
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34.3.2. If GOX is a covering space action, then the quotient map 7: X > G\X isa 
local homeomorphism, i.e., for every x € X, there exists an open neighborhood U > x 
such that z|y : U —> a(U) € G\X is a homeomorphism. A local homeomorphism need 
not conversely be a covering space map. 


If G acts by a covering space action, then G acts freely on X. This is too strong a 
hypothesis on the group actions we will consider in the rest of this book, so we need 
to look for something weaker. So we consider the following. 


Definition 34.3.3. We say that the action of G is wandering if for all x € X, there 
exists an open neighborhood U > x such that gU NU + 0 for only finitely many g € G. 


Example 34.3.4. If G is a finite group, then every action of G is wandering. 


34.3.5. If the action of G is wandering, then for all x € X, the orbit Gx C X is closed 
and discrete. 


Wandering actions generalize covering space actions, and can be equivalently 
characterized, as follows. 


Lemma 34.3.6. The following are equivalent: 


(i) The action of G is wandering; and 
(ii) For all x € X we have # Stabg(x) < œ, and there exists an open neighborhood 
U > x such that gU N U + 0 implies g € Stabg(x). 


If G acts freely, then these are further equivalent to: 
(iii) The action of G is by a covering space action. 


Proof. The implication (ii) > (i) is immediate; we prove the converse. Let U be a 
neighborhood of x € X such that gU N U + 0 for only finitely many g € G. We have 
# Stabg(x) < oo since g € Stabg(x) implies x € gU N U. Let 


{g€G:gUNU £@and gx # x} = {g1,..., Bn}. (34.3.7) 


Since X is Hausdorff, for all i there exist open neighborhoods V;, W; C X of x, gix, 
respectively, such that V; N W; = Ø. Replacing V; with V; N U (still with x € V; and 
V; AW; = 0), we may suppose that V; C U. Since G acts continuously, there exists an 
open neighborhood W; € X of x such that g;W; © W;. Let U; := V; A W;. Since V; CU 
we have U; C U for all i. Further, x € U; and 


Ui N giUi C UA giW; CV;,NW; =9. (34.3.8) 


We claim that U’ = f; U; has the desired property in (ii). Suppose that gU’NU’ + 0 
for some g € G. Since U; C U for all i we have U’ C U, so gU N U + Q; then by 
(34.3.7), either gx = x or g = gi for some i. We cannot have g = gj, since this would 
imply g;U’ N U’ C g,U; NU; = 0, contradicting (34.3.8). So gx = x, and g € Stabg(x) 
as claimed. 

Finally, if G acts freely, then Stabg(x) is trivial for all x, whence the equivalence 
(ii) © (iii). o 
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34.3.9. Suppose the action of G is wandering. We recall Lemma 34.3.6(ii). At a point 
x with open neighborhood U > x and finite stabilizer StabG(x), we can replace U by 
Mgestabg(x) ZU so that U 5 x is an open neighborhood on which Stabg(x) acts (see 
Exercise 34.12). Then the projection map factors as 


tly: U > Stabg(x)\U > G\X 


and the latter map Stabg(x)\U — G\X is a homeomorphism onto its image; we say 
x is a local homeomorphism modulo stabilizers. If the action of G is free, then we 
recover 34.3.2. 


Remark 34.3.10. If G has the discrete topology and the condition in Lemma 34.3.6(ii) 
holds, then some authors call the action of G properly discontinuous. This is probably 
because G is then as broken (“discontinuous”) as possible: G has the discrete topology, 
and we should be able to find neighborhoods that pull apart the action of G. (Klein 
[Kle79, p. 321] uses the term discontinuous because “points that are ‘equivalent’ with 
respect to [the group] are separated”.) This nomenclature is strange because we still 
want the action to be continuous, just by a discrete group. Adding to the potential 
confusion is the issue that different authors give different definitions of “properly 
discontinuous” depending on their purposes; most of these can be seen to be equivalent 
under the right hypotheses on the space, but not all. We avoid this term. 


It turns out that a wandering action is too weak a property in this level of generality 
for us to work with. However, it is close, and we will shortly see that it suffices with 
additional hypotheses on the space X. 


Remark 34.3.11. Let X be a topological space, and let G be a set of continuous maps 
X — X. Then there is a natural map G — X* defined by g > (gx) x. We give X* the 
compact-open topology and G the subspace topology, so a subbasis of the topology on 
G is given by 

V(K,U) ={f eG: f(K) CU} 


for K C X compact and U € X open. 

If X is Hausdorff and locally compact, then the compact-open topology on G is 
the weakest topology (smallest, fewest open sets) for which the map G x X —> X is 
continuous (also called an admissible topology on G) [McC2011, §VI, pp. 171-172]. 
Under the hypotheses of Exercise 34.4, this implies that the topology of pointwise 
convergence and the compact-open topology coincide. 


34.4 Hausdorff quotients and proper group actions 


In this section we define proper group actions; to motivate this definition, we first ask 
for conditions that imply that a quotient space is Hausdorff. Throughout this section and 
the next, let X be a Hausdorff topological space and let G be a Hausdorff topological 
group acting continuously on X. 


Lemma 34.4.1. The following are equivalent: 


G) The quotient G\X is Hausdorff; 
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(ii) If Gx # Gy € G\X, then there exist open neighborhoods U 3 x and V 3 y such 
that gU NV = 0 for all g € G; and 


Gii) The image of the action map 
GxXX>XxX 


(34.4.2) 
(g, x) > (x, gx) 


is closed. 


Proof. The implication (i) © (ii) follows directly from properties of the quotient map: 
the preimage of open neighborhoods separating Gx and Gy under the continuous 
projection map have the desired properties, and conversely the pushforward of the 
given neighborhoods under the open projection map separate Gx and Gy. 

To conclude, we prove (i) © (iii). We use the criterion that a topological space is 
Hausdorff if and only if the diagonal map has closed image. The continuous surjective 
map 2: X — G\X is open, so the same is true for 


nxn: XxX > (G\X) x(G\X). 


Therefore the diagonal G\X — (G\X) x (G\X) is closed if and only if its preimage is 
closed in X x X. But this preimage consists exactly of the orbit relation 


{(x,x’) E X x X : x’ = gx for some g € G}, 


and this is precisely the image of the action map (34.4.2). Oo 


The conditions Lemma 34.4.1(i)-(ii) can sometimes be hard to verify, so it is 
convenient to have a condition that implies Lemma 34.4. 1 (iii); this definition will seek 
to generalize the situation when G is compact. First, we make a definition. 


Definition 34.4.3. Let f: X — Y be a continuous map. 


(a) We say f : X — Y is quasi-proper if f~'(K) is compact for all compact K £ Y. 
(b) We say f is proper if f is quasi-proper and closed (the image of every closed 
subset is closed). 


Example 34.4.4. If X is compact, then every continuous map f: X — Y is proper 
because f is closed and if K C Y is compact, then K is closed, so f~'(K) c X is closed 
hence compact, since X is compact. 


Lemma 34.4.5. Suppose that Y is locally compact and Hausdorff, and let f: X — Y 
be continuous and quasi-proper. Then X is locally compact, and f is proper. 


Proof. For the first statement, cover Y with open relatively compact sets U; C K;; then 
V; = f-'(U;) is an open cover of X by relatively compact sets. 

Next, we claim that f is in fact already proper; that is to say, we show that f is 
closed. Let W C X be a closed set and consider a sequence {y,}, from f(W) with 
Yn — y. Let K be a compact neighborhood of y containing {yn}; taking a subsequence, 
we may suppose all y, € K. Let x, € f~!(y,) N W be primages. Since f is quasi- 
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proper, we have f~!(K) is compact. Suppose for a moment that f~!(K) is sequentially 
compact (for example, if X is second countable or metrizable). Then again taking 
a subsequence, we may suppose that x, — x with x € W since W is closed. By 
continuity, f(x») > f(x) = y, so f(W) is closed. To avoid the extra hypothesis that 
f—'(K) is sequentially compact, replace the sequence {yn} with a net; the argument 
proceeds identically. o 


Remark 34.4.6. There is an alternate characterization of proper maps as follows: a 
continuous map f: X — Y is proper if and only if the map f xid: X x Z > Y x Z is 
closed for every topological space Z. See 34.5.4 for more discussion. 


Partly motivated by Lemma 34.4. 1(iv), we make the following definition. 


Definition 34.4.7. The action of G on X is proper (G acts properly on X) if the action 
map 
A:GXX ~> Xxx 
(34.4.8) 
(8, x) > (x, gx) 
is proper. 
Proposition 34.4.9. If G is compact, then every (continuous) action of G on (a Haus- 
dorff space) X is proper. 


Proof. Let K C X x X be compact; then K is closed (because X is Hausdorff). Let Kı 
be the projection of K onto the first factor. Then K; is compact, and A~!(K) is a closed 
subset of the compact set G x Kj, so it is compact. This shows that the action map is 
quasi-proper. Finally, the action map is closed. We factor the map as 


GxxXxX ~>GxxxxX ~XxXx 
(g, x) = (g,x, gx) (x, gx); 


the first map is the graph of a continuous map to a Hausdorff space and is closed 
(Exercise 34.9); the second (projection) map is closed, as G is compact (by a standard 
application of the tube lemma). Therefore the composition of these maps is closed. 

o 


Example 34.4.10. If G is a finite discrete group, then G acts properly by Proposition 
34.4.9. 


Proper actions have many of the properties we need. 
Proposition 34.4.11. Let G act properly on X. Then the following are true. 


(a) G\X is Hausdorff: 
(b) The orbit Gx C X is closed for all x € X. 
(c) The natural map 


tx : G/Stabg(x) > Gx 
grexeXx 


is a homeomorphism. 
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(d) The group StabG(x) is compact for all x € X. 


Proof. For part (a), by Lemma 34.4.1, it is enough to note that by definition the image 
of the action map 4 in (34.4.8) is closed. Part (b) follows in the same way, as 


Gx = {x} x Gx = A(G x {x}. 


This also implies part (c) (cf. 34.1.12): the map tx is bijective and continuous, and 
it is also closed (whence a homeomorphism) since tx is a factor of the closed map 
G > Gx. 

Finally, for part (d), let 4 : G x X — X x X be the action map and let x € X. Then 
by definition that AT} (x, x) = Stabg (x) x {x} = Stabg(x), so by definition Stabg(x) is 
compact. o 


34.5 Proper actions on a locally compact space 


When X is locally compact, our central case of interest, there are several equivalent 
characterizations of a proper discrete action G U X. For more about proper group 
actions and covering spaces, see Lee [Lee2011, Chapter 12]. 

Recall our running assumption that X and G are Hausdorff. 


Theorem 34.5.1. Suppose that X is locally compact and let G act (continuously) on 
X. Then the following are equivalent: 


G) G is discrete and acts properly on X; 
(ii) For all compact subsets K C X, we have K N gK + 0 for only finitely many 
gEG; 
(iii) For all compact subsets K,L © X, we have K N gL + 0 for only finitely many 
g € G; and 
(iv) For all x,y € X, there exist open neighborhoods U > x and V > y such that 
U N gV + 0 for only finitely many g € G. 


Moreover, if X is a locally compact metric space and G acts by isometries, then these 
are further equivalent to: 


(v) The action of G on X is wandering; and 
(vi) For all x € X, the orbit Gx C X is discrete and # Stabg(x) < œ. 


Proof. First, we show (i) > (ii). Leta: Gx X — X xX be the action map. Let K € X 
be compact. Then 


AK x K) ={(g,x)€ GX X: x € K, gx € K} 


is compact by definition. The projection of A~'(K x K) onto G is compact, and since 
G is discrete, this projection is finite and includes all g € G such that K N gK + 0. 

Next we show (ii) © (iii): The implication (ii) & (iii) is immediate, and conversely 
we apply (ii) to the compact set K U L to conclude 


KNglLC(KUL)Ng(KUL) #90 
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for only finitely many g € G. 

Next we show (ii) = (iv). For all x € X, since X is locally compact there is a 
compact neighborhood K 2 U > x, with U open and K compact. If U N gU + 0 then 
K N gK + 0 and this happens for only finitely many g € G. 

Finally, we show (iv) > (i). We first show that the action map is quasi-proper, and 
conclude that it is proper by Lemma 34.4.5. Let K C X x X be compact. By (iv), for 
all (x, y) € K, there exist open neighborhoods U > x and V > y such that the set 


W={geG: gUNV #90} 


is finite. The set U x V > (x, y) is an open neighborhood of (x, y) € K, and so the 
collection of these neighborhoods ranging over (x, y) € K is an open cover of K, so 
finitely many U; x Vi > (x;, yi) suffice, and with corresponding sets #W; < ov. Let 
W =U; Wi € G. Let Kk; C X be the projection of K onto the first coordinate. We claim 
that A-'(K) C Wx K\. Indeed, if A(g, x) = (x, gx) € K then x € K; and (x, gx) € U; XV; 
for some i, so gx € gU; N V; and g € W;, and thus (g,x) € W x Ky. Since #W < oo 
and K, is compact, W x Kı is compact; and then since K is compact, K is closed so 
A1(K) c W x K; is also closed, hence compact. 

To conclude that G is discrete, we argue as follows. For all x € X, the orbit Gx € X 
is discrete: taking U = V and a neighborhood U > x with UN gU + 0 for only finitely 
many g € G, we see that U N Gx is finite so Gx is discrete (as X is Hausdorff). By 
Proposition 34.4.11(d), the map 


G/ Stabg(x) > Gx 


is a homeomorphism for all x € X. Therefore, Stabg (x) (the preimage of x) is an open, 
finite (Hausdorff) neighborhood of 1; but then Stabg(x) is discrete, and transporting 
we conclude that the topological group G has an open cover by discrete sets, and thus 
G is discrete. This completes the equivalence (i)—(iv). 

The implication (iv) = (v) holds in all cases: taking x = y, the neighborhood UNV 
is as required in the definition of a wandering action. The implication (v) > (vi) also 
holds in all cases from 34.3.5 and Lemma 34.3.6. 

To conclude, we show (vi) > (ii) under the extra hypothesis that X is a metric 
space with G acting by isometries. Assume for purposes of contradiction that there 
exist infinitely many gn € G such that K N g,K + 0, and accordingly let xn € K with 
8nXn € K. The points x, accumulate in K, so we may suppose x, — x € K; by taking 
a further subsequence, we may suppose also that g,x, —> y € K. We then claim that 
the set {g,x}, accumulates near y. Since #Stabg(x) < oo, we may suppose without 
loss of generality that the points g„x are all distinct. Then, given e€ > 0, 


E€ E€ 
Pl8EnxX, Y) < Pl8EnX, 8nXn) + Pl8nXn, Y) = P(x, Xn) + Pl8&nXn, Y) < a 45-8 


for n sufficiently large, so gnx —> y. Let An = Soir &n € G. By the Cauchy criterion, 
pP(hnx, x) = d(8gnx, n+1X) <E 


for n sufficiently large. Since h,x # x for all n, this contradicts that the orbit Gx is 
discrete, having no limit points. m 
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Remark 34.5.2. The hypothesis “X is a metric space with G acting by isometries” 
providing the equivalent condition Theorem 34.5.1(v) is necessary: see Exercise 34.11. 


34.5.3. From Lemma 34.3.6 and the implication Theorem 34.5.1(v) => (i), we see that 
proper actions generalize covering space actions when X is locally compact metric 
space and G acts by isometries. In fact, a more general statement is true: if G is a 
discrete group with a covering space action on X such that G\X is Hausdorff, then G 
acts properly on X. The (slightly involved) proof in general is requested in Exercise 
34.16. 


Remark 34.5.4. Bourbaki discusses proper maps [Bou60, Chapter I, §10] and more 
generally groups acting properly on topological spaces [Bou60, Chapter II, §§1,4]; 
the definition of proper is equivalent to ours as follows. Let f : X — Y be continuous, 
and say f is Bourbaki proper to mean that f x id: X x Z — Y x Z is closed for 
every topological space Z. If f is Bourbaki proper, then f is proper [Bou60, Chapter 
I, $10, Proposition 6]. In the other direction, if f is proper then f is closed and f~!(y) 
is compact for all y € Y, and this implies that f is Bourbaki proper [Bou60, Chapter I, 
§10, Theorem 1]. 


34.6 Symmetric space model 


In this section, before proceeding with our treatment of discrete group actions in our 
case of interest, we pause to give a very important way to think about hyperbolic space 
in terms of symmetric spaces. The magical formulas in hyperbolic geometry beg for 
a more conceptual explanation: what is their provenance? Although it is important for 
geometric intuition to begin with a concrete model of hyperbolic space and ask about 
its isometries directly, from this point of view it is more natural to instead start with 
the desired group and have it act on itself in a natural way. 


34.6.1. Let G = SL2(R). As a matrix group, G comes with a natural metric. The space 
M2(R) = R* has the usual structure of a metric space, with 


lgl? =a? +b? +e +d, ifg= f 4 € MD(R). 


We give SLo(R) c M2(R) the subspace metric and PSL2(R) the quotient metric. 
Intuitively, in this metric g, h € PSL2(R) are close if there exist matrices representing 
g, h (corresponding to a choice of sign) with all four entries of the matrix close in R. 


34.6.2. Recall from 34.1.12 that if G acts (continuously and) transitively on X, then 
for all x € X, the natural map g + gx gives a continuous bijection 
G/Stabg(x) > Gx = X. 


Let X = H? be the hyperbolic plane and let G = SL2(R). Then G acts transitively on 
X. The stabilizer of x = i is the subgroup K = Stabg(x) = SO(2) < SL2(R), so there 
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is a continuous bijection 


G/K = SL2(R)/ SO(2) > H? = X 


: (34.6.3) 
gK |> gi. 
From the Iwasawa decomposition (Proposition 33.4.2), it follows that 
SL2(R)/ SO(2) = NA. (34.6.4) 


In fact, the map (34.6.3) is a homeomorphism. To prove this, we observe the 
following beautiful equation: for g € SL2(R), 


llel? = 2 cosh p(i, gi). (34.6.5) 


This formula follows directly from the formula (33.5.3) for distance; the calculation is 
requested in Exercise 34.17. It follows that the map G — X is open, and thus (34.6.3) 
is a homeomorphism. In fact, by (34.6.5), if we reparametrize the metric on either 
SL2(R)/ SO(2) or H? by the appropriate factor involving the hyperbolic cosine, the 
map (34.6.3) becomes an isometry. 


We conclude this section with a view to a more general setting where the above 
situation applies. 


34.6.6. Let G be a connected, Hausdorff, locally compact topological group. We recall 
(section 29.3) that G has a Haar measure, a Borel measure u that is left-translation 
invariant (so u(gA) = u(A) for all Borel sets A C G and g € G). The Haar measure is 
unique up to scaling, with the Haar measure on G = R” the usual Lebesgue measure. 

G has a maximal compact subgroup K < G, unique up to conjugation in G, 
and the quotient X = G/K is homeomorphic to Euclidean space—in particular, X is 
contractible. 

A lattice I < G is a discrete subgroup such that u(T\G) < oo. A lattice I acts 
properly on X by left multiplication. 


Remark 34.6.7. More generally, a (globally) symmetric space is a space of the form 
G/K where G is a Lie group and K < Ga maximal compact subgroup. Alternatively, 
it can be defined as a space where every point has a neighborhood where there is an 
isometry of order 2 fixing the point. For more reading on the theory of symmetric 
spaces, and the connection to differential geometry and Lie groups, see the book by 
Helgason [Hel2001]. 


34.7 Fuchsian groups 


We now specialize to our case of interest and consider the group PSL2(R) acting by 
isometries on the geodesic space H?. A gentle introduction to the geometry of discrete 
groups is provided by Beardon [Bea95], with a particular emphasis on Fuchsian groups 
and their fundamental domains—in the notes at the end of each chapter are further 
bibliographic pointers. See also Jones—Singerman [JS87]. 


34.7. FUCHSIAN GROUPS 641 


Lemma 34.7.1. Let T < SL2(R). Then the following are equivalent: 


(i) T is discrete; 
Gi) If Yn ET and yn > 1, then yn = 1 for almost all n; and 
(iii) For all M € Ryo, the set {y ET : |lyll < M} is finite. 


Proof. The equivalence (i) © (ii) is requested in Exercise 34.15. The implication (i) 
© (iii) follows from the fact that the ball of radius M in SL2(R) is a compact subset of 
M2(R), and a subset of a compact set is finite if and only if it is discrete. Slightly more 
elaborately, a sequence of matrices with bounded norm has a subsequence where the 
entries all converge; since the determinant is continuous, the limit exists in SL2(R) so 
T is not discrete. Oo 


In particular, we find from Lemma 34.7.1 that a discrete subgroup of SL2(R) is 
countable. 


Proposition 34.7.2. Let T < PSL2(R) be a subgroup (with the subspace topology). 
Then V has a wandering action on H? if and only T is discrete. 


Proof. The implication > is a consequence of Theorem 34.5.1(v) = (i). Conversely, 
suppose that I is discrete; we show that Theorem 34.5.1(vi) holds: that for all x € X, 
the orbit Gx C X is discrete and #Stabg(x) < œ. 

First we show that the stabilizer of a point is finite. We may work in the unit disc D? 
and take the point to be w = 0 € D?, as in 33.7.8. The stabilizer of w = 0 in SUC, 1) 
is SO(2) ~ R/(2z)2Z, so its stabilizer in T is a discrete subgroup of the compact group 
SO(2) and is necessarily finite (indeed, cyclic). 

Next we show that orbits of I on H? are discrete. We apply the identity (34.6.5). This 
identity with Lemma 34.7.1 shows that the orbit Ti is discrete. But for all z € H?, there 
exists @ € PSL2(R) such that ¢(i) = z, and conjugation by ¢ induces an isomorphism 
T > IT ¢ of topological groups. Since 


plz, 82) = AÒ, gO(i)) = pli, (PH) 


applying the above argument to ¢~'T'¢ shows that the orbit Iz is discrete. This con- 
cludes the proof. 

Alternatively, here is a self-contained proof that avoids the slightly more involved 
topological machinery. We again work in the unit disc D°. First we prove (€). Since T 
is discrete, there is an €-neighborhood U > 1 with U C PSU(1, 1) such that UAT = {1}; 


therefore, if 
a b 
v= (5 Jerv 


then |b| > € or (without loss of generality) |a — 1| > e. We claim that in either case 


b 
ly(0)| = 


-| >e, 
a 
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and thus the orbit is discrete. Indeed, if |b| > e€, then since |a| < 1 anyway immediately 
|b/a| > & if |a — 1| > e then |a| < 1 — € so la? < 1 — e? and 1/ļla? > 1+’, giving 


b 


a 


2 1-]af 


aj >(l+e&)-1l=e’. 


For (=), suppose that T is not discrete; then there is a sequence y,, € I'\{1} of elements 
such that yn — 1. Therefore, for all z € H?, we have yaz — z and y,z = z for only 
finitely many n, so every neighborhood of z contains infinitely many distinct points 
Yanz: o 


With this characterization, we make the following important definition. 
Definition 34.7.3. A Fuchsian group is a discrete subgroup of PSL2(R). 


A Fuchsian group T acts by orientation-preserving isometries on H?; this action is 
proper and wandering by Theorem 34.5.1. 


34.7.4. A Fuchsian group I < PSL2(R) is elementary if there is a nonempty T- 
invariant set in H?Ubd H? that contains at most two points. Equivalently, an elementary 
group is a cyclic subgroup or a (possibly) dihedral group—in particular, an elementary 
group is virtually abelian (has a finite index, abelian subgroup). The elementary groups 
are easy to analyze, but their inclusion into theorems about more general Fuchsian 
groups can cause problems; and so in general we are only interested in non-elementary 
groups. 

Non-elementary Fuchsian groups I are categorized by the set of limit points 
LT) c bd H? of Tz with z € H?. If L(T) = bd H?, then T is said to be a Fuchsian 
group of the first kind; otherwise T is of the second kind, and L(T) is a nowhere- 
dense perfect subset of bd H?, topologically a Cantor set. We will see later that if T 
has quotient with finite hyperbolic area, then T is finitely generated of the first kind. 


34.8 Riemann uniformization and orbifolds 


Our understanding of group actions has an important consequence for the classification 
of Riemann surfaces, and we pause (again) to provide this application. 
First, we have the important structural result. 


Theorem 34.8.1 (Riemann uniformization theorem). Every (connected and) simply 
connected Riemann surface H is isomorphic to either the Riemann sphere P'(C), the 
complex plane C, or the hyperbolic plane H?. 


A consequence of Riemann uniformization is as follows. 


34.8.2 The universal cover X of a compact Riemann surface X is simply connected, so 
by the theory of covering spaces, X is a quotient X ~ T \X where T is the fundamental 
group of X, a subgroup of isometries of X acting by a covering space action. 

When X = P! (C), the only possible group I (acting freely) is trivial. When X=C, 
by classification one sees that the only Riemann surfaces of the form X = C/T are the 
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plane X = C, the punctured plane C* ~ C/{u) with u € C*, and complex tori C/A 
where A C C is a lattice with A = Z?. We will embark on a classification of these tori 
up to isomorphism by their j-invariants in section 40.1. 

All other Riemann surfaces are hyperbolic with X =H’, and so are of the form 
X =T\H? with < PSL,(R) a torsion-free Fuchsian group. 


Remark 34.8.3. Klein and Poincaré conjectured the uniformization theorem for alge- 
braic curves over C, with rigorous proofs were given by Poincaré: see Gray [Gray94]. 


Finally, before departing our topological treatment, we consider quotients of man- 
ifolds by the (continuous) action of a group. As we have seen, it is quite restrictive 
to suppose that this group action is free: we will still want to take quotients by such 
groups. Accordingly, we need to model not spaces that are locally modelled by R” but 
those that are locally modelled by the quotient of R” by a finite group. 


Definition 34.8.4 An n-orbifold X is a (second-countable) Hausdorff topological 
space that is locally homeomorphic to a quotient G\R” with G a finite group act- 
ing (continuously). An atlas for an orbifold X is the data 


(i) An open cover {U;};<7 of charts U; closed under finite intersection; and 
(ii) For each i € J, an open subset V; C R” equipped with the (continuous) action of 
a finite group G; OV;, and a homeomorphism 


$3: Ui > Gi\V; 


phism fj; : G; — G; and a G;-equivariant map Wi; : V; > V; satisfying 6; ° Yij = ¢; 
(see Figure 34.8.5). 


satisfying the atlas axiom: for all U; C U;, there exists an injective group homomor- 


Figure 34.8.5: An orbifold, by its atlas 
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Orbifolds were introduced by Thurston [Thu97, Chapter 13], who adds a wealth 
of motivation and examples; see also the surveys by Scott [Sco83, §2] and Gordon 
[Gor2012] as well as the chapter by Ratcliffe [Rat2006, Chapter 13]. 


34.8.6 We can further ask that the transition maps fij in an atlas be smooth to get 
a smooth orbifold, preserve a G;-Riemannian metric to get a Riemann orbifold, 
etc.; replacing R” by C” and smooth by holomorphic, we similarly define a com- 
plex n-orbifold, locally modelled on the quotient G\C” with G a finite group acting 
holomorphically. 


Definition 34.8.7 Let X be an n-orbifold. 


(a) A point z € X such that there exists a chart U; > z with group G; # {1} fixing z 
is called an orbifold point of X, with stabilizer group (or isotropy group) G;; 
the set of orbifold points of X is called the orbifold set of X. 

(b) If z € U; is an isolated orbifold point and its stabilizer group is cyclic, we call z 
a cone point. 


34.8.8 The prototypical example of an orbifold is the quotient of C by a finite group of 
rotations. Such a group is necessarily cyclic (as a finite subgroup of C*) of some order 
m > 2; the quotient is a cone, a fundamental set for the action being a segment with 
angle 27/m, and the fixed point is a cone point of order m. The cone is homeomorphic 
to R? but it is not isometric: away from the cone point, this space is locally isometric 
to R?, but at the cone point the angle is less than 27, so shortest paths that do not start 
or end at the cone point never go through the cone point. 


34.8.9 Let X be a manifold and let G be a finite group acting (continuously) on X 
such that action of G is wandering (Definition 34.3.3). We define an orbifold [X/G] as 
follows: by Lemma 34.3.6 and 34.3.9, we can refine an atlas of X to one consisting of 
open neighborhoods U; on which GOU; acts, and we make this into an orbifold atlas 
by taking G; = G for each i; the atlas axiom is tautologically satisfied. 

When X is smooth, complex, Riemann, etc., we ask that G act diffeomorphically, 
holomorphically, etc., to obtain an orbifold with the same properties. 


A full, suitable definition of the category of orbifolds—in particular, morphisms 
between them—is more subtle than it may seem. In this text, we will be primarily 
interested in an accessible and well-behaved class of orbifolds obtained as the quotient 
of a manifold. 


Definition 34.8.10 An orbifold is good if is of the form [X/G], i.e., it arises as the 
quotient of a manifold by a finite group. 


34.8.11 The quotient [X/G] of a Riemannian manifold X by a discrete group G of 
isometries acting properly is a good Riemann orbifold. 


Example 34.8.12 A complex 1-orbifold is good if and only if it has a branched cover 
by a Riemann surface. By Exercise 34.18, the only complex 1-orbifolds that are not 
good are the teardrop, a sphere with one cone point, and the football, a sphere with 
two cone points of different orders. 
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34.8.13 Good (topological) compact, oriented 2-orbifolds admit a classification (extend- 
ing the usual classification of surfaces by genus) up to homeomorphism by their sig- 
nature (g;e),...,ex), where g is the genus of the underlying topological surface and 
the e;,..., ex are the orders of the (necessarily cyclic) nontrivial stabilizer groups. 


34.8.14 Putting these two pieces together, now let I < PSL2(R) be a Fuchsian group. 
Then the discrete group TOH? acts properly. Then T\H? has the structure of a good 
complex 1-orbifold, by the main theorem (Theorem 34.2.1 (ii)). 


Remark 34.8.15. From certain topological points of view, especially with an eye 
towards generalizations, an orbifold is best understood as a topological groupoid (a 
point of view first noticed by Haefliger [Hae84, §4]): the objects of the category are 
elements of the disjoint union of the charts U;, and a morphism from u; € U; tou; € U; 
is the germ of a local homeomorphism that commutes with the projections. For more 
on the categorical perspective of orbifolds as groupoids, see Moerdijk [Moe2002] and 
Moerdijk—Pronk [MP1997]. 


Exercises 


Unless otherwise specified, let GOX be an action of a group G ona set X. 


1. Show that the quotient map 7: X — G\X is defined by a universal property: if 
f: X — Z is a G-equivariant map where Z has a trivial G-action, then f factors 
uniquely through 7, i.e., there exists a unique map h: G\X — Z making the 


diagram 
X now Z 
A 
i Z 
z 
G\X 
commute. 


> 2. Prove Lemma 34.1.10, in the following form. Let G be a topological group acting 
on a topological space X. 


(a) Show that if the action is continuous, then for all g € G the map X —> X 
by x > gx is continuous (therefore, a homeomorphism). 
(b) Show the converse of (a) if G is discrete. 


3. Let G be a topological group acting continuously on a topological space X. 
Show that the orbits of G are closed (Gx C X is closed for all x € X) if and only 
if G\X is T; (see Exercise 12.2). 

4. Let X be a metric space. Then Isom(X) has naturally the topology of pointwise 
convergence, as follows. There is an embedding 


Isom(X) => XŠ = I] X 
xEX 


8 +> (8(X))xex- 
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The product X* has the product topology, and so Isom(X) (and every space of 
maps from X to X) has an induced subspace topology. A basis of open sets for 
Isom(X) in this topology consists of finite intersections of open balls 


V(g; x, €) = {h € Isom(X) : p(g(x), h(x) < €). 
Equip the group G = Isom(X) with the topology of pointwise convergence. 


(a) Show that G is a topological group. 
(b) Show that G acts continuously on X. 


. Let G = Z be given the discrete topology, and let GUX = R/Z act by 


xh x+naeéR/Z forn € Z for a € R -— Q. Show that this action is free 
and continuous, and show that for all x € X the map (34.1.13) 


G/Stabg(x) = G —> Gx 
§& > gx 
is (continuous and bijective but) not a homeomorphism, giving Gx C X the 


subspace topology. 


Let G act (continuously and) transitively on X. Suppose that G, X are (Hausdorff 
and) locally compact, and suppose further that G has a countable base of open 
sets. Let x € X and let K = Stabg(x). Show that the natural map G/K —> X isa 
homeomorphism. 


. Let GUX be a free and wandering action, and let U be an open set such that 


gU OU = 0 for all g + 1. Show that the map Gx U > na '(x(U)) is a 
homeomorphism and the restriction 7: G x U — n(U) = U is a (split) covering 
map. 


. Let X be (Hausdorff and) locally compact, let x € X, and let U 5 x be an open 


neighborhood. Show that there exists an open neighborhood V > x such that 
K =cl(V) € U is compact. 


Let X,Y be (Hausdorff) topological spaces, let f: X — Y be a continuous map, 
and let 


er(f):X ~XxY 
xt (x, f(x) 


be the graph of f. Show that f is a closed map. 


One way to weaken the running hypothesis that X is Hausdorff in this chapter 
is to instead assume only that X is locally Hausdorff: every x € X has an open 
neighborhood U > x such that U is Hausdorff. 

Show that a weakened version of Lemma 34.3.6(i) > (ii) is not true with only 
the hypothesis that X is locally Hausdorff: that is, exhibit a locally Hausdorff 
topological space X with a (continuous) wandering action of a group G such 
that 7: X — G\X is not a local homeomorphism, and so Lemma 34.3.6(ii) does 
not hold. [Hint: Let X be the bug-eyed line and G = Z/2Z acting by x >œ -x 
on R* and swapping points in the doubled origin. ] 
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11. 


12. 


13. 


14. 


> 15. 


16. 


> 17. 


18. 


Let G = Z and let G =œ X = R?\{(0,0)} act by n= (x,y) = (2”x, y/2”). 
In other words, G is the group of continuous maps X — X generated by 
(x, y) > (2x, y/2). 


(a) Show that the action of G on X is free and wandering. 

(b) Show that the quotient G\X is not Hausdorff. 

(c) Let K = {(t, 1 — t) : t € [0,1]}. Then K is compact. Show that KN gK #0 
for infinitely many g € G. [So Theorem 34.5.1(v) holds but (ii) does not, 
and in particular that the action of G is not proper. Can you see this directly 
from the definition of proper?] 

Let X be a Hausdorff topological space with a continuous action of a Hausdorff 
topological group G. Suppose that the action of G is wandering. Show that for all 
x € X, there is an open neighborhood U > x such that the finite group Stabg(x) 
acts on U (i.e., gU C U for all g € StabG(x)). 
Show that a subgroup I < R” is discrete if and only if T = Zv +-+- + Zvm 
with vj,..., Vm € T linearly independent over R. As a consequence, show that 
T < R” is a lattice if and only if T is discrete with m = n. 
Exhibit an injective group homomorphism SO(n) —> SO(n + 1) and a homeo- 
morphism 

S” = SO(n + 1)/ SO(n), 


where S” = {x € R”+! : [|x|]? = 1} is the n-dimensional sphere, analogous to 
(34.6.3). 


Let G be a topological group with a countable system of fundamental open 
neighborhoods of 1 € G (for example, this holds if G is metrizable). Show that 
G is discrete if and only if whenever {g,}, is a sequence from G with g, > 1, 
then g,, = 1 for all but finitely many n. 


Let G be a discrete group with a (continuous) covering space action on a Haus- 
dorff space X such that G\X is Hausdorff. Show that G acts quasi-properly 
on X. 


Show that for g € SL2(R), 
Iligli? = 2 cosh pG, gi) 
(cf. 34.6.1). [Hint: Use the formula (33.5.2).] 


(a) Show that a compact, complex 1-orbifold is good if and only if it has a 


branched cover by a compact Riemann surface. 
(b) Use the Riemann—Hurwitz theorem to show that the only compact, complex 


1-orbifolds that are not good are the teardrop (a sphere with one cone point) 


and the football (a sphere with two cone points of different orders). 
(c) Show that every finitely generated discrete group of isometries of a simply 


connected Riemann surface with compact quotient has a torsion free sub- 
group of finite index. [Hint: find a torsion free subgroup of finite index by 
avoiding the finitely many conjugacy classes of torsion in T.] Use this to 
give another proof of (b). 
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19. Show that the stabilizer group of an orbifold point is well-defined up to group 
isomorphism, independent of the chart. 

20. The group SL2(R) acts on P!(R) = R U {ov} by linear fractional transformations. 
Show that G = SL2(Z) < SL2(R) is discrete, but G does not act properly on 
P! (R). [So discrete groups can act on locally compact spaces without necessarily 
acting properly. ] 


This chapter is licensed under the terms of the Creative Commons Attribution- 
NonCommercial 4.0 International License (http://creativecommons.org/licenses/by- 
nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and 
reproduction in any medium or format, as long as you give appropriate credit to the 
original author(s) and the source, provide a link to the Creative Commons license and 
indicate if changes were made. 

The images or other third party material in this chapter are included in the chapter’s 
Creative Commons license, unless indicated otherwise in a credit line to the material. If 
material is not included in the chapter’s Creative Commons license and your intended 
use is not permitted by statutory regulation or exceeds the permitted use, you will need 
to obtain permission directly from the copyright holder. 
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Chapter 35 


Classical modular group 


In this chapter, we introduce the classical modular group PSL2(Z) < PSL2(R), a 
discrete group acting on the upper half-plane that has received extensive study because 
of the role it plays throughout mathematics. We examine the group in detail via a 
fundamental domain and conclude with some applications to number theory. This 
chapter will serve as motivation and example for the generalizations sought later in 
this part of the text. 

There are very many references for the classical modular group, including Apos- 
tol [Apo90, Chapter 2], Diamond—Shurman [DS2005, Chapter 2], and Serre [Ser73, 
Chapter VIIJ. 


35.1 > The fundamental set 


Definition 35.1.1. The classical modular group is the subgroup of PSL2(R) defined 
by 


PSL2(Z) = h = k 4 : a,b,c,d € Z,ad — bc = 7 /{+1). 


The group PSL,(Z) acts faithfully on the upper half-plane H? by linear fractional 
transformations; equipping H? with the hyperbolic metric, this action is by orientation- 
preserving isometries. 

Since Z C R is discrete, so too is SL2(Z) © M2(Z) © M>(R) discrete and therefore 
PSL2(Z) < PSL2(R) is a Fuchsian group (Definition 34.7.3). 


35.1.2. Our first order of business is to try to understand the structure of the classical 
modular group in terms of this action. Let 


0 -l 1 1 
S:= l 0 i T= b i) € PSL2(Z). 
Then Sz = —1/z for z € H?, so S maps the unit circle {z € C : |z| = 1} to itself, fixing 
the point z = i; and Tz = z + 1 for z € H? acts by translation. We compute that S? = 1 
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(in PSL2(Z)) and 


so (ST)? = 1. 


35.1.3. In a moment, we will see that PSL2(Z) is generated by S and T, with a minimal 
set of relations given by S? = (ST)? = 1. To do so, we examine a fundamental set (cf. 
Definition 34.1.14) for the action of PSL2(Z) on H’, as follows. Let 


mt := {z € H? : |Rez| < 1/2 and |z| > 1}. 


The set x is a hyperbolic triangle with vertices at w = (—1 + V—3)/2 and -w° = 
(1 + V-3)/2 and oo. The translates of m by words in S,T tessellate the plane as in 
Figure 35.1.4. 


Figure 35.1.4: Tessellation of H? into fundamental domains for SL2(Z) 
By the Gauss—Bonnet formula 33.6.8 (or Exercise 35.1), 
m 
WM=a-2-=-. 35.1.5 
area(H{) = 7 353 ( ) 
The elements S, T act on the edges of this triangle as in Figure 35.1.6. 


Figure 35.1.6: Action of S, T on X 
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In the unit disc, the triangle X is as in Figure 35.1.7. 


: 
an 


Figure 35.1.7: x in D? 


The following three lemmas describe the relationship of the set x to T. 
Lemma 35.1.8. For all z € HŽ, there exists a word y € (S, T) such that yz € H. 


Proof. In fact, we can determine such a word algorithmically. First, we translate z so 
that |Re z| < 1/2. If |z| > 1, we are done; otherwise, if |z| < 1, then 


-1 Imz 
Im | —] = —,; > mz. (35.1.9) 
z Iz| 
We then repeat this process, obtaining a sequence of elements z = 2, Z2,... with 
Im z, < Imz2 < .... We claim that this process terminates after finitely many steps. 
Indeed, by (33.3.9) 
mz 


Im(gz) = for g = k 4 € PSL»(R), 


lez +d?’ 


and the number of c,d € Z such that |cz + d| < 1 is finite: the set Z + Zz C Cisa 
lattice, so there are only finitely many elements of bounded norm. (Alternatively, the 
orbit Iz is discrete by Theorem 34.5.1—or the direct argument given in Proposition 
34.7.2— therefore, its intersection with the compact set 


K = {z' € H? : |Re(z’)| < 1/2 and Imz < Imz’ < 1} 
is finite.) Upon termination, we have found a word y in S, T such that yz € X. o 


The procedure exhibited in the proof of Lemma 35.1.8 is called a reduction 
algorithm. 


Lemma 35.1.10. Let z,z’ € X, and suppose z € int(X) lies in the interior of 1. If 
z = yz with y ET, then y = l and z = z’. 


b 


4 e T. We have Imz’ = (Imz)/|cz + d. First 


Proof. Let z’ = yz with y = [ 
suppose that Im z’ > Im z; then 


jez + dÊ = (cRez+d) +c*(Imz) < 1. (35.1.11) 
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Since Imz > Imw = V3/2, from (35.1.11) we conclude that c? < 4/3 so |c| < 1. 
If c = 0 then ad — bc = ad = 1 soa = d = +1, and then z’ = yz = z + b, which 
immediately implies b = 0 so y = 1 as claimed. If instead |c| = 1, then the conditions 


(cRez +d}? <1-(Imz)? <1-3/4=1/4 and |Rez|< 1/2 


together imply d = 0; but then |cz + d| = |z| < 1, and since z € int() we have |z| > 1, 
a contradiction. 

If instead Imz’ < Imz, we interchange the roles of z, z’ and have strict inequality 
in (35.1.11); by the same argument and the weaker inequality |Re z| < 1/2, we then 
obtain |z| < 1, a contradiction. o 


Lemma 35.1.12. The elements S,T generate T = PSL2(Z). 


Proof. Let z = 2i € int(x). Let y € T, and let z’ = yz. By Lemma 35.1.8, there exists 
y’ a word in S,T such that y’z’ € mx. By Lemma 35.1.10, we have y’z’ = (y’y)z = z, 
so y'y = l and y = y’ € (S, T}. o 


Although we have worked in PSL2(Z) throughout, it follows from Lemma 35.1.12 
that the matrices S,T also generate SL2(Z), since S? = -1. See Exercise 35.3 for 
another proof of Lemma 35. 1.12. 


Corollary 35.1.13. The set = is a fundamental set for PSL2(Z) OH. 


Proof. The statement follows from Lemmas 35.1.8 and 35.1.10 (recalling the definition 
of fundamental set, Definition 34.1.14). o 


35.1.14. If z € x has Stabr(z) + {1}, then we claim that one of the following holds: 
(i) z = i, and Stabr (i) = (S) = Z/2Z; 
(ii) z = w, and Stabr(w) = (ST) = Z/3Z; or 


Gii) z = —w2, and Stabr(—w2) = (TS) = T Stabr(w)T-!. 


Indeed, let yz = z with y = i 4 and y + 1. Then cz? + (d — a)z — b = 0, so c #0 
and 
-d)+ VD 
= — where D = Tr(y)” — 4 € Zep. 
c 
Thus D = —4 or D = -3. In either case, since z € 4 we have Imz > v3/2, we 
must have c = +1, and replacing y « -y we may take c = 1. If D = —4, then 


Tr(y)=a+d = 0soz =a +i, anda = 0 = d and c = 1 = —b, i.e., z =i and we are in 
case (i). If the discriminant is —3, then a similar argument gives z = ((a + 1) + V=3)/2 
so a = 0, and we are in cases (ii) or (iii). 

Therefore, if y € PSL2(Z) has finite order, then y fixes a point, thus a conjugate 
of y would fix a point in X, and therefore by the above y is conjugate in PSL2(Z) to 
either S or ST. 
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Let Y = I'\H’. Gluing together the fundamental set, we obtain a homeomorphism 
Y = T\H? = P\(C)\{oo} = C. 


The orbit of the limit point œ under T is P!(Q) € bd H?; letting H” = H? u P! (Q), 
there is a homeomorphism 
X =T\H* = P!(C) 


as in Figure 35.1.15. 


Figure 35.1.15: Y as orbifold and as Riemann surface 


Away from the orbits Ti, œ% with nontrivial stabilizer, the complex structure on 
H? descends and gives the quotient Y\{Ti, Tw} the structure of a Riemann surface. By 
studying the moduli of lattices, later we will give an explicit holomorphic identification 
j:Y OC. 


35.1.16. By 34.8.11, the quotient Y has the structure of a good complex 1-orbifold, 
when we keep track of the two nontrivial stabilizers. 

Alternatively, we can also give X the structure of a compact Riemann surface as 
follows. Let zo € = U {oo}. If zo = œ, we take the chart z > e77"%. Otherwise, let 
e = #Stabr(zo) < œ, let w = (z — zo)/(z — Zo) be the local coordinate as in (33.7.3), 
and take the chart z > w° at Zo. 


Lemma 35.1.17. Every relation among S,T is obtained from S? = (STÐ = 1 after 
conjugation by T, so that T has the presentation 


T =(S,T | S? = (ST) = 1). 


Thus I = PSL(Z) is the free product of Z/2Z and Z/3Z. 


Proof. Consider a relation 6; --- 6, = 1 where 6; € {S,T, T-!} (recall S = Sv). We 
may suppose that ô; + Ong fori = 1,...,7r — 1 or else we can cancel adjacent terms. 
Define y; := 6,---6; fori = 1,...,r, so that y, = 1 and yj+1 = y;6;41 for all 
i=1,...,r—1.Let z = 2i and let 


Zi = Yizo = (61 +++ 6;)Z0. 
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We claim that z;,; and z; are in adjacent Į -translates of x: indeed, 
Vir NY E Yili H NH) +0 (35.1.18) 


is a side of both. Draw the geodesic (shortest path, across the corresponding side) 
between z;+1 and z; for each i. Because 6; + ôl there is no backtracking; taken 
together, they define a loop in the upper half-plane, since z+ = y;Zo = zo. 

We first conjugate the relation by T or T7! so 6, = S. If 6, = S as well, then 
we conjugate by S and begin again. So without loss of generality 6, = T,T~'; we 
explain the case 6, = T, the case 6, = T`! is similar. Then our relation looks like 
S62:+++6,-1T = 1, and so y, = 1 = yy-16- = yr-ıT implies z-~1 = ¥r-1Z0 = T! zo = 
2i -1. 

If zi = Yizo = zo for some 0 < i < r, then y; = 6;---6; = 1 and similarly 
Ôi+1 <+: ô, = 1, so we may argue separately with each such relation and so may suppose 
that z; # zo for alli = 1,...,r — 1. It follows that the loop intersects x only in the path 
from zo to zı and from z;-1 to zo, because any other intersection would necessarily 
have source or target Zo. 

We observe that w is in the interior of the loop, since by continuity any path from i 
to 2i — 1/2 in H? that does not intersect x must go from right to left through a highest 
value —1/2 + it with 0 < t < V3/2. 

The proof proceeds by induction on the number of points in the intersection of 
the interior of the loop with the set of vertices [w. We have shown in the previous 
paragraph that if there are no such points, then the relation is trivial. In the general 
case with relation S62 ---6,-;T = 1, expanding (STP = 1 we get T = (ST~')*S and 
substituting we obtain another relation Sô - - - ô- (ST7D?S = 1, as in Figure 35.1.19. 


Zp =2i— 1 Zo = 2 
+ 


zs » 
ôr = T 


Figure 35.1.19: Simplifying a relation 


Reading this relation as above, we see that the loop encloses one fewer point in Trw 
(it starts and ends with a backtracking step), and so the same is true for the conjugate 
relation 52 -+-5;-\(ST~!)* = 1. Cancelling any new adjacent terms and conjugating 
the relation does not change the number of enclosed interior vertices, so the result 
holds by induction. 
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(Alternatively, for a proof in the style of Lemma 35.1.10, see Exercise 35.5.) O 


Remark 35.1.20. Alperin [Alp93] uses the action on the irrational numbers to show 
directly that PSL2(Z) is the free product of Z/2Z and Z/3Z (but note the typo B(z) = 
1 — 1/z on the first page). 


35.2 Binary quadratic forms 


We pause to give an application to quadratic forms and class groups, after Gauss. An 
integral binary quadratic form, abbreviated in this section to simply form, is an 
expression 

Q(x, y) = ax? + bxy + cy” € Z[x, yl. 


We define the discriminant of a form Q to be 
disc(Q) := b? — 4ac. 


A form Q is primitive if gcd(a, b,c) = 1, and Q is positive definite if Q(x, y) > 0 
for all nonzero (x, y) € R*; after completing the square, we see that a form is positive 
definite if and only if a > 0 and disc(Q) < 0. 

For d < 0, let 


Qa := {O(x, y) = ax? + bxy + cy” : a > 0, disc(Q) = d} 


be the set of primitive, positive definite forms of discriminant d. The group I acts on 
Qua the right by change of variable: for y € T, we define (Q”)(x, y) = Q(y(x, y)*), so 


that if y = (; s) then 


(Q”)(x, y) = O(rx + sy, tx + uy). 


We verify that disc(Q”) = disc(Q) = d for y € T. We say that Q, Q’ are (I'-)equivalent 
if Q’ = Q” for some y € T = SL2(Z). 

We claim that the number of equivalence classes h(d) := #Qq/T is finite. Indeed, 
to every Q € Qu, we associate the unique root 


= dli 
o = DENA oh? 
2a 


of Q(z, 1) = 0. Then zor = y7! (z) for y € I. Therefore, by the reduction theory of 
the previous section, we can replace Q up to equivalence by a form such that zg € X. 
If we further insist that Rez < 1/2 and Rez < O if |z| = 1, then this representative is 
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unique, as in Figure 35.2.1. 


Figure 35.2.1: A choice of unique representative in X 


Thus i ij 
==- < R = —=—— 
T 


NI = 


so —a < b < a, or equivalently, 
lb] <a and (b > Q if |b| =a); 
and 
2 


=— = >21 
zol 4a? 


a 


soa < cand b > Oif equality holds. In sum, every positive definite form Q is equivalent 
to a (SL2(Z)-)reduced form satisfying 


lbb|<a<c withb>Oif|b| =aora=c. 


We now show that there are only finitely many reduced forms with given discrim- 
inant d < 0, i.e., that h(d) < œ. The inequalities |b| < a < c imply that 


\d| = 4ac — b? > 3a’, 


so a < y\d|/3 and |b| < a, so there are only finitely many possibilities for a,b; and 
then c = (b? — d)/(4a) is determined. This gives an efficient method to compute the 
set Qy/T efficiently. 

Let S = Z@Z[(d + Vd)/2] c K = Q(Vd) be the quadratic ring of discriminant 
d < 0. Let Pic(S) be the group of invertible fractional ideals of S modulo principal 
ideals. Then there is a bijection 


Qu/T © Pic(S) 


[ax? +bxy+ cy’] > [a] = e +z] 


(Exercise 35.9). In the same stroke, we have proven the finiteness of the class number 
# Pic(S) < œ. 
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35.3 Moduli of lattices 


In this section, we realize PSL2(Z)\H? as a moduli space of complex lattices. 


35.3.1. A (complex) lattice A C C is a subgroup A = Zz, + Zz2 with z1, z2 linearly 
independent over R; the elements z1, z2 are a basis for A. 


Two lattices A, A’ are homothetic if there exists u € C* such that A’ = uA, and 
we write A ~ A’. Let A = Zz, + Zz be a lattice. Then without loss of generality 
(interchanging z1, z2), we may suppose Im(z2/z;) > 0, and then we call zj, z2 an 
oriented basis. Then there is a homothety 


CELE 
Zi 


where T = z2/z1 € H?. 


Lemma 35.3.2. Let A = Z + Zt and A = Z + Zt’ be lattices with t,t’ € H*. Then 
A ~ A’ ifand only if Tt = Iv. 


Proof. Since t,t’ € HŽ, the bases 1,7 and 1,7’ are oriented. We have A = Z + 


Zt ~ Z+ Zr = AN if and only if there exists an invertible change of basis matrix 


g= É 4 € GL2(Z) and u € C* such that 


so u(at + b) = T’ and u(ct + d) = 1. Eliminating u gives equivalently 


at +b 

~ et+d' 
Therefore g € SL2(Z), and since g is well-defined as an element of IT = PSL2(Z), the 
result follows. o 


35.3.3. By Lemma 35.3.2, there is a bijection 


Y = T\H? > {Ac C lattice}/ ~ 


(35.3.4) 
Tr > [Z + Zr]; 


that is to say, Y = T\H? parametrizes complex lattices up to homothety. 


To a lattice A, we associate the complex torus C/A (of rank 1); two such tori C/A 
and C/A’ are isomorphic as Riemann surfaces if and only if A ~ A’. Therefore, the 
space Y also parametrizes complex tori. 


We return to this interpretation in section 40.1. 
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35.4 Congruence subgroups 


The finite-index subgroups of PSL2(Z) play a central role, and of particular importance 
are those subgroups defined by congruence conditions on the entries. 


Definition 35.4.1. Let N € Z>1. The full congruence subgroup I'(V) < PSL2(Z) of 
level N is 


T(N) := ker(PSL2(Z) > PSL2(Z/NZ)) 


= ty € PSL,(Z): y = k a =+ E i (mod wy} ; 


c 
To avoid confusion, from now on we will now write [(1) = PSL2(Z). 


35.4.2. By strong approximation for SL2(Z) (Theorem 28.2.6), the map SL2(Z) —> 
SL2(Z/NZ) is surjective for all N > 1, so there is an exact sequence 


1 > F(N) > Td) > PSL2(Z/NZ) > 1. 


Definition 35.4.3. A subgroup F < T(1) is a congruence subgroup if I > T(N) for 
some N > 1; if so, the minimal such N is called the level of T. 


Remark 35.4.4. Noncongruence subgroups (finite-index subgroups not containing 
T(N) for any N > 1) also play a role in the structure of the group SL2(Z): see 
the recent survey by Li-Long [LL2012] and the references therein. 


35.4.5. In addition to the congruence groups I (N) themselves, we will make use of 
two other important congruence subgroups for N > 1: 


To(N) := ly = i 4 € PSL»(Z) : c = 0 (mod wh 


* 


= ly € PSL(Z):y=+ h ') (mod wh (35.4.6) 


T(N) := ly € PSL(Z): y = + f i (mod m) 


Visibly, T(N) < Tı(N) < To(N). We accordingly write 
Yo(N) := To(N)\H? 


35.4.7 
Xo(N) := To(N)\H™* 


where H** := H? U P!(Q), and similarly Y, (N) and Y(N). 


In the remainder of this section, we consider as an extended example the case 
N = 2. We can equally well write 


r(2) = f. (‘ d € PSL2(Z) : b = c = 0 (mod 2| (35.4.8) 
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From 35.4.2, 
T(1)/TQ) = PSL2(Z/2Z) = GL2(F2) = $3 (35.4.9) 


the nonabelian group of order 6, so in particular [[(1) : r(2)] = 6. 
We can uncover the structure of the group I(2) in a manner similar to what we did 
for (1) in section 35.1—the details are requested in Exercise 35.10. The group I(2) 


is generated by 
1. 2 d 1 0 
o ah 1 


which act on H? by z > z + 2 and z+ z/(2z + 1), respectively, and a fundamental set 
X is given in Figure 35.4.10. 


Figure 35.4.10: A fundamental set for [(2) O H? 


(In fact, later we will see from more general structural results that I(2) is freely 
generated by these two elements, so it is isomorphic to the free group on two generators.) 

The action (2) © H? is free: by 35.1.14, if yz = z with z € H? and y € [(2) < 
T(1), then y is conjugate in (1) to either S, ST; but [(2) <T(1) is normal, so without 
loss of generality either $ = h al or ST = i 7 belongs to (2), a contradic- 
tion. 

Let Y(2) :=T (2)\H?. Then gluing together the fundamental set, there is a homeo- 
morphism 

Y(2) = P'(C)\{0, 1, œ}. 


The limit points of x in bd H? are the points —1,0,1,co and the points —1,1 are 
identified in the quotient (by translation). The orbit of these points under I(2) is 
P!(Q) € bd H?, so letting H** = H? U P! (Q), there is a homeomorphism 


X(2) :=TQ)\H* = P! (C). (35.4.11) 
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We have a natural holomorphic projection map 
X(2) = T(2)\H** > X(1) = T(1)\H”; (35.4.12) 
via (35.4.9), the group GL2(F2) acts on X(2) by automorphisms: 
GL2(F2) OX) > X(2) 
yT(2)z) = r(2)yz 


where y € T(1) is a lift, so the map (35.4.12) is obtained as the quotient by GL2(F2). 


Finally, the congruence conditions (35.4.6) imply that To(2) = T1 (2) has index 2 in 
T(2), with the quotient generated by T, and we obtain a fundamental set by identifying 
the two ideal triangles in x above. 


Exercises 


1. Prove that Y(1) = SL2(Z)\H* has area(Y(1)) = 2/3 by direct integration (veri- 
fying the Gauss—Bonnet formula). 


1 0 
i 1} 
by two parabolic elements (of infinite order), just as it is generated by elements 
of order two and three. ] 


2. Show that PSL2(Z) is generated by T and U = [So PSL2(Z) is generated 


3. Prove Lemma 35.1.12 using Lemma 28.3.3 (elementary matrices). 


4. In this exercise, we link the fact that PSL2(Z) is generated by S,T to a kind of 
continued fraction via the Euclidean algorithm. [So the reduction algorithm is a 
way to visualize the Euclidean algorithm.] Let a,b € Zs, witha > b. 


(a) Show that there exist unique q,r € Z such that a = qb -r and q > 2 and 
O<r<b. 
From (a), define inductively rọ = a, r, = b, and r;-1 = qiri — ri+1 With 
0 < ri+1 < r;; we then have r1 > r >- >rt > T1 = 0 for some t > 0. 
b) Show that gcd(a, b) = r;, and if gcd(a, b) = 1 then 


a 1 
z =e 


qt 
Such a continued fraction is called a negative-regular or Hirzebruch— 
Jung continued fraction. [The Hirzebruch—Jung continued fraction plays 
a role in the resolution of singularities [Jun08, Hir53].] 
c) Show (by induction) that 
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For all q € Z, write 


0 1 
B j) ESD EPs 


as a word in S,T, and interpret the action of this matrix in terms of the 
reduction algorithm to the fundamental set x for PSL2(Z). 


d) Let A = 2 g € SL2(Z). Show that gcd(a, c) = 1 and conclude from (c) 
that there exists W € (S, T} such that 
1 y 
wa-(} °) 


with b’ € Z. Conclude that (S,7) = PSL2(Z). (So how, in the end, does 
this procedure to write A in terms of S and T relate to the one given by the 
reduction algorithm in Lemma 35.1.8?) 


5. In this exercise, we give a “matrix proof” that a complete set of relations satisfied 
by S, T in PSL2(Z) are $? = (STP = 1. 
(a) Show that it suffices to show that no word S(ST)*! S(ST)@ ... S(ST)® with 
e; = 1,2 is equal to 1. 
(b) Observe that S(ST) = T and S(ST)? have at least one off-diagonal entry 
nonzero and can be represented with a matrix whose entries all have the 
same sign. 


(c) Show that if A = : has at least one off-diagonal entry nonzero and 


d 
all entries of the same sign, then these properties hold also for both S(ST)A 
and S(ST)*A. Conclude that (a) holds. 


[This argument is given by Fine [Fin89, Theorem 3.2.1].] 
6. Show that the commutator subgroup I” <I = PSL2(Z) (the subgroup generated 
by commutators yoy~!5~! for y, 6 € T) has index 6 and T/T” = Z/6Z. 
7. Compute the class number /(d) and the set of reduced (positive definite) binary 
quadratic forms of discriminant d = —71. 
8. Let Q4 be the set of primitive, positive definite binary quadratic forms of dis- 
criminant d < 0 and let Q = Ug Qu. 
(a) Show that the group GL2(Z) acts naturally on Q by change of variables, 
with PGL2(Z) acting faithfully. 
(b) Consider the action of PGL2(Z) on H?. Show that every Q € Qa is equiv- 
alent to a GL2(Z)-reduced form ax? + bxy + cy? satisfying 


O<b<a<c. 


[Hint: Find a nice fundamental set 1 for PGL2(Z).] 
(c) By transport, 35.1.14 computes the stabilizer of PSL2(Z) on Q € Qa. 
Compute StabpoL,(z)(Q) for Q € Qa. 
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>9. Let 
S = Z@Z[(d + Vd)/2] c K = Q(Vd) 


be the quadratic ring of discriminant d < 0. Let Pic(S) be the group of invertible 
fractional ideals of S modulo principal ideals. Show that the map 


Q,4/T > Pic(S) 


[ax? + bxy + cy] => [a] = 5 


pad 


is a bijection, where Q,/T is the set of (SL2(Z)-)equivalence classes of (primi- 
tive, positive definite) binary quadratic forms of discriminant d. 


oi 


generate I(2) using the fundamental set 


> 10. Show that the elements 


H = {z € H? : |Rez| < 1,|2z+1]> 1}. 


[Hint: adapt the method used in section 35.1.] 
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Chapter 36 


Hyperbolic space 


In this chapter, we extend the notions introduced for the hyperbolic plane to hyperbolic 
space in three dimensions; we follow essentially the same outline, and so our exposition 
is similarly brief. 


36.1 Hyperbolic space 


A general, encyclopedic reference for hyperbolic geometry is the book by Ratcliffe 
[Rat2006]. For further reference, see also Elstrodt-Grunewald—Mennicke [EGM98, 
Chapter 1], Iversen [Ive92, Chapter VIIJ, and Marden [Mard2007]. 


Definition 36.1.1. The upper half-space is the set 
H? := C x Ryo = k(x, y) = (41 + miy) € CXR: y > O}. 


Hyperbolic space is the set H? equipped with the metric induced by the hyperbolic 
length element 


_ Idx]? + dy? _ dx? + dx? + dy? 


ds? : 

y? y? 
36.1.2. The space H? is the unique three-dimensional (connected and) simply con- 
nected Riemannian manifold with constant sectional curvature —1. The volume ele- 


ment corresponding to the hyperbolic length element is accordingly 
dV c= dx, dx2 dy 
y3 


36.1.3. A vertical half-plane in hyperbolic space is a set of points with y arbitrary 
and the coordinate x confined to a line in C. The hyperbolic length element restricted 
to every vertical half-plane is (equivalent to) the hyperbolic length element on the 
hyperbolic plane. Therefore, H? contains many isometrically embedded copies of H?. 


36.1.4. The sphere at infinity is the set 
bd H? = P! (C) = C U {00} 
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(analogous to the circle at infinity for H”), with the image of C corresponding to the 
locus of points with t = 0. We then define the completed upper half-space to be 


H* := H? U bd HÈ. 


The topology on H** is defined by taking a fundamental system of neighborhoods of 
the point at oo to be sets of the form 


(x, y) € H? : y > M}U {oo} 


for M > 0, and the open balls tangent to z € C together with z. 


36.1.5. The metric space H? is complete, and the topology on H? is the same as the 
topology induced by the Euclidean metric. 

The geodesicsin H? are the Euclidean hemicircles orthogonal to C and vertical half- 
lines: every two points lie in a vertical hyperbolic plane (see 36.1.3), so this statement 
can be deduced from the case of the hyperbolic plane. (Alternatively, by applying an 
element of PSL2(C) it is enough to show that the vertical axis Z = {(0, y) : y > 0} 
is a geodesic, and arguing as in (33.5.4) we obtain the result.) Accordingly, H? is a 
uniquely geodesic space. 


36.1.6. Just as in distinct points determine a geodesic, so do three distinct points 
determine a geodesic plane, the union of all geodesics through the third point and 
a point on the geodesic between the other two (the choice taken arbitrarily). In a 
geodesic plane, the geodesic between two points in the plane is contained in the plane. 
By the preceding paragraph, the geodesic planes in H? are the Euclidean hemispheres 
orthogonal to C and the vertical half-planes, as in Figure 36.1.7. 


Figure 36.1.7: Geodesic lines and planes in H? 
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36.2 Isometries 


Analogous to the case of H?, with orientation-preserving isometries given by PSL2(R) 
acting by linear fractional transformations, in this section we identify the isometries of 
hyperbolic space H? as coming similarly from PSL2(C). 


36.2.1. The group PSL(C) acts on the sphere at infinity P'(C) by linear fractional 
transformations. We extend this action to H? (with almost the same definition!) as 
follows. We identify 


H? > H=C+Cj 
(Gy) Zaxtyj 
where we recall that jx = xj for x € C = R + Ri c H. We then define the action map 


SL2(C) x H? > H? 


4 (36.2.2) 
(g,z) = gz = (az + b)(cz + d) 
for g = k 4 € SL2(C). If z = x + yj, then in coordinates (Exercise 36.3) 
b d) + acy” + yj 
aoa ea ale E (36.2.3) 


licz + dll? 


where 
lez + d|? = nrd(cz + d) = |ex + d? + |c? y’. 


Therefore the image of this map lies in H?. (Compare this formula with the action of 
SL2(R) in (33.3.8).) 
Lemma 36.2.4. The map (36.2.2) defines a group action of SL3(C) on H°. 


Proof. We define the quaternionic projective line to be the set 
P'(H) := {(a, 8) : a, B + (0,0) € HY}/ ~ 


under the equivalence relation (a, 8) ~ (ay, By) for y € H*, and we denote by 
(a : B) € P'(A) the equivalence class of (a, 8). We verify that the group SL7(C) acts 
on P! (H) by 

ç 


‘ 4) (a: B) = (aa +BB: ca + dB) 


the left action of SL2(C) commutes with the right action of HI*. The restriction of this 
action to H? —> P!(H) by zh (z: Lis 


(2:1) (az +b: cz + d) = ((az + bX(cz +d)! : 1) 


as above. o 
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We now show that PSL,(C) acts on H? by isometries. This can be verified directly 
by the formula, with some effort; we prefer to verify this on a convenient set of 
generators, and so we are first led already to the following decomposition of SL2(C) 
(cf. Proposition 33.4.2). 


36.2.5. Let 


K := SU(2) = (5 2) EMO : lal +P = i ~ H! 


i a 
TERE 


We have K = Stabs, (œ) (j): from (36.2.3), we see that gj = (aj + b)(cj + dy! if 
and only if |c|? + |d|? = 1 and a€ + bd = 0; plugging the first equation into the second, 
and using ad — bc = 1 gives a = d and then b = —<. 

Letting z = x + yj, the other elements act as: 


c ia ja) ©) = 28 + yD 
i (36.2.6) 
o |0 =6+9+. 


Lemma 36.2.7 (Iwasawa decomposition). The multiplication map gives a homeomor- 
phism 

NxAxK — SL(©). 
Proof. We apply the same method as in the proof of Proposition 33.4.2. For surjectivity, 


we let z = g(j) = x + yj, let ng = 


dg = ki a € A, SO (dgNgg)(j) = j and dgngg E€ Stabs_,(~(j) = K. oO 


( ‘i € N so that (ngg)(j) = yj; then let 


Lemma 36.2.8. The group SL2(C) is generated by the subgroups A, N, and the element 


( al which acts on H? by 


0 -1 -1 T 3,5 ; 
z)=-z7 = —-(-x + 36.2.9 
f 3} eet) (36.2.9) 
where ||z||* = nrd(z) = |x|? + y? forz = x + yj. 
Proof. The proof is identical to the one in Lemma 33.4.4. o 


Remark 36.2.10. In fact, the generators k 


9 are redundant, but we will not use 
0 1/a 


this fact here. 
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We are now ready to investigate the consequences of this decomposition for the 
geometry of hyperbolic space. 


Theorem 36.2.11. The map (36.2.2) defines a faithful, transitive action of PSL2(C) on 
H? by isometries. 


Proof. We use the generators in Lemma 36.2.8. The fact that the action is faithful 
follows directly. For transitivity, we show that H? is the orbit of j.Ifz=x+yj¢H 
1 -x 
0 1 

to the case of the hyperbolic plane. 

Next, we show that PSL2(C) —> Isom*(H?). Verification that dg(s) = ds for ga 
generator in one of the first two cases of Lemma 36.2.8 is immediate, from the definition 
of the metric; the third case can be checked directly (Exercise 36.4). Orientation is 
preserved in each case. o 


then we first apply a translation to reduce to the case z = yj and then reduce 


36.2.12. The group PSL2(C) acts transitively on geodesics and consequently on pairs 
of points at a fixed distance: by the transitive action of PSL2(C) on H?, every point can 
be mapped to j; and applying an element of Stabs, (œ j = SU(2), every other point u 
can be brought to tj with t > 1, with logt = p(j, u) by the distance in the hyperbolic 
plane. It follows that 


k-z ajy t eGo 
2yy 2yy’ 

by verifying (36.2.13) in the special case where z = j and z’ = yj with y > 0, and then 

using the preceding transitive action and the fact that the right-hand side of (36.2.13) 


is invariant under the action of SL2(C), verified again using the generators in Lemma 
36.2.8. 


cosh p(z, z’) = 1+ (36.2.13) 


Theorem 36.2.14. We have 
Isom*(H*) =~ PSL2(C) (36.2.15) 


and 


Isom(H?) ~ PSL3(C) x Z/2Z (36.2.16) 


where the nontrivial element of Z/2Z acts by complex conjugation on PSL2(C) and 
(z) + Et) on H. 


Proof. We argue as in Theorem 33.5.5. Let ¢ € Isom(H°), and let 
Z={yj:y>0} CH’. 


Then Z is a geodesic (see 36.1.5), so (Z) is also a geodesic. By transitivity, there 
exists an isometry g € PSL2(C) that maps ¢(/) back to j, and we may suppose without 
loss of generality that 6(j) = j, and arguing as in the case of H? we may suppose 
in fact that ¢ fixes each point of Z. Let H = R + Rj c H°. Then H is a geodesic 
half-plane containing Z, so ¢(H) is as well and must be a vertical half-plane. The 
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id 


: f . (e 0 i . 
isometric rotation 0 ei? fixes Z, and applying such a rotation we may suppose 


further that ¢ fixes H. 
Now let z = x + yj and ¢(z) = z’ = x’ + y'j. Letr + sj € H. Then 
Plr + sj) = PA) or + sj)) = p(z’,r + sj) 
so from (36.2.13) 


barPe Gas) karla =a. 


7 ’ 


2sy 2sy 
letting s — œ we find that y = y’ and |x -r| = |x’ — r| for all r € R, thus Re x = Re x’ 
and Imx = +Imx’. By continuity, the sign is determined uniquely by g, and we 
conclude that either g(z) = z or g(z) = x + yj, as claimed. o 


36.2.17. The isometry group PSL2(C) also admits a ‘purely geometric’ definition via 
the Poincaré extension, as follows. 

An element g € SLo(C) as a Mobius transformation, induces a biholomorphic map 
of the Riemann sphere P!(C) = CU {co}. This map can be represented as a composition 
of an even number (at most four) inversions in circles in P!(C), or circles and lines in 
C (Exercise 36.2). We have identified P!(C) = bd H? as the boundary, and for each 
circle in P!(C) there is a unique hemisphere in H? which intersects bd H? in this circle; 
if this circle is a line, then we take a vertical half-plane. We then lift the action of 
g € PSL2(C) one inversion at a time with respect to the corresponding hemisphere or 
half-plane. It turns out that the action of this product does not depend on the choice of 
the circles. 

To verify that PSL2(C) acts by isometries, we need to know that inversion in a 
hemisphere or vertical half-plane is an isometry of H; after observing that the first 
two types of generators in Lemma 36.2.8 (stretching and translating) are isometries, 
one reduces to the case of checking that inversion in the unit hemisphere, defined by 


=> pa 
Izl? 


is an isometry; and this boils down to the same calculation as requested in Exercise 
36.4. 


36.2.18. We have a similar classification of isometries of HÌ as in the case of H? as 
follows. Let g € PSL2(C). 


(i) If + Tr(g) € (—2, 2), then g is elliptic: it has two distinct fixed points in bd H? 
and fixes every point in the geodesic between them, called its axis, acting by 
(hyperbolic) rotation around its axis. 

(ii) If +Tr(g) € R\[-2, 2], then g is hyperbolic; if +Tr(g) € C \ R, then g is 
loxodromic. (Some authors combine these two cases.) In these cases, g has two 
fixed points in bd H? and the line through these two points is stabilized, and g 
has no fixed point in H°. 

(iii) Finally and otherwise, if + Tr(g) = +2, then g is parabolic: it has a unique fixed 
point in bd H° and no fixed point in H°. 
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36.3 Unit ball, Lorentz, and symmetric space models 


Definition 36.3.1. The hyperbolic unit ball is the (open) unit disc 


D? := {w = (w1, w2, w3) € RÈ : |||? < 1} (36.3.2) 
equipped with the hyperbolic metric 
2|Idw]| 
= (36.3.3) 
1 — ||w|? 
and volume 
dwidw2d 
= eee) (36.3.4) 
(1 = Ilwll) 
The sphere at infinity is the boundary 
bd D? = {w € R?: ||w]] = 1}. 
36.3.5. The maps 
¢:H > D? ¢':D? > H? 
zrewa(z— fl — jz! wre za(w+ ft jwy! 


define a conformal equivalence between H? and D? with j => (j) = 0. The hyperbolic 
metric on D? is the pushforward of (induced from) the hyperbolic metric on H? via 
the identification (36.3.5). We find that 


ilw — w’|? 
(1 = [lw = liw 


cosh p(w, w’) = 1 +2 (36.3.6) 


In the unit ball model, the geodesics are intersections of D? of Euclidean circles 
and straight lines orthogonal to the sphere at infinity, and similarly geodesic planes 
are intersections of D? with Euclidean spheres and Euclidean planes orthogonal to the 
sphere at infinity. 


36.3.7. The isometries of D? are obtained by pushforward from H°. Explicitly, we first 
identify 


D>’ oH 
; > (36.3.8) 
Wr? wi + Wl + w3J. 
We then define the involution 
*: H>H 
; i REE ; i (36.3.9) 
a =t+xi+yj+zk}> kæk ` =t+xi+yj-zk 
and the group 
SU2(H, *) = li a : œ, 8 € H, nrd(a) — nrd(8) = 7 : (36.3.10) 
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We find that 
SU2(H,*) = 6SLx(C)¢! 


with ¢ as in 36.3.5. The group SU2(H, *) acts on D? by 
gw = (aw + BY B*w+a*y!. 
36.3.11. Finally, there is the Lorentz model 
L? = {x E RÍ: t + x7 +5 +x = l, t > 0} (36.3.12) 
with 
ds? := —dt? + dx? + dx + dx3 


and orientation-preserving isometries given by the subgroup SO*(3,1) < SO(3, 1) 
of elements mapping L? to itself. The relationship between the Lorentz model and 
the upper half-space model relies on the exceptional isomorphism of Lie algebras 
503,1 = 5h, c and the double cover SL2(C) —> SO*(3, 1). 


To conclude, we find the symmetric space model of H°, analogous to section 34.6. 


36.3.13. The group G := SL2(C) has the structure of a metric space induced from the 
usual structure on Mo(C) = C4. Since G acts transitively on H°, and the stabilizer of 
jis K = SU(2), 


G/K = SL2(C)/ SU(2) > H? 
/ eae (36.3.14) 
gk > gj; 


from the Iwasawa decomposition (Lemma 36.2.7), there is a homeomorphism 
SL2(C)/ SU(2) = NA. 

From the identity 
lIgll” = 2cosh pG, gj) (36.3.15) 


for g € SL2(C), proven in the same way as (34.6.5), the map (36.3.14) is a homeo- 
morphism, and even an isometry under the explicit reparametrization (36.3.15) of the 
metric. 


Remark 36.3.16. Similar statements about the unit tangent bundle hold for PSL2(C) 
in place of PSL2(R), as in 33.8.2. 
Remark 36.3.17. More generally, one defines hyperbolic upper half-space 
|dx|? + dy? 
yo 


The space H” is a uniquely geodesic space and a model for hyperbolic n-space. The 
geodesics in H” are orthocircles, and via a conformal map. The upper half-space maps 
isometrically to the (open) unit ball model 


H” := {(x, y) € R” xR: y > 0} with ds? = 


D” := {x € R” : |x| < 1} with ds? = J 
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and the hyperboloid model 
L” := {(f, x) €R™): —1? + x? te + x? = -1, t > 0} 


with 
ds? = —dt? + dx? +--+ +dx2. 


These models (and more) are introduced and compared in Cannon—Floyd—Kenyon-— 
Parry [CFKP97], and treated in detail in the works by Benedetti—Petronio [BP92] and 
Ratcliffe [Rat2006]. 

Hyperbolic n-space H” also admits a symmetric space description, as follows. 
The group of isometries of H” is SO(n, 1), and the subgroup of orientation-preserving 
isometries is SO*(n, 1), the component of SO(n, 1) containing the identity matrix. The 
stabilizer of every point in H” is conjugate to SO(n) (rotation around the origin in the 
unit ball model, with the fixed point at the origin), and it follows that 


H” = SO*(n, 1)/ SO(n). 
36.4 Bianchi groups and Kleinian groups 


Theorem 36.4.1. Let G := PSL2(C) and let T < G be a subgroup. Then the following 
are equivalent: 


(i) T is discrete (with the subspace topology); 
(ii) For all z € H?, we have # Stabr(z) < œ and there exists an open neighborhood 
U > z such that yU N U + O implies y € Stabr(z); 
Gii) For all compact subsets K C HÌ, we have K N yK + 0 for only finitely many 
y €T; and 
(iv) For all z € H®, the orbit Tz C H? is discrete and # Stabr(z) < o. 


Moreover, if these equivalent conditions hold, then the quotient T\ H is Hausdorff, 
and the quotient map n: H — T\HÈ is a local isometry at all points z € H with 
Stabr(z) = {1}. 


Proof. Combine Theorem 34.5.1 and the appropriately modified proof of Proposition 
34.7.2. The stabilizer of a point is finite because the stabilizer of w = 0 in SU2(H,, *) 
is SU(2), so its stabilizer in T is a discrete subgroup of the compact group SU(2) thus 
is necessarily finite (not necessarily cyclic). In particular, a subgroup I < PSL2(C) is 
discrete if and only if the action of IT on H? is wandering, hence proper. o 


Definition 36.4.2. A Kleinian group is a discrete subgroup of PSL2(C). 


Let F C C be an imaginary quadratic field with ring of integers R = Zp. Since 
R C C is discrete, and PSL2(R) C PSL2(C) is discrete. 


Definition 36.4.3. The Bianchi group over F is the Kleinian group PSL2(R) € 
PSL2(C). 


Remark 36.4.4. The Bianchi groups are so named after work of Bianchi [Bial892]; 
he studied them as discrete groups acting on hyperbolic space and found generators in 
certain cases. For more, see the book by Fine [Fin89]. 


672 CHAPTER 36. HYPERBOLIC SPACE 


36.5 Hyperbolic volume 


In this section, we consider volumes of hyperbolic polyhedra, following Milnor’s 
chapter in Thurston [Thu97, Chapter 7], published also in Milnor [Milno82, Appendix]; 
see also the full treatment by Ratcliffe [Rat2006, §10.4]. 


Definition 36.5.1. The Lobachevsky function is defined to be 


£:R-R 


0 (36.5.2) 
£0) = -f log |2 sin t| dt. 
0 


The Lobachevsky function is also called Clausen’s integral or more conventionally 
the log sine integral. 


36.5.3. The first derivative of the Lobachevsky function is £L’(@) = — log |2 sin 8| by 
the fundamental theorem of calculus, so £ attains its maximum value at L(7/6) = 
0.50747... and minimum at L(57/6) = —L(7/6). The second derivative is £L” (6) = 
— cot 0. A graph of this function is sketched in Figure 36.5.4. 


N C 


Figure 36.5.4: The Lobachevsky function £ 


Lemma 36.5.5. £(6) is odd, periodic with period n, and satisfies the identity 


n-1 
L(n) =n 25 LO + jn/n) (36.5.6) 
j=0 
for alln € Z. 
Proof. Since L'(@) = —log|2 sin 6| is an even function and £(0) = 0, we conclude 


L£(6) is an odd function, i.e., L(—0) = —L(A) for all 6 € R. 
Let n € Z. From 


n-1 


z” sfe | Je p eriin) 


J=0 
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substituting z = e7?" for t € R and using |e”? — 1| = |1 — e”? | = |2 sin 8| for all 6 € R 
gives 


n-1 


n-1 
I2 sin(nt)| = |1 — e2"*| = | fe” — eiin] = | fe sin(t + jx/n)| 
j=0 j=0 


for all t € R. Integrating and changing variables x = nt gives 


1 0 n-1 O+jn/n 
1f log |2 sin x|dx => / log |2 sin x|dx (36.5.7) 
n Jo F0 jain 
which yields 
n-1 n-1 
1 
-= = 0+j - j 5. 
—L£(nd) ae + jx/n) 2, aei (36.5.8) 


for all 6 € R. Plugging in 8 = z/n into (36.5.8) yields by telescoping 


1 
zE = L) - LO) = L) 


so L(m) = 0. 
Now since L'(6) is periodic with period z and £(0) = L£(7) = 0, we conclude that 
L(0 + 2) = L(6) is also periodic with period z. Finally, 


n-1 n-1 n-1 n-1 

$ Lain) =- X}, Ljan) =- 9L- jan) = X £in/n) 

j=0 j=0 j=0 j=0 
so Da, L£(jz/n) = 0, and the result follows from (36.5.8). (m 
Corollary 36.5.9. We have 


L(20) = 2L(0) + 2L(0 + 7/2) = 2L(80) - 2L(7/2 - 0). 


Corollary 36.5.9 is called the duplication formula for £. 


Proof. Take n = 2 in Lemma 36.5.5. m 


With the relevant function having been defined, we now return to our geometric 
application. 


Definition 36.5.10. An ideal tetrahedron is a tetrahedron whose vertices lie on the 
sphere at infinity and whose edges are (infinite) geodesics. 


36.5.11. An ideal tetrahedron is determined by the three dihedral angles a, 6, y along 
the edges meeting at any vertex; the sum of these angles is 27, as the shadow triangle 
made in C has angles that sum to 27. For an illustration, see Figure 36.5.12. 
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a 


Figure 36.5.12: Ideal tetrahedra and its shadow in C 


Proposition 36.5.13. The volume of an ideal tetrahedron with dihedral angles a, p, y 
is L(a) + L(B) + L(y). 


Proof. We follow Milnor [Milno82, Appendix, Lemma 2]; see also Thurston(—Milnor) 
[Thu97, Theorem 7.2.1] and Ratcliffe [Rat2006, Theorem 10.4.10]. We may suppose 
without loss of generality that one vertex is at oo and the finite face lies on the unit 
sphere. Projecting onto the unit disc in the x-plane, we obtain a triangle inscribed in 
the unit circle with angles a, £, y with a + B + y = 2n. We make the simplifying 
assumption that all three angles are acute (the argument for the case of an obtuse angle 
is similar). We take the barycentric subdivision of the triangle and add up 6 volumes. 
We integrate the volume element dx,dx2dy/y* over the region T(@) defined by the 
inequalities 


y> V1-ļ|x, 0<x <x tana, O< x, <cosa. (36.5.14) 


Integrating with respect to y we have 


Il oe Jf- ae 
T(a) y=VI-lkP 


= dx dx 
E -5 O<x|<cos @ toa 


O<x2<x; tana 1 2 
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We substitute x; = cos 6, so dx; = — sin 6d6 and 7/2 > 0 > a; by partial fractions, we 
have 
/ adu 1 l a+u 
= -lo : 
a-u? 2 g a-u 


So witha = 4/1 — x? = sin 0, integrating with respect to x2 gives 


1 w w fo sin @dx2 
2 Je 0 sin? 6 — x2 
/2 


n cos 0 tan œ 


dO log sin 0 + x2 


— 


sin 6 — x2 20 


sin 0 cos œ + cos 0 sin œ (36.5.15) 
sin 8 cos œ — cos 0 sin œ 
2sin(0 + @) 
°F |) sin(6 — a) 


= -1 (L(n/2 + a) ~ £20) - L(r/2- a). 


dé 


Il 
Ale Ale ALE 
m 
3 
D 


Finally, we use the duplication formula (Corollary 36.5.9), which reads 
L2a) = 2L(a) + La + 7/2) - L(/2 - a); 


substituting gives the volume L£(@)/2, and summing over the other 5 triangles gives 
the result. o 


We define now a standard tetrahedron for use in computing volumes. Let Ty,,, be the 
tetrahedron with one vertex at oo and the other vertices A, B, C on the unit hemisphere 
projecting to A’, B’, C’ in C with A’ = 0 to make a Euclidean triangle with angle 7/2 
at B’ and a at A’. The dihedral angle along the ray from A to œ is a. Suppose that the 
dihedral angle along BC is y. The acute angles determine the isometry class of Ta,y, 
and we call To, the standard tetrahedron with angles a, y. For an illustration, see 
Figure 36.5.18. 


Corollary 36.5.16. We have 
1 
vol(Ta,y) = 7 (Lía +y) + Lla — y) +2L(7/2 - a)). 


Proof. One proof realizes the standard tetrahedron as a signed combination of ideal 
tetrahedra, and uses Proposition 36.5.13. A second proof just repeats the integral 
(36.5.15) to get 


1 n/2 
vol(Ta,y = z / log 
Y 


= -1 (Ll /2 + a) - Lla + 9) ~ Lin /2-- a) + LY- 0) 


2 sin(@ + œ) 


—————| dé 
2 sin(@ — a) 


(36.5.17) 


which rearranges to give the result. o 
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Figure 36.5.18: Standard tetrahedron 


36.5.19. By Exercise 36.11, we have the Fourier expansion 


1 < sin(2n0) 
0)= > i 36.5.20 
LO = 5 2, = (36.5.20) 
The series (36.5.20) converges rather slowly, but twice integrating the Laurent series 
expansion for cot 0 as in Exercise 36.12 gives 


[Bar| Lea. 
4n (2n+ 1)! 


L£(0) = 0 | 1 — log|26| + > 


where 


5o k 2 4 
1 1 
x =) Bo ae x lx 4 
e* -1 TE k! 2 62! 304! 


so |B2| = 1/6, |B4| = 1/30, etc. are the Bernoulli numbers. 


36.6 Picard modular group 


In this section, analogous to the case of the classical modular group PSL2(Z) we 
consider the special case of a full Bianchi group with K = Q(i). 
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Definition 36.6.1. The group PSL2(Z[i]) is called the (full) Picard modular group. 


Throughout this section, we write T = PSL»2(Z[i]). In order to understand the 
structure of the group T, we follow the same script as in section 35.1, and first we seek 
a fundamental set. 


Proposition 36.6.2. Let 
m={z=x+yjeH?: |z? > 1, |Rex| < 1/2, 0<Imx < 1/2}. (36.6.3) 


Then X is a fundamental set for T O HÈ, and PSL2(Z[i]) is generated by the elements 


1 1) (1 i) (i 0) (0 -1 
k Dele alg MEE a aa 


The set x in Proposition 36.6.2 is displayed in Figure 36.6.5. 


y 


Imz 


Figure 36.6.5: A fundamental set x for the Picard group PSL2(Z[i]) O H? 


Proof. First, we show that for all z € H? , there exists a word y in the matrices 
(36.6.4) such that yz € X via an explicit reduction algorithm. Let z = x + yj € H°. 
7 A for b € Z[i] act by translation 
zt» z + b, so repeatedly applying matrices from the first two among (36.6.4), we may 
i 
0 
z e (izg)(-i)! = i?(x — yj) = -x + yj, we may suppose Im x > 0. Now if z € x, 
which is to say iiz|? > 1, we are done. Otherwise, ||z||? < 1, and we apply the matrix 


y= : 9 } sich by (36.2.9) acts by 


Recalling the action (36.2.6), the element 


suppose that |Re x|, |Im x| < 1/2. Then applying the element N which acts by 


ri 


=f =z C¥+yj) =x + yj 
IIzll 
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and y’ = y/ iizl? > y, and we repeat. Since Iz is discrete, this terminates after finitely 
many steps: the set 


Tz {z =x’ +y j eH? : [Rex’|, [Imx’| < 1/2, y< y’< 1} 
is discrete and compact, hence finite. 
Next, if z,z’ € H and z € int(x) with z’ = yz for y € I, then y = 1 and z = 2’; 
this can be proven directly as in Lemma 35.1.10 (the details are requested in Exercise 


36.7). It follows that the matrices (36.6.4) generate PSL2(Z[i]) as in Lemma 35.1.12, 
taking instead zo = 2j € int(). o 


A slightly more convenient set of generators, together with the gluing relations 
they provide on the fundamental set, is given in Figure 36.6.7. 


Remark 36.6.6. By a deeper investigation into the structure of the fundamental set X, 
in chapter 37 we will find a presentation for I as 


T= (y Yaya 74177 = =y =l, 
BYP = 3y = (ysya)” = Qari)? = (vay)? = 1). 


3 


Figure 36.6.7: Generators for Picard group PSL2(Z[i]) O H? 
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36.6.8. We compute the volume of this fundamental domain using formulas from the 
previous section. First, we use symmetry to triangulate (tetrahedralize) x, as in Figure 
36.6.9. 


x, = Rex 
Figure 36.6.9: Triangulation of x 
Let 
T := {z = xı + xi + yj € H? :0 <x <x < 1/2, x? +x3 +y” 21}. (86.6.10) 


Applying the symmetries of I’, we see that vol(x) = 4vol(T). We have T = Ty, a 
standard tetrahedron, with œ = 7/4 and dihedral angle y = 7/3. 
Now by the hard-earned volume formula (Corollary 36.5.16) we have 


vol(T) = LLa + 1/3) + L(n/4 — n/3) + 2L(7/4)). (36.6.11) 
By Lemma 36.5.5 with n = 3, we have 
$ L14) = L(n/4) + L(n/4 + 27/3) + L(n/4 + 27/3); (36.6.12) 


since 


LOTA) = Ln- 2/4) = -L4 
L(n/4 + 27/3) = L(n/4 + 27/3 — n) = L(a/4 - 2/3), 


substituting (36.6.12) into (36.6.11) gives 


vol(T) = ; (2 =1= ;] L(n/4) = Ltr /4) = 0.07633... (36.6.13) 
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vol(x) = 4 vol(T) = z £04) = 0.30532... 


36.6.14. We conclude with a beautiful consequence of this volume calculation, giving a 
preview of the volume formula we will prove later. By the Fourier expansion (36.5.20), 


we have 
1 sin(ar/2) 1^ x(n) 
L(7/4) = 5 Dy os » (36.6.15) 


ne 
where 
0, if 2|n; 
x(n) = 41, if n = 1 (mod 4); 
-l, ifn= -1 (mod 4). 


is the nontrivial Dirichlet character modulo 4. We can analytically continue the sum 
(36.6.15) to C via the L-series 


L(s, x) = > x) 
n=1 


for s € C with Res > 1 whose general study was the heart of Part III of this text. Here, 
we can just observe that L(2, y) = 2£(7/4) = 0.915965 ..., so 


1 
vol(x) = vol([\H3) = 34 X) = 0.30532... 


More generally, the volume of the quotient by a Bianchi group is connected to an 
L-value attached to the associated imaginary field; we will pursue this topic further in 
chapter 39. 


Exercises 


1. For z € H? and g = (‘ B ) € PSL2(C), show that 


d 
(az + bX(cz + dy! = (zc + dy! (za + b). 


> 2. Inversion in the circle of radius r in C centered at the origin is defined by the 


map 
2 

Z r 
zer? A = 

Izl Z 


Sending 0 + œ and œ > O under this map, we obtain an anti-holomorphic map 
P!(C) > P!(C). Inversion in a line in C is reflection in the line. 

Verify that every element of PSL2(C) can be written as a composition of at most four 
inversions in circles and lines in C (or equivalently, by stereographic projection, circles 
in P'(C)). 


as in Figure 36.6.16. 
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> 3. 


> 4, 


> 7, 


> 9, 


Figure 36.6.16: Inversion in the circle of radius r 


Verify (36.2.3) for the action of SL2(C) on H? CH. 
0 -l 3 
Let g = 1 olf SL2(C). Show that g acts on H? by 


gz (=x + yj) 


liz? 


where ||z/|? = |x|? + y, and that g is a hyperbolic isometry. 


. Let I < PSL2(C) be a subgroup (with the subspace topology). Show that F has 


a wandering action on H? if and only if T is discrete (cf. Proposition 34.7.2). 


. Show that the reduction algorithm in Proposition 36.6.2 recovers the Euclidean 


algorithm for Z[i] in a manner analogous to Exercise 35.4 for Z. 


Consider the fundamental set x for Tr = PSL2(Z[i]) (36.6.3). Show that if 
zz’ € H and z € int(Xx) with z’ = yz for y € T, then y = 1 and z = z’ (cf. 
Lemma 35.1.10). 


. Let w = e?'/3 e C. The field Q(w) = Q(V-3) is Euclidean under the norm, just 


like Q(i). Give a description of a fundamental domain for the group PSL2(Z[w]) 
analogous to Proposition 36.6.2. [The fields Q(Vd) with d < 0 are Euclidean 
if and only if d = —3, —4, —7, —8, —11, so similar—but increasingly difficult— 
arguments can be given in each of these cases. See Fine [Fin89, §4.3] for 
presentations. ] 


We consider hyperplane bisectors in H? (cf. Exercise 33.8). Let zı, z2 € H? be 
distinct. Let 
H(z1,22) = (2 € H? : p(z, z1) < PG z2)} 


be the locus of points as close to z; as to z2, and let 


L(z1, z2) = bd H(z), z2). 
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Show that H (z1, z2) is a convex half-space (for every two points in the half-space, 
the geodesic between them is contained in the half-space), and that 


L(z1, 22) = {z € H? : p(z, z1) = p(z, z2)} 


is geodesic and equal to the perpendicular bisector of the geodesic segment from 
Z1 to z2. 


10. Prove the duplication formula for the Lobachevsky function £(6) using the 
double angle formula, given that L(7/2) = 0 


> 11. In this exercise, we prove the Fourier expansion 


H= > — (36.6.17) 


n=1 


(a) Define the dilogarithm function by 


show that this series converges for |z| < 1 and that 


Lizz) = - I “ee y 


w 
(b) Prove that 
2 
2i L(0) = Liz(e”®) — = +70-6. (36.6.18) 


[Hint: Differentiate both sides for O < 0 < x, using the limiting value as 
0 — 0 to compute the limiting value Li2(1) = 1 /6.] 


(c) Take imaginary parts of (36.6.18) to prove (36.6.17). 
> 12. As in 36.5.19, we define the Bg € Q for k > 0 by 


x 1 x? 1 x4 
pe sje 7 bane 36.6.19 
=m k 2°62! 304!” ( ) 


-1 
(a) Plug in x = 2iz into (36.6.19) to get 


a 


coors t+) aS 


(b) Integrate twice in (a) to prove 


|Bon| oD ae 
n n+)! 


£0) = 6 | 1 — log|26| + > 


[See also Exercise 40.3.] 
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Chapter 37 


Fundamental domains 


We have seen in sections 35.1 and 36.6 that understanding a nice fundamental set for 
the action of a discrete group T is not only useful to visualize the action of the group 
by selecting representatives of the orbits, but it is also instrumental for many other 
purposes, including understanding the structure of the group itself. In this chapter, we 
pursue a general construction of nice fundamental domains for the action of a discrete 
group of isometries. 


37.1 » Dirichlet domains for Fuchsian groups 


In this introductory section, we preview the results in this chapter specialized to the case 
of Fuchsian groups. Let I c PSL2(R) be a Fuchsian group; then I is discrete, acting 
properly by isometries on the hyperbolic plane H?, with metric p(-,-) and hyperbolic 
area u. 

A natural way to produce fundamental sets that provide appealing tessellations of 
H? is to select in each orbit the points closest to a fixed point zo € H?, as follows. 


Definition 37.1.1. The Dirichlet domain for T centered at zo € HŽ is 
m(3 zo) = {z € H? : p(z, zo) < plyz, zo) for all y € T}. 


As the group T willst vary, we suppress the dependence on T and often write simply 
H(zo) = X(T; zo). 


37.1.2. The set (zo) is an intersection 


(zo) = (| HO; zo) (37.1.3) 
ye 
where 
H(y; zo) := {z € H? : p(z, zo) < plz, zo) = p(z, ¥'z0)}. (37.1.4) 


In particular, since each H(y; zo) is closed, we conclude from (37.1.3) that (zo) is 
closed. 
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The sets H(y; zo) can be further described as follows. If zo = y'zo, then H(y; zo) = 
H?. So suppose zo # y~! zo. Then by Exercise 33.8, H(y; zo) is a (half!) half-plane 
consisting of the set of points as close to Zo as y 20, and H(y; zo) is convex: if two 
points lie in the half-plane then so does the geodesic segment between them. The 
boundary 


bd H(y; zo) = L(y; zo) := {z € H? : p(z, zo) = (zy! z0)} 


is the geodesic perpendicular bisector of the geodesic segment from zo to y~!zo, as in 
Figure 37.1.5. 


H(y; zo) 


L(y; 20) 


Figure 37.1.5: The half-plane H(y; zo) and its boundary L(y; zo) 


From the description in 37.1.2, the sketch of a Dirichlet domain looks like Figure 
37.1.6. 


t (20) 


Figure 37.1.6: Sketch of a Dirichlet domain 


The fundamental sets we have seen are in fact examples of Dirichlet domains. 
Example 37.1.7. We claim that the Dirichlet domain for Fr = PSL2(Z) centered at 
zo = 2i is in fact the fundamental set for I introduced in section 35.1, i.e., 

1(T; zo) = H(2i) = {z € H? : [Rez] < 1/2, |z| > 1}. (37.1.8) 
Recall the generators S,T € I with Sz = —1/z and Tz = z+ 1. By (33.5.3) 


Iz — 2il? 
4Imz ` 


cosh p(z, 2i) = 1 + 
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Let z € H?. Visibly 
p(z 2i) < p(Tz,2i) & Rez -1/2 (37.1.9) 


or put another way 
H(T;2i) = {z € H? : Rez > -1/2}. 


Similarly, H(T7!;2i) = {z € H? : Rez < 1/2}. Equivalently, the geodesic perpendicu- 
lar bisector between 2i and 2i + 1 are the lines Re z = +1/2. 
In the same manner, we find that 
i , lz- 2il -1/2 - 2i? _ 4lzłlz — i/2? 
Z, 2i) < p(Sz, 2i © < = 
plz, 2i) < pl ) Imz Im(-1/z) |z|? Imz 


© |z-2il? < 4z- i/2? 


so 
p(z 2i) < p(Sz,2i) © k21 (37.1.10) 


and H(S;2i) = {z € H? : |z| > 1}. To see this geometrically, the geodesic between 
2i and S(2i) = (1/2)i is along the imaginary axis with midpoint at i, and so the 
perpendicular bisector L(S; 2i) is the unit semicircle. 

The containment (C) in (37.1.8) then follows directly from (37.1.9)-(37.1.10). 
Conversely, we show the containment (2) for the interior—since 1(27) is closed, this 
implies the full containment. Let z € H? have [Re z| < 1/2 and |z| > 1, and suppose 
that z ¢ (21); then there exists y E€ PSL2(Z) such that z’ = yz has p(z’, 2i) < p(z, 2i), 
without loss of generality (replacing z’ by Sz’ or Tz’) we may suppose |Re z’| < 1/2 
and |z’| > 1; but then by Lemma 35.1.10, we conclude that z’ = z, a contradiction. 

(Note that the same argument works with zo = ti for all t € Rs.) 


With this example in hand, we see that Dirichlet domains have quite nice structure. 
To make this more precise, we upgrade our notion of fundamental set (Definition 
34.1.14) as follows. 


Definition 37.1.11. A fundamental set x for T is locally finite if for each compact set 
K CH’, we have yK N x # 0 for only finitely many y € T. 

A fundamental domain for T © H? is a fundamental set x C H? such that 
b(bd x) = 0. 


The first main result of this section is as follows (Theorem 37.5.3). 


Theorem 37.1.12. Let zọ € H? satisfy Stabr(zo) = {1}. Then the Dirichlet domain 
(T; zo) is a connected, convex, locally finite fundamental domain for T with geodesic 
boundary. 


By geodesic boundary we mean that the boundary bd X (zo) is a finite or countable 
union of geodesic segments. As for the hypothesis: for a compact set K c H?, there are 
only finitely many points z € K such that Stabr(z) # {1}, indeed there are only finitely 
many y €T such that yK N K + 0 (as T is discrete), and every such y + 1 has at most 
one fixed point in H? (Lemma 33.4.6). 
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37.2 > Ford domains 


In this section, we reinterpret Dirichlet domains in the unit disc D?, as it is more 
convenient to compute and visualize distances this model. Let zo € H?. We apply the 
map (33.7.3) 


¢: H > D? 
w- 222 
Zz- Zo 


with zo > ¢(zo) = wo = 0. Then by (33.7.6), 


1+ 
p(w, 0) = log |w! 


= 2 tanh™! |w] (37.2.1) 
1 — |w] 


is an increasing function of |w]. 


Example 37.2.2. The Dirichlet domain from Example 37.1.7 looks like Figure 37.2.3, 
drawn in D? (with zo = 27). 


a 


Figure 37.2.3: Dirichlet domain in D? 


Let I < PSL2(R) be a Fuchsian group, and (recalling 33.7.8) to ease notation, we 
identify r with [%. We analogously define a Dirichlet domain 4(wo) for a Fuchsian 
group T centered at wo € D? and 


6(4(zo)) = H(wo) c D? 


if 6(zo) = wo. In particular, the statement of Theorem 37.1.12 applies equally well to 
(wo) € D?. 

For simplicity (and without loss of generality), we only consider the case where 
wo = 0, and then from (37.2.1) 


m(T:0) = {w € D? : w| < lyw| for all y € T}. (37.2.4) 
Let 


_{d ft j 
«-(? 7] €PSUCL 1) © D 


with c, d € C satisfying |d|? — |c|? = 1. 
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37.2.5. We now pursue a tidy description of the set (37.2.4). From (37.2.1), we have 
p(w, 0) < p(gw, 0) if and only if 


dw+ce 
cwrd 


|w] < ; (37.2.6) 


expanding out (37.2.6) and with a bit of patience (Exercise 37.5), we see that this is 
equivalent to simply 
lcw+d|> 1. 


But we can derive this more conceptually, as follows. The hyperbolic metric (Definition 
33.7.1) on D? is invariant, so 


Idw| Id(gw)| 


“= Tat? = lew? 8 
so, by the chain rule, 
d 1-|gw 
|= (Th) 
Therefore 
lwi<|lgwl © ra = a <l © |ew+ad|2>1. (37.2.7) 
dw low + d|? 


dg 
aw 
i.e., g acts as a Euclidean isometry at the point w (preserving the length of tangent 
vectors in the Euclidean metric). So we are led to make the following definition. 


The equivalence (37.2.7) also gives p(w, 0) = p(gw, 0) if and only if =1, 


Definition 37.2.8. The isometric circle of g is 


I(g) = [v eC: CT 
dw 


=1}=(w eC :lew a=, 


We have c = 0 if and only if g(0) = 0 if and only if g € Stabpsuci,1)(0), and in this 
case, gw = (d/d)w with \d/d| = | is rotation about the origin. Otherwise, c # 0, and 
then /(g) is a circle with radius 1/|c| and center —d/c € C. 


37.2.9. Summarizing, for all g € PSU(1, 1), we have 


ext(I(g)), 
p(w, 0) 4 = 7 p(gw,0) according as w € 4 I(g), 


> int(I(g)). 


In particular, 


HT; 0) = a clext I(y). 


yel-Stabr (0) 


This characterization is due to Ford [For72, Theorem 7, §20]. 
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This description of a Dirichlet domain as the intersection of the exteriors of iso- 
metric circles is due to Ford, and so we call a Dirichlet domain in D? centered at 0 a 
Ford domain, as in Figure 37.2.10. 


Figure 37.2.10: A typical Ford domain 


Remark 37.2.11. In the identification H? —> D7, the preimage of isometric circles 
corresponds to the corresponding perpendicular bisector; this is the simplification 
provided by working in D? (the map ¢ is a hyperbolic isometry, whereas isometric 
circles are defined by a Euclidean condition). 


37.3 > Generators and relations 


Continuing with our third and final survey section focused on Fuchsian groups, we 
consider applications to the structure of a Fuchsian group I’. For more, see Beardon 
[Bea95, §9.3] and Katok [Kat92, $3.5]. 

Let x = (I; zo) be a Dirichlet domain centered at zo € H?. A consequence of the 
local finiteness of a Dirichlet domain is the following theorem (Theorem 37.4.2). 


Theorem 37.3.1. T is generated by the set 
{fyeT: an yu #9}. 


So by Theorem 37.3.1, to find generators, we must look for “overlaps” in the 
tessellation provided by x. If z € x N yH with y € T \ {1}, then there exists z, z’ € X 
such that z = yz’, and hence 


p(z zo) < ply !z, zo) = p(z’, zo) < plz’, zo) = p(z, Zo) (37.3.2) 


(twice applying the defining property of 11), so equality holds and (viz. 37.1.2) z € bd xt. 
Since the boundary of x is geodesic, to understand generators we should organize the 
matching provided along the geodesic boundary of x. 

We will continue to pass between H? and D?, as convenient. 
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37.3.3. A side is a geodesic segment of positive length of the form x N yx with 
y €T \ {1}. If two sides intersect in D’, the point of intersection is called a vertex of 
X; equivalently, a vertex is a single point of the form x N yon y'm with y, y’ ET. 

However, we make the following convention (to simplify the arguments below): if 
L = N yx isa maximal geodesic subset of x and y? = 1, or equivalently if yL = L, 
then y fixes the midpoint of L, and we consider L to be the union of two sides that 
meet at the vertex equal to the midpoint. The representation of a side as X N yH is 
unique when y? # 1. 

Because X is locally finite, there are only finitely many vertices in a compact 
neighborhood (Exercise 37.7). 

An ideal vertex is a point of the closure of x in D” that is the intersection of the 
closure of two sides in D**, as in Figure 37.3.4. 


a 


ideal vertex 


rtex 


Figure 37.3.4: A vertex and an ideal vertex 


37.3.5. Let S denote the set of sides of x. We define a labeled equivalence relation on 
S by 
P = {(y, L, L*) : L* = y(L)} cT x (Sx S). (37.3.6) 


We say that P is a side pairing if P induces a partition of S into pairs, and we denote 
by G(P) the projection of P to T. Since (y, L, L*) € P implies (y~!, L*, L) € P, we 
conclude that G(P) is closed under inverses. 


Lemma 37.3.7. A Dirichlet domain x has a side pairing P. 


Proof. Let L be a side with L C H N yx for a unique y. Recalling the convention in 
37.3.3, if y? + 1, then equality holds, and 
yiLennytnaL sb 


so by uniqueness, the equivalence class of L contains only L, L*. If y? = 1, then L 
meets yL = L* at the fixed point of y, and again the equivalence class of L contains 
on L, L*. In either case, we conclude that P (37.3.6) is a side pairing. o 


37.3.8. We now provide a standard picture of Xx in a neighborhood of a point 
webdx. 
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Because x is locally finite, there is an an open neighborhood of w and finitely 
many distinct ôo, 61,...,5n € I with ôo = 1 such that U C (Ji o 6; and w € 6; for 
alli. Shrinking U if necessary, we may suppose that U contains no vertices of X except 
possibly for w and intersects no sides of 6;{ except those that contain w. Therefore, 
we have a picture as in Figure 37.3.9. 


62% 
? 6, 


6,44 


Figure 37.3.9: Standard picture 


When n = 1, then either w can be either a vertex (fixed point of 6) or not. 


37.3.10. Let v = vı be a vertex of x. The standard picture in a neighborhood of v can 
be reinterpreted as in Figure 37.3.11. 


L3 =V] 
Li =4 N7 H Lol Lə 
V1 
V = Vi x ` 
LE Ym Ym—1 ( $ 
= / Liat 
Lim Vm 
mH = yi” ' 


Figure 37.3.11: The standard picture in a neighborhood of v 
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Let Lı be the side containing vı traveling clockwise from the interior. Then by the side 
pairing (Lemma 37.3.7), there is a paired side Lý with Lj = yıLı and yı € G(P). (In 
fact, then L} = H N y n and Li = H N y,H.) Let vz = yıvı. Then v2 is a vertex of 
X, and so is contained in a second side Ly. Continuing in this way, with v;+1 = yivi, 
by local finiteness we find after finitely many steps a final side L}, with next vertex 
Vn = V1. 

In terms of the standard picture (Figure 37.3.9), we see that 6, = vo and by 
induction 6; = (yi-++yi)7', since (yj-++y1)(V1) = vier. Thus yi = ôn} ô; for 
i=0,...,m—1.Let 6 = ôm. Then 6(v) = v, and 6 acts by counterclockwise hyperbolic 
rotation with fixed point v—and m is the smallest nonzero index with this property. It 
follows that for all 0 < j < n, writing j = gm +i with q > 0 and O <i < m we have 
6; = ôl ôi, and in particular that m | (n + 1). Similarly, x N Ivi = {vi, . - -, Vm-1}- 

Let e = (n + 1)/m. Then 6° = 1, and we call this relation the vertex cycle 
relation for v. If v’ = yv, then the vertex cycle relation for v’ is the conjugate relation 
(y7!8y) = y~!6°y = 1. Let R(P) be the set of (conjugacy classes of) cycle relations 
arising from IT -orbits of vertices in X. 


Example 37.3.12. We compute the set R(P) of cycle relations for r = PSL2(Z). The 
two T-orbits of vertices for X are represented by i and p. The vertex i is a fixed point 
of 6; = S, and we obtain the vertex cycle relation S? = 1, and S = S“!. 


694 = H 


Figure 37.3.13: The cycle relation at p 


At the vertex p, we have a picture as in Figure 37.3.13. 
We find that 6, = T~! and 6 = 6) = T~'S, and e = 6/2 = 3 so 6° = (T'S) = 1. 
Taking inverses (so 57! acting instead clockwise), we find the relation (ST) = 1. 


Proposition 37.3.14. The set G(P) generates T with R(P) a set of defining relations. 
In other words, the free group on G(P) modulo the normal subgroup generated by the 
relations R(P) is isomorphic to T via the natural evaluation map. 


Proof. Let I* < T be the subgroup generated by G(P). By Theorem 37.3.1, we need 
to show that if x N yx + 0 then y € I*. So let w € H N yx with y € [—{1}. We refer 
to the standard picture (see 37.3.8); we have y = 6; for some j. For alli = 0, 1,...,n, 
the intersection 6; N 6;4;4 is a side of 6;,;0, so H N 6; 6104 is a side of x, and 
thus 67 ôi € G(P) is a side pairing element. Since 69 = 1, by induction we find that 
6; € I* for all i, so y = 6; € I* as claimed. 

We now turn to relations, and we give an algorithmic method for rewriting a relation 
in terms of the cycle relations. Let y;y2--- yz = 1 be a relation with each y; € G(P), 
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and let z; = (y1 ++- Yi)Xzo fori = 1,...,k. Exactly because yı € G(P), we have that x 
and y; share a side, and since X is connected, we can draw a path zo — zı through 
the corresponding side. Continuing in this way, we end up with a path zo — Zk = Zo, 
hence a closed loop, as in Figure 37.3.15. 


Figure 37.3.15: A closed loop 


Let V be the intersection of the I orbit of the vertices of x with the interior of the 
loop; by local finiteness, this intersection is a finite set, and we proceed by induction 
on its cardinality. The proof boils down to the fact that this loop retracts onto the loops 
around vertices obtained from cycle relations, as H? is simply connected. 

If the path from zo — zı crosses the same side as the path zķ-1 — zk = Zo, then 
Z1 = Zķ-1 and so yz’ = yı, since Stabr(zo) = {1} (see Figure 37.3.16). 


e 21 = Zk-1 


Sf a 


Figure 37.3.16: Simplifying a relation: setup 


Conjugating the relation by yı and repeating if necessary, we may suppose that yr’ + 
Y1, SO Zk-1 # Z1; the set V is conjugated so it remains the same size. In particular, if V 
is empty, then this shows that the original relation is conjugate to the trivial relation. 

Otherwise, the path zọ — zı crosses a side and there is a unique vertex v on this 
side that is interior to the loop (working counterclockwise): see Figure 37.3.17. 


Figure 37.3.17: Simplifying a relation: using the cycle relation 


The cycle relation 6° = 1 at v traces a loop around v, and without loss of generality 
we may suppose 6° = ay, with œ a word in G(P). Therefore, substituting this relation 
into the starting relation, we obtain a relation y2---yx(6°a@) with one fewer interior 
vertex; the result then follows by induction. o 
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In section 37.5, we consider a partial converse to Proposition 37.3.14, due to 
Poincaré: given a convex hyperbolic polygon with a side pairing that satisfies certain 
conditions, there exists a Fuchsian group I with the given polygon as a fundamental 
domain. 


37.4 Dirichlet domains 


We now consider the construction of Dirichlet fundamental domains in a general 
context. Let (X, p) be a complete, locally compact geodesic space. In particular, X is 
connected, and by the theorem of Hopf—Rinow (see Theorem 33.2.9), closed balls in 
X are compact. 

Let I be a discrete group of isometries acting properly on X. Right from the get 
go, we prove our first important result: we exhibit generators for a group based on a 
fundamental set with a basic finiteness property. 


Definition 37.4.1. Let A C X. We say A is locally finite for T if for each compact set 
K c X, we have yK N A # 0 for only finitely many y ET. 


The value of a locally finite fandamental set is explained by the following theorem. 


Theorem 37.4.2. Let x be a locally finite fundamental set for T. Then T is generated 
by the set 
{fyeT: uN yo +0 (37.4.3) 


Proof. Let [* <T be the subgroup of T generated by the elements (37.4.3). We want 
to show I* =T. 

For all x € X, by Theorem 37.4. 18, there exists y € T such that yx € x. If there is 
another y’ € T with y’x € x, then 


y'xen nyy H 
so y’y~! e T* and in particular Py = T*y’. In this way, we define a map 
f: X oP\V 


37.4.4 
xe I*y ( ) 


for all y € T such that yx € =. 

We now show that f is locally constant. Let x € X. Since x is locally finite, for 
every compact neighborhood K 3 x we can write K C |; yi; with a finite union, and 
by shrinking K we may suppose that x € y,;x for all i. In particular, f(x) = A ia for 
any i. But then if y € K, then y € yX for some i, so f(y) = Ty, * = f(x). Thus f is 
locally constant. 

But X is connected so every locally constant function is in fact constant and 
therefore f takes only the value I“. Let y € I and let x € x. Then 


T = f(x) = fox) =T"y 


so y € I“, and the proof is complete. o 
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We now seek a locally finite fundamental set with other nice properties: we will 
choose in each I’-orbit the closest points to a fixed point x9 € X. So we first must 
understand the basic local properties of intersections of these half-spaces (as in 37.1.2). 


37.4.5. For x1, x2 E€ X, define the closed Leibniz half-space 
A(x), x2) = {x € X : p(x, x1) < p(x, x2)}. (37.4.6) 


If xı = x2, then H(x1, x2) = X. If xı + x2, then H(x1, x2) consists of the set of points 
as close to x; as x2, SO 


int H(x1, x2) = {x € X : p(x, x1) < p(x, x2)}. (37.4.7) 


and 
bd H(x1, x2) = L(x1, x2) = {x € X : p(x, x1) = pl, x2)} 


is called the hyperplane bisector (or equidistant hyperplane or separator) between 
xı and x2. 


Remark 37.4.8. In this generality, unfortunately hyperplane bisectors are not neces- 
sarily geodesic (Exercise 37.10). 


Definition 37.4.9. A set A C X is star-shaped with respect to xo € A if for all x € A, 
the geodesic between x and xo belongs to A. 


A set A C X that is star-shaped is path connected, so connected. 
Lemma 37.4.10. A Leibniz half-plane H(x,, x2) is star-shaped with respect to xı. 


Proof. Let x € H(x, x2) and let y be a point along the unique geodesic from x to x1. 
Then 


P(*1,y) + p(y, x) = p(xı, x). 
If y ¢ A(x, x2), then p(x2, y) < p(x, y), and so 
P(X2, x) < p(%2, y) + P, x) < pO, y) + P, x) = pO, x) 

contradicting that x € H(x1, x2). So y € A(x, x2) as desired. o 

Now let xo € X. 
Definition 37.4.11. The Dirichlet domain for I centered at xo € X is 

(T; xo) = {z € X : p(x, xo) < p(yx, xo) for all y € T}. 
We often abbreviate x (xo) = H(T; xo). 


37.4.12. Since p(yx, xo) = p(x, y7! xo), 
n (xo) = (HO, 77x0); 
ye 


each half-space is closed and star-shaped with respect to xo, so the same is true of 
M(x9). In particular, X (xo) is connected. 
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A Dirichlet domain satisfies a basic finiteness property, as follows. 


Lemma 37.4.13. If A c X is a bounded set, then A C H(y; xo) for all but finitely many 
yer. 


In particular, for all x € X we have x € H(y; x) for all but finitely many y €T. 
Proof. Since A is bounded, 
sup({e(x, xo) : x E A}) =r < œ. 


By Theorem 34.5.1, the orbit Ixo is discrete and # Stabr(xo) < 00; since closed balls 
are compact by assumption, there are only finitely many y € I’ such that 


(7X0. x0) = p(%o, Y~" xo) < 2r 
and for all remaining y € T and all x € A, 
p(x, y7" xo) = plo. y7" xo) = PC, Xo) > 2r =r =r > p(x, xo) 


so x € H(y; xo), as in Figure 37.4.14. 


Figure 37.4.14: The bounded set A and the orbit of xo 


This concludes the proof. o 


37.4.15. Arguing in a similar way as in Lemma 37.4.13, one can show (Exercise 37.8): 
if K is a compact set, then K N L(y; xo) # 0 for only finitely many y €T. 


Lemma 37.4.16. We have 
int H (xo) = {x € H : p(x, xo) < plyx, xo) forally € TN Stabr(xo)}. 


It follows from Lemma 37.4.16 that x € bd (xo) if and only if there exists 
y EIN Stabr(xo) such that p(x, xo) = p(yx, xo). 
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Proof. Let x € x, and let U > x be a bounded open neighborhood of x. By Lemma 
37.4.13, we have U C H(y; xo) for all but finitely many y € T, so 


UNK = U N ( HC, y7 'xo) 


the intersection over finitely many y; € I with y; ¢ Stabr(xo) (see Figure 37.4.17). 


Figure 37.4.17: The neighborhood U 


Thus 
U nintia) =UN () int H(xo, y;!x0). 


L 


The lemma then follows from (37.4.7). o 
The first main result of this chapter is the following theorem. 


Theorem 37.4.18. Let xo € X, and suppose Stabr (xo) = {1}. Then X(T; xo) is a locally 
finite fundamental set for T that is star-shaped with respect to xọ and whose boundary 
consists of hyperplane bisectors. 


Specifically, in a bounded set A, by Lemma 37.4.13 


A nbd (03 xo) € |_] Lo, yixo) 
for finitely many y; EF N {1}. 


Proof. Abbreviate X = (xo). We saw that x is (closed and) star-shaped with respect 
to xo in 37.4.12. 
Now we show that X = U, yx. Let x € X. The orbit Tx is discrete, so the distance 


p(x, xo) = inf({p(yx, xo): y ETH (37.4.19) 


is minimized at some point yx € X with y € I. Thus (x9) contains at least one point 
from every I’-orbit, and consequently . 

We now refer to Lemma 37.4.16. Since X is complete, this lemma implies that 
cl(int(x)) = x. And int(x) N int(yt) = 0 for all y € T \ {1}, because if x, yx € int() 
with y + 1 then 


p(x, x0) < plyx, x0) < ply (yx), x0) = p(x, xo), (37.4.20) 


a contradiction. 
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Finally, we show that X is locally finite. It suffices to check this for a closed disc 
K C X with center xo and radius r € Rso. Suppose that yK meets x with y € T; then 
by definition there is x € x such that p(xo, y~ tx) < r. Then 


p(x0, Y7! xo) < p(xo, ¥'x) + ply! x, y7 xo) < r + p(x, xo). 
Since x € x, we have p(x, xo) < p(y! x, xo) < r, so 
p(xo, y lxo) <rt+r=2r. 


This setup can be seen in Figure 37.4.21. 


yK 
x reykKuUx 
N 
AN x y ine K 
\ 
K "£0 


Figure 37.4.21: x is locally finite and star-shaped 


For the same reason as in Lemma 37.4. 13, this can only happen for finitely many y € T. 
o 


Remark 37.4.22. Dirichlet domains are sometimes also called Voronoi domains. 


37.5 Hyperbolic Dirichlet domains 


We now specialize to the case X = H where H = H? or H = H? with volume p; then 
T is a Fuchsian or Kleinian group, respectively. 


Definition 37.5.1. A fundamental domain for [ © X is a connected fundamental set 
mt C X such that (bd x) = 0. 


(A domain in topology is sometimes taken to be an open connected set; one also 
sees closed domains, and our fundamental domains are taken to be of this kind.) 
We now turn to Dirichlet domains in this context. 


37.5.2. In Theorem 37.4.18, the hypothesis that Stabr(zo) = {1} is a very mild hypoth- 
esis. If K is a compact set, then since X is locally finite, the set of points z € K with 
Stabr(z) + {1} is a finite set of points when H = H? and a finite set of points together 
with finitely many geodesic axes when H = H°. 
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In spite of 37.5.2, we prove a slightly stronger and more useful version of Theorem 
37.4.18, as follows. If T'o = Stabr(zo) is nontrivial, we modify the Dirichlet domain by 
intersecting with a fundamental set for To; the simplest way to do this is just to choose 
another point which is not fixed by an element of Tọ and intersect. 


Theorem 37.5.3. Let z € H, let To = Stabr(zo), and let up € H be such that 


Stab, (uo) = {1}. Then 
ML; zo) N Ho; uo) 


is a connected, convex, locally finite fundamental domain for I with geodesic boundary 


inH. 


Proof. Abbreviate 
H = H(I; zo) N (To; uo). 

First, we show that z € m: we have zo € X(T; zo), and by Theorem 37.4.18, 
x (To; uo) is a fundamental set for To so there exists yọ € Tg such that youo = uo € 
x (To; uo). 

Now we show that x is a fundamental set for T. First we show H = Uy yH. Let 
z € H, and let y € T be such that p(yz, zo) is minimal as in (37.4.19). Let yo € To such 
that yo(yz) € x (To; uo). Then 


P(Y0(Y2), 20) = PZ, zo) 


so yoyz € H. And int) N int(yH) = @ for all y € T N {1}, because if z, yz € int(x) 
with y + 1, then either y ¢ Io in which case we obtain a contradiction as in (37.4.20), 
or y € Io — {1} and then we arrive at a contradiction from the fact that (19; ug) is a 
fundamental set. 

We conclude by proving the remaining topological properties of x. We know that 
X is locally finite, since it is the intersection of two locally finite sets. We saw in 37.1.2 
that the Leibniz half-spaces in H? are convex with geodesic boundary, and the same is 
true in H? by Exercise 36.9. It follows that is convex, as the intersection of convex 


sets. Thus i 
bd x c U L(zo, y zo) 
yT \ {1} 


is geodesic and measure zero, since L(zo, y~!zp) intersects a compact set for only 
finitely many y by 37.4.15. oO 


37.6 Poincaré’s polyhedron theorem 


Continuing with the notation from the previous section, we now turn to a partial 
converse for Theorem 37.5.3 for H, we need one additional condition. Let x be a 
convex (finite-sided) hyperbolic polyhedron equipped with a side pairing P. 


37.6.1. First suppose X C H?. For a vertex v of x, let (x, v) be the interior angle of 
x at v. We say that x satisfies the cycle condition if for all vertices v of x there exists 
e € Zyo such that 
2 
omv) ==. 


e 
viel vnn 
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Put another way, x satisfies the cycle condition if the sum of the interior angles for a 
T-orbit of vertices as in the standard picture is an integer submultiple of 27. 

Now suppose X © H?. Now we work with edges: for an edge £ of x, let #(x, £) be 
the interior angle of x at £. We say that x satisfies the cycle condition if for all edges 
£ of x there exists e € Zs such that 


A O, li) = za 


€;<Tenn 7 
Lemma 37.6.2. Let be a Dirichlet domain. Then x satisfies the cycle condition. 


Proof. We explain the case where x C H’; the case of H? is similar. Let v be a vertex 
of X. Referring to the standard picture 37.3.8, 


2n = X A6, v) = > O(m, 571). 


0 J= 


n 


J 


In 37.3.10, we proved that ôm acts by hyperbolic rotation around v and has order 
e = (n+ 1)/m, and 


m-—1 


n m-l 
Xama v) = e X 904, 6;'v) =e Y o vi) 
j=0 i=0 i=0 


with x N Tv = {v1,...,V¥m_1}. Combining these two equations, we see that the cycle 
condition is satisfied. o 


37.6.3. There is another condition at certain points at infinity that a fandamental 
domain must satisfy. We say a point z € bd H is a infinite vertex if z lies in the 
intersection of two faces and is tangent to both. We define a sequence of tangency 
vertices analogous to the cycle transformations to get an infinite vertex sequence and 
a infinite vertex transformation. We say that the side/face pairing is complete if the 
tangency vertex transformation is parabolic. 


Theorem 37.6.4 (Poincaré polygon theorem). Let x be a convex finite-sided hyper- 
bolic polygon/polyhedron with a side/face-pairing P. Suppose that 1 satisfies the cycle 
condition and P is complete. 

Then the group I := (G(P)) generated by side/face-pairing elements is a Fuch- 
sian/Kleinian group, X is a fundamental domain for T, and R(P) forms a complete set 
of relations for G(P). 


Proof. Unfortunately, it is beyond the scope of this book to give a complete proof of 
Theorem 37.6.4. See Epstein—Petronio [EP94, Theorem 4.14] or Ratcliffe [Rat2006, 
§11.2, §13.5]; our statement is a special case of the theorem by Maskit [Mas71], but 
see Remark 37.6.5. oO 


Remark 37.6.5. The proof of Poincaré’s theorem [Poil882, Poil883] has a bit of 
a notorious history. From the very beginning, to quote Epstein—Petronio [EP94, $9, 
p. 164]: 
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It is clear that Poincaré understood very well what was going on. How- 
ever, the papers are not easy to read. In particular, the reader of the 
three-dimensional case is referred to the treatment of the two-dimensional 
case for proofs; this is fully acceptable for a trail-blazing paper, but not 
satisfactory in the long term. 

There are a number of reasonable published versions of Poincaré’s The- 
orem in dimension two. Of these, we would single out the version by 
de Rham [dR71] as being particularly careful and easy to read. Most 
published versions of Poincaré’s Theorem applying to all dimensions are 
unsatisfactory for one reason or another. 


This sentiment is echoed by Maskit [Mas71], who presents a proof for polygons with 
an extension to polyhedron, with the opening remark: 


There are several published proofs of [Poincaré’s classical] theorem, but 
there is some question as to their validity; Siegel [Sie65, p. 115] has com- 
mented on this and given an apparently valid proof under fairly restrictive 
conditions. None of the published proofs are as general as they might be, 
and they all have a convexity condition that is never really used. 

This note is an attempt to clarify the situation. The problem and the solu- 
tion presented below arose during the course of several informal conver- 
sations. Present at one or more of these conversations were L. V. Ahlfors, 
L. Bers, W. Magnus, J. E. McMillan, and B. Maskit. 


Epstein and Petronio [EP94, §9, p. 165] then have this to say: 


Maskit’s paper contains a nice discussion of completeness, though again it 
is not a constructive approach. He limits his discussion to hyperbolic space 
in dimensions two and three. We are not confident that the arguments in 
the paper are complete. For example, there seems to be an assumption 
that the quotient of a metric space, such that the inverse image of a point 
is finite, is metric. This is false, as is shown by identifying x with —x in 
[-1, 1], provided O < x < 1. 


Maskit published a book [Mas88] containing an expanded version of the proof for 
polyhedra, to which Epstein and Petronio [EP94, §9, p. 164] review: 


The treatment in [Mas88] is difficult to understand. For example in H.9 on 
page 75, it is claimed that a metric is defined in a certain way, and this fact 
is said to be “easy to see”, but it seems to us an essential and non-trivial 
point, which is not so easy to see, particularly when the group generated 
by the face-pairings is not discrete. ... The Proposition in IV.1.6 on page 
79 of this book is incorrect—a counter-example is given in Example 
9.1—because there are no infinite cycles or infinite edges according to the 
definitions in the book. 
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37.7 Signature of a Fuchsian group 


As an application of Theorem 37.5.3, we relate area and signature for good orbifolds 
obtained from Fuchsian groups. 


37.7.1. We first recall 34.8.13: a good compact, oriented 2-orbifold X is classified 
up to homeomorphism by its signature (g;e1,...,ex), where g is the genus of the 
underlying topological surface and the e;,...,e, are the orders of the (necessarily 
cyclic) nontrivial stabilizer groups. This extends to good orbifolds Y with finitely 
many points removed: we define the signature (g;e1,..., eg, ô) where 6 > O is the 
number of punctures. 


Now let IT < PSL2(R) be a Fuchsian group such that the quotient Y(T) = T \H? has 
finite hyperbolic area u(Y(T)). 


Theorem 37.7.2. (Siegel). Suppose that u(T\H?) has finite hyperbolic area. Then 
every Dirichlet domain X has finitely many sides. 


Proof. The proof estimates the contribution to the volume from infinite vertices: for 
Fuchsian groups, see Siegel [Sie45, p. 716-718], or the expositions of this argument 
by Gel’ fand—Graev—Pyatetskii-Shapiro [GG90, Chapter 1, §1.5] and Katok [Kat92, 
Theorem 4.1.1]. o 


37.7.3. By Theorem 37.5.3 and Siegel’s theorem (Theorem 37.7.2), a Dirichlet domain 
x for T is connected, convex, hyperbolic polygon. In particular, there are m € Z>o vertex 
cycles, which by 37.3.10 correspond to cyclic stabilizer groups of orders e1, ..., €m € 
Z>ı listed so that e1,...,eg = 2, and finitely many 6 € Z>ọ infinite vertex cycles, 
corresponding to 6 stabilizer groups that are infinite cyclic. 


Proposition 37.7.4. We have 


Proof. Let x be a Dirichlet domain for I with 2n sides and n finite or infinite vertices. 
The hyperbolic area of x is given by the Gauss—Bonnet formula 33.6.9: summing 
vertex cycles using the cycle condition (Lemma 37.6.2), we have 


k 
2 
pw) = Q2n-2)n- Y, = 
i=l ' 


The quotient T'\H? is a (punctured) oriented topological surface of genus g, with k + ô 
vertices, n edges, and 1 face. By Euler’s formula, we have 


2-2g=(k+6)-n+1 


so 
n-1=2g-—2 + (k +ô). 
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Therefore 
E k 1 
u(t) = 27 ae-2)+1k+5)- )) 2] -2r(os-2+ 5 (1- =| +o) 
ia © i=l ei 
as claimed. o 


37.8 The (6, 4, 2)-triangle group 


We pause to refresh the quaternionic thread and consider a particularly nice example, 
showing how quaternion algebras arise naturally in the context of Fuchsian groups. 

Consider a hyperbolic triangle T with angles 7/2, 2/6, 2/6 and vertices placed as 
in Figure 37.8.1. 


n a 


Figure 37.8.1: The hyperbolic triangle T with angles 7/2, 7/6, 1/6 


By symmetry, we may define the side-pairing P as shown in Figure 37.8.1. This polygon 
satisfies the cycle condition, so by the Poincaré polygon theorem (Theorem 37.6.4), 
there exists a Fuchsian group A C PSL2(R) generated by the two side pairing elements 
in P and with fundamental domain T. In this section, we construct this group explicitly 
and observe some interesting arithmetic consequences. 


37.8.2. We seek the position of the vertex z € H? corresponding to w € D?. Zooming 
in, we obtain the picture as in Figure 37.8.3. 


T/4 


-¢ m/12 0 
lew +d)? =1 


-2 =1>d=V1 +2 


Figure 37.8.3: Finding the position of z 
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The edge containing w and its complex conjugate is defined by an isometric circle 
|cw + d| = 1 with d? — ¢? = 1 and by symmetry c, d € Ryo. With the angles as labeled, 
we find that w = (—d + e7"/!”)/c, and since arg(w) = 37/4 we compute that 


V1 +c? —cos(/12) = sin(a/12) (37.8.4) 


so c? = 2cos(s/12) sin(z/12) = sin(7/6) = 1/2 thus c = 1/V2 and d = V372, so this 
isometric circle is defined by |w + vV3P = 2, and w = (-1+i)/(1+ V3). By coincidence, 
we find that z = z = (-1 + i)/(1 + V3) = w as well. The intersection of this circle 
with the imaginary axis is the point z3 = (V3 — 1)i/ V2. 

The unique element mapping the sides meeting at i is obtained by pulling back 
rotation by —7/4 in the unit disc model; it is given by the matrix 


Cee OV a. 1f 
a=; al 0 ons) ( j) ~ya\l 1 E82) 
with z-1 
64(z) = ——, for z € H’, 
z+1 


and ô$ = 1 € PSL,(R). From a similar computation, we find that the other side pairing 
element acting by hyperbolic rotation around z3 is 


1 0 -1 + V3 
62 = — 37.8.6 
EE (a -43 0 ) ee) 
and 6; = 1 € PSL2(R). We have (6264)(z4) = 62(z2) = z4, so the element 
-_1fl+v3 1-73 
= os 7.8.7 
96 = (6204) " = 5 (eee al eet) 


fixes the vertex z4, with 66(z,) = (1 + V—3)/2 and oe = | € PSL,(R). These actions 
are summarized in Figure 37.8.8. 


na 


Zy=1 


-1_1f1+v3 1-¥v3 
ôs = (6264) =y 1A) 


Figure 37.8.8: Side-pairing elements for the (6, 4, 2)-triangle group 
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So recording these cycle relations, 
A = (5p, 64 | 6; = 64 = (5264)° = 1) 
= (52, 64, 56 | 65 = 57 = ôf = 625456 = 1). 


37.8.9. From this setup, we can identify a quaternion algebra obtained from (appro- 
priately scaled) generators of A. We have the characteristic polynomials 


65 +1 = 62 — V264 +1 = 62 — V355 + 1 =0. 


To obtain rational traces, and to simplify the presentation a bit further we consider the 
Z-subalgebra B C M2(R) generated by 


i := V264, j := V366, k := —V66>; 


we have 2 ; ; = 
iv =2i-2 jk =-3i 
pf =3j-3 ki = -2j (37.8.10) 
k? =-6 ij = -k 


so as in (22.1.2) with (a, b,c, u,v, w) = (—3, —2, —1, 2, 3,0), we obtain an order O = 
Z + Zi + Zj + Zk of reduced discriminant —24 + 12 + 18 = 6 and with associated 
ternary quadratic form 


—3x? — 2y? — z? + 2yz + 3xz. 


37.8.11. We now try to simplify the presentation (37.8.10) as much as possible. This 
kind of activity is more aesthetics than mathematics, but for this example there is a 
preferred way of writing the algebra and order (regrettably, not in a good basis) as 
follows. We write i’ = i — 1 and j’ = i(j — 3/2i) = -3i + k + 3 so that now (i’)* = —1 
and (j’)* = 3 and j’i’ = —i’j’; the remaining basis element of the order in terms of 
these generators can be taken to be k’ = (1 +i’ + j’ +i'j’)/2 =3-i-jtk. 


37.8.12. Throwing away primes from the previous paragraph, we have the algebra 


B:= (55) of discriminant disc B = 6 and order 


Ltitj4i 
O:=ZeZieZjeZk, k= Ht 
with k? — k — 1 = 0. (By 23.1.1, since discrd(O) = disc B = 6, we conclude that O is 
a maximal order in B.) 

This algebra came with the embedding 


(37.8.13) 


loo: B = M(R) 
+. [0 =) fa 0 
LJ => 1 olo -v3 


t+xi+yj+zij > t+yvV3 —x + zv3) _ 
yJ J tea t- yv3 = &. 
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Following the transformations back, we recover the triangle group as a subgroup of 
B*/F* after rescaling to be 


ô := 3i +ij = —1 + 2i — j + 2k, 
ô, := 1 +i, (37.8.14) 
ôs :=3 +3i+j+ij=2+2i + 2k; 
and we confirm 
65=-6, ð =-4, 68 =-1728, 626456 = -12 


so the images of these elements in B*/F* generate the group A. 
In fact, the elements 69, 64, 66 E€ Ngx(O) normalize the order O: 


65/162 =-L+i-j+2k 65' jd. =-j ôz'kô&2=1-k 
6, 'iô4 =i ô; jô4 = -ij ôz kôs=1+i+j-k 
6g i66=—-lL+i-j+2k  &'jôs=2-i+2j-4k  óz'kős=1+j-k 


37.9 Unit group for discriminant 6 


In this section, we continue the example from the previous section, specifically the 
order O in 37.8.12. Let 


T := to(O!/{+1} < PSL2(R). 


By organized enumeration, we will see that I’ is a Fuchsian group and compute a 
fundamental domain for the action of I’. We return to this example in section 38.1 as 
a basic example of the general theory of arithmetic groups arising from quaternion 
algebras. 


37.9.1. Moving to the unit disc via the map z œ> @(z) = w = (z - i)/(z +i), 


we t—ix V3(y — iz) 
V3(y +iz) t+ix 

To avoid cumbersome notation, we identify g with g®. 
The isometric circle of such an element g is a 


1 —(t +ix 
circle of radius —— centered at ( ) 


BO + 22) V3(y + iz) 


when y? +z? #0; when y? +z? = 0, the center w = 0 € D? is stabilized, with stabilizer 


Stabr(0) = (S) where S$ = f 


0 °) acts by rotation by 2. We have 


(130) = () clext/(y). 


yel-(S) 
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To make a fundamental domain, we need to intersect 4(0) with a fundamental domain 
for (S), and we take the left half-plane. Then 


m= 0(1;0) N {w € D? : Rew < 0} 
is a fundamental domain for I. 


37.9.2. We list elements g = t + xi + (y + zi)j with detg = £ + x? -3(y? + 27) = 1 
and t, x, y,z € iZ whose doubles are all of the same parity. We enumerate them in 
increasing order of the (square) inverse radius y? + z? (ignoring the factor 3). The 
case y? +z? = 0 gives us the stabilizer. By parity, we cannot have y? + z? = 1/4. 
If y? +z? = 1/2 then we find y,z = +1/2 and £? + x? = 5/2. Sifting out all of the 
possibilities, we find 


All of these elements have the radius of /(g) equal to ¥2/3 = 0.82..., and the centers 
are +1.15 + 0.57V¥-1 and +0.57 + 1.15V—1. The corresponding external domain is 
given in Figure 37.9.3. 


Figure 37.9.3: A starting external domain 


37.9.4. If we continue in this way, listing elements according to decreasing radius, we 
find that all remaining elements have too small a radius to cut away anything extra from 
the external domain. The corresponding external domain looks like Figure 37.9.5. 
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LE y; 
XN 


Z, 
= 


eS. 
Ay 


y 


\ IN 


yes 
Co 


“2 AX 
NAROVA 
ANNA 


Figure 37.9.5: The external domain 


It follows that the X is cut out by the left half with side pairing as in Figure 37.9.6. 


fal 
> 


Figure 37.9.6: n c D? 
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Pulling back to the upper half-plane, we obtain the domain in Figure 37.9.7. 


n 


Vi = y—1 


=vV3+v-1I _ 
2 =n 


Figure 37.9.7: n c H? 


The corresponding tessellation of the upper half-plane looks like the one in Figure 
379.8. 


Figure 37.9.8: Tessellation of H? by T 


In addition to this side pairing, we check the cycle relations on the four orbits of 
vertices: the fixed points vj, v3, vs and the orbit v2, v4, ve. In conclusion, 


T= (yn 73 |Y] =V = 73 = rN)" = 1) (37.9.9) 
and letting y4 = y3y2y1 = —2i + j, we can rewrite this more symmetrically as 
L = (yr Y2 Y3 Y4 |V] = 79 = ¥3 = V4 = Y47321 = 1). 


The order of the stabilizers tell us the angles at each vertex, and so by the Gauss—Bonnet 
formula (Theorem 33.6.8) we compute that the area is 


MOA) = 3a — (2/3 + 2/3 + Ia = 27/3. 
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37.9.10. Finally, the quotient X(T) = T\H? has the structure of a good complex 1- 
orbifold (see 34.8.14), a Riemann surface but with finitely many orbifold points. Since 
the fundamental domain X is compact, via the continuous surjective projection map 
n — X(T) we see that X(T) is compact, and u(X(T)) = u(x) = 27/3. This orbifold 
‘folds’ up to a surface with topological genus 0, so the signature 34.8.13 of X(T) is 
(0; 2, 2, 3,3): see Figure 37.9.11. 


V2, V4, V6 


orbifold signature 
(0; 2,2,3,3) 


V3 V5 


v1 


ww 


Figure 37.9.11: X(T) as an orbifold and Riemann surface 


We have seen that the norm 1 group contains the (2, 4, 6)-triangle group A, so we 
have a map X(T) > X(A) = A\H?; by Gauss—Bonnet, we have 


UXA) = 2(4 — (1/2 — 1/4 — 1/6)x) = 7/6, 


we see 


[A: T] = wX(1))/u(X(A)) = (27/3)/(2/6) = 4, 


4 
as is visible from Figure 37.9.12 for I < A. 
Figure 37.9.12: Triangulation of x 


Remark 37.9.13. The discriminant 6 quaternion algebra has been a favorite to study, 
and the fundamental domain described above is also given by e.g. Alsina—Bayer 
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[AB2004, §5.5.2] and Kohel—Verrill [KV2003, §5.1]. We return to this example in 
section 43.2 in the context of abelian surfaces with quaternionic multiplication. 


Exercises 


1. 


2. 


> 7. 


Let IT = PSL»(Z). Describe the Dirichlet domain 4(z) for an arbitrary z € H? 
with Imz > 1. 


Let I = PSL2(Z[i]) (cf. section 36.6). Show that 
a; 2j)={z=x+yjEe H? : Re x|, Im x| < 1/2 and |iz|| > 1} 
and that 


i 0 


Stabr (2j) = (i )) = Z/2Z 


—i 


so X(T; 27) is a union of two copies of a fundamental set for T. 


. LetT be the cyclic Fuchsian group generated by the isometry z + 4z, represented 


by 0 1/2 € PSL2(R). Give an explicit description of the Dirichlet domain 


x(T;i) c H? and its image x(I;0) c D? with i > 0. 


. Let g = [ s) € PSL2(R) satisfy gi + i. Let H be the perpendicular bisector 


between i and gi. 


(a) Show that lIgll? > 2. 
(b) Under the map H? — D? taking i + 0, show that the perpendicular 
bisector between i and gi is the isometric circle I(g7!) inside D?. 
lgi? -2 
(c? +d? — 1)? 


€ R, where lell? =a’ +b? +c? +d. [Hint: as a check along 


(c) Show that H is the half-circle of square radius centered at 


ac + bd 


c? +d? -1 
the way, H is described by the equation \(d — ic)z + i(a + ib)| = |z + i|.] 


. Let 


fdc 2 
e=[? 7] €PSUCL D U D 


with c,d € C satisfying jdi? —|c|? = 1. Show directly that |gw| = |w| for w € D? 
if and only if 
lcw+d|=1 


by expanding out and simplifying. 


. Show that for all g € PSU(1, 1), we have g/(g) = Ie"), where /(g) is the 


isometric circle of g. (Equivalently, show that if g € PSL2(R) that gL(g; zo) = 
L(g7!; zo) for all zo € H?.) 

Let x be a Dirichlet domain for a Fuchsian group I’. Show that in every compact 
set, there are only finitely many sides and finitely many vertices of x. 
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8. 


10. 


11. 


Let I’ be a discrete group of isometries acting properly on a locally compact, 
complete metric space X, and let x9 € X. Recall the definition of H(y; xo) 
(37.4.6) for y € T and L(y; zo) = bd H(y; zo). Show that if K is a compact set, 
then K N L(y; xo) + 0 for only finitely many y ET. 


. Extend Theorem 37.4.18 as follows. Let (X, p) be a complete, locally compact 


geodesic space, let be a discrete group of isometries acting properly on X. Let 
xo € X, let To = Stabr(xo), and let uo € X be such that Staby, (uo) = {1}. Show 
that Ip is a discrete group of isometries acting properly on X, and 


(I; zo) A (19; uo) 
is a locally finite fundamental set for I that is star-shaped with respect to xo and 


whose boundary consists of hyperplane bisectors. 


Consider the egg of revolution, a surface of revolution obtained from convex 
curves with positive curvature as in Figure 37.9.14. 


top 
geodesic 
bisector 
bottom 


Figure 37.9.14: Egg of revolution 


An egg of revolution has the structure of a geodesic space with the induced 
metric from R*. Show that the separator between the top and bottom, a circle of 
revolution, is not geodesic. [In fact, Clairaut’s relation shows that the geodesic 
joining two points in the same circle of revolution above crest in the x-axis never 
lies in this circle of revolution. ] 


In this exercise, we consider a Fuchsian group constructed from a regular quadri- 
lateral. 
(a) Show that for every e > 2, there exists a regular (equilateral and equian- 


gular) quadrilateral m c D?, unique up to isometry, with interior angle 
m/(2e). 


Conclude from Poincaré’s theorem that there is a Fuchsian group, unique 
up to conjugation in PSL2(R), with fundamental domain x and side pairing 
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= 


Figure 37.9.15: A special quadrilateral Fuchsian group 


as in Figure 37.9.15. 


(b) Give a presentation for this group for all e > 2, and find explicit matrix 
generators for the special case when e = 2. 
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nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and 
reproduction in any medium or format, as long as you give appropriate credit to the 
original author(s) and the source, provide a link to the Creative Commons license and 
indicate if changes were made. 
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Chapter 38 


Quaternionic arithmetic groups 


In this chapter, we now apply our topological and geometric interpretation of discrete 
groups in our case of interest: quaternion unit groups. 


38.1 > Rational quaternion groups 
The classical modular group PSL2(Z) (chapter 35) was obtained as follows: we 


e started with the matrix algebra M2(Q), 

e looked inside for integral elements to find the order M>(Z), 

e took its unit group GL2(Z), and finally 

e restricted attention to the Fuchsian group PSL2(Z) to get a faithful action on 
the upper half-plane by orientation-preserving isometries. 


The wonderful thing that gives life to this part of our monograph is this: the same 
thing works if we replace M2(Q) with a quaternion algebra B over a global field! In 
this section, we derive key aspects of this program for quaternion algebras over Q ina 
self-contained way, before embarking on a more general study in the remainder of the 
chapter. 

In section 32.1, we already dealt with the case when the quaternion algebra was 


a, 


Q 


loss of generality (for convenience of presentation), we may further suppose a, b > 0 


„b 
and both a, b € Z. To be indefinite means that B @g R = — = M>(R); we obtain 


definite, finding a finite unit group; so here we take B = indefinite. Without 


such an embedding via a conjugate of the left-regular representation (2.3.2) 


loo: B => Md(R) 
: : t + xya (y + org (38.1.1) 
t+xityjtzkph jean i. 


This embedding is not unique, but another embedding would correspond to post- 
composition by an R-algebra automorphism of M2 (R), which by the Skolem—Noether 
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theorem (Corollary 7.1.4) is given by conjugation by an element of GL2(R), and so we 
can live with a choice and this ambiguity. 
Let 
O := Zli, j) = Z + Zi + Zj + Zk 


then O C B is an order, and t..(O) © M2(Z) consists of the subset of matrices in 
(38.1.1) with ¢, x, y, z € Z. Following the script above, we define 


O! := {y € O* : nrd(y) = 1} 


and let 
T! (O) := (O')/{+1} € PSL2(R). 


Lemma 38.1.2. The group T! (O) < PSL(R) is a Fuchsian group. 


Recall that a Fuchsian group (Definition 34.7.3) is a discrete subgroup of PSL2(R); 
by Theorem 34.2.1, a Fuchsian group acts properly on the upper half-plane H? by 
orientation-preserving isometries. 


Proof. Because T!(O) is a group, it suffices to find an open neighborhood of 1 con- 
taining no other element of T '(O). We take 


U= E a € PSLa(R) : Ig11 — 1l, lgi2, 1821|, 1g22 — 1| < i}. 
821 822 
If y = (gi); € UOT '(O), then 
[2(t — 1)| = lgu +82 -2| < lgu — 1 4+ 1e22 - 1) < 1 
[2y Vol = |gi2 + gail < Igi2l + Igail < 1 


and since t, y € Z we have t = 1 and y = 0. Then 


IxVal = lgu] < 1/2 
IzVab| = |g12| < 1/2 


and since a, b, x, z € Z with a, b + 0, we conclude x = z = 0, andy = +1. o 


If B ~ M(Q), then T!(O) = PSL(Z) and we investigated this case already in 
detail. So suppose from now on that B # M2(Q), or equivalently that B is a division 
algebra over Q. 


Proposition 38.1.3. The quotient 1'(O)\H? is compact. 


This proposition is analogous to the finiteness of the class set (as in section 17.5), 
and the proof is again inspired by the geometry of numbers. We give a proof in Main 
Theorem 38.4.3. 

The compactness result above implies nice properties for [!(O), read off from a 
fundamental domain (as presented in section 37.1). Let zo € H? have trivial stabilizer 
Stabrı (O) zo = {1}. Then the Dirichlet domain 


mt = u(r! (O); zo) = {z € H? : p(z, zo) < plyz, zo) for all y € '(O)} 
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is a closed, locally finite fundamental domain for T! (O) with geodesic sides by Theorem 
Sh LZ. 


Corollary 38.1.4. x is a compact, finite-sided hyperbolic polygon, and the group 
T1(O) is finitely presented. 


Proof. We write T = T! (O). Since T\ H? is compact by Proposition 38.1.3, the distance 
p(T zo, Iz) for Iz € X(T) is bounded. Thus by construction, the Dirichlet domain is 
contained in a bounded set and is therefore compact. Since X is locally finite, we 
conclude that yx N x + 0 for only finitely many y € T. But the set of such elements 
are the side pairing elements and they generate [ (Theorem 37.3.1), so X has finitely 
many sides and I is finitely generated. Thus x has finitely many vertices, the set of 
vertex cycle relations is finite, and these generate the relations (Proposition 37.3.14). 
Thus T is finitely presented. o 


Two subgroups H, H2 < G are commensurable if H; A H2 has finite index in both 
H, A. 


38.1.5. If T < PSL2(R) is commensurable with T!(O), we say that T is an arithmetic 
Fuchsian group with defining quaternion algebra B. This definition is independent 
of the choice of order O: another suborder or superorder has finite index, so the 
corresponding unit groups will also have finite index. 

For every Fuchsian group I < PSL2(R) commensurable with T '(O), the conclu- 
sions of Proposition 38.1.3 and Corollary 38.1.4 remain true: for a containment I” < T 
of finite index, the corresponding map I’’\H? — T\H? is finite-to-one, and the desired 
properties pass from one quotient to the other. 


In this way, we have completed our task: starting with an indefinite quaternion 
algebra B over Q, we constructed a Fuchsian group T! (O) generalizing PSL2(Z) acting 
on the upper half-plane. We pursue a more general construction in this chapter (con- 
sidering an indefinite quaternion over a number field) and its geometry and arithmetic 
properties in the remainder of this book. 


38.2 Isometries from quaternionic groups 


In the remainder of this chapter, we investigate discrete groups obtained from unit 
groups of quaternion algebra over number fields. Throughout, let F be a number field 
with r real places and c complex places, so that [F : Q] =r+2c =n. Let B be a 
quaternion algebra over F. 


38.2.1. Suppose that B is split at t real places and ramified at the remaining r — t real 
places, so that 


B > Bp := B 89 R © MRY x H™ x Ma(C)°. (38.2.2) 


Let 
t= (t,.--5l¢4¢)1 B > Ma(RÝ x Mo(C)* (38.2.3) 
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denote the map (38.2.2) composed with the projection onto the matrix ring factors. 
Then as long as t + c > 0, we have 


u(B*) c GL2(R)' x GL2(C)°. (38.2.4) 


We have t = c = 0 if and only if F is totally real and B is totally definite; in this 
case, the geometry disappears, and we are reduced to considering finite unit groups 
(see section 32.3). There is still more to say for this case, and we will return to it in 
chapter 41 in the context of modular forms. But in this part of the text we are following 
geometric threads, so 


suppose from now on that t + c > 0; (38.2.5) 


this is another way of saying that B is indefinite or equivalently satisfies the Eichler 
condition (Definition 28.5.1). 


38.2.6. We now restrict to a subgroup acting faithfully via orientation-preserving 
isometries. Recall that 


FX, = {x € F* : v(x) > 0 for all real places v} 
is the group of totally positive elements of F. Let 
Be, := {a € B* : nrd(a) € Fy} (38.2.7) 


be the group of units of B of totally positive reduced norm. Then F* C B*) because 
nrd(a) = &@ € F% for all a € F*. Let 


PRS, := B%9/Z(B%o) = Boy (38.2.8) 
be the quotient by the center; then z induces an inclusion 
Pi(Bx,) C PSL2(R} x PSL2(C)° 


(we have PSL2(C) = PGL2(C) and PGL; (R) ~ PSL2(R), rescaling by the determi- 
nant). Therefore, the group Pu(BE,) acts on 


H := (H’)! x (H°)° (38.2.9) 


on the left faithfully by linear fractional transformations as orientation-preserving 
isometries (Theorems 33.3.14 and 36.2.14), that is to say 


Pu(BX,) C Isom* (H). (38.2.10) 
Remark 38.2.11. One can equally well consider Pu(B*) C Isom(H). Indeed, 
PSL2(R)! x PSL2(C)*° = Isom (H? x Isom*(H#)° < Isom*(H) 


is the subgroup of isometries that preserve each factor and preserving orientation. The 
full group Isom(#1) includes more: permuting factors of the same kind is an isometry, 
and with these we have an isomorphism of groups 


Isom(H) = (Isom(H?)! x Isom(H?)°) x (S, X Se) (38.2.12) 
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where Sm denotes the symmetric group. Since PSL2(R) = Isom*(H7) < Isom(H7) has 
index 2 and the same for HŽ, we conclude that 


[Isom(H) : PSL2(R)' x PSL2(C)°] = 2°*¢t!e!. 


The orientation-preserving subgroup Isomt(H) < Isom(H) has index 2, contain- 
ing for example elements which reverse orientation in two components and preserve 
orientation in the others; if the orientation is reversed in two components H?, then the 
resulting isometry is orientation-preserving but not holomorphic in these components. 


38.3 Discreteness 


We now seek discrete subgroups, as follows. 
38.3.1. Let R = Zr be the ring of integers of F and let O C B be an R-order. Let 
O! := {y € O* : nrd(y) = 1} < O% (38.3.2) 
be the subgroup of units of reduced norm 1, and let 
PO! := O'/Z(O') = O! {+1} 


38.3.3 
T!(O) := Po!) ) 


Then by (38.2.10), we have T!(O) < Isom*(H). 


Definition 38.3.4. A subgroup [ < Isom*(H) is arithmetic if I is commensurable 
with T!(O) for a quaternion algebra B and an order O C B (with respect to some 
embedding 1). 


If T is commensurable with T! (O) for an order O then it is commensurable with 
T1(0’) for every other order O’, since every two orders have finite R-index, thus finite 
index—so we could equally well compare to one fixed (e.g. maximal) order. The class 
of arithmetic groups contains in particular the quaternionic unit groups with which 
we started, but contains other groups of interest (including subgroups and discrete 
supergroups with finite index). 

Remark 38.3.5. There is a more general definition of arithmetic group which reduces 
to this one; see section 38.5. 


Right away, we show that arithmetic groups are discrete. To do so, we will need 
two short lemmas. 


Lemma 38.3.6. Let K, X be Hausdorff topological spaces, and suppose K is compact. 
Let n: K x X — X be the projection, and let Y C K x X be discrete and closed. Then 
m(Y) € X is discrete. 


Proof. First, Y has no limit points: a limit point of Y would belong to Y, but then Y is 
discrete so every point is isolated point. For the same reason, every subset of Y is also 
(discrete and) closed: a limit point of the subset would be a limit point of Y. 

Now let x € 2(Y) and let Y, = Y\a7!(x). Then Y, C K x X is closed. The set 
(K x X)\¥, is open and contains K x {x}, so by the tube lemma, it contains an open 
set K x U. Then U 5 x is the desired neighborhood. oO 
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Lemma 38.3.7. Let G be a Hausdorff topological group and let H < G be a discrete 
subgroup. Then H is closed. 


Proof. Since H is discrete, there is a neighborhood U > 1 such that UN H = {1}. 
Further, there exists a neighborhood V C U such that V-'V c U (multiplication and 
inversion are continuous, see Exercise 12.4). 

We show GNH is open. For x € G, we have xV > x an open neighborhood, and if 
hh’ € xV A H then x-'h,x7'h’ € V and so (x7! hy! th’) = Ah’ € VIV CU. 
Therefore h~'h’ = 1 by the hypothesis on U, so h = h’. Thus xV contains at most one 
element of H. Since G is Hausdorff, when x ¢ H we can shrink V if necessary to get 
xV N H = 0, as desired. oO 


Proposition 38.3.8. Let T < Isom*(H) be an arithmetic subgroup. Then T is discrete. 


Proof. It is enough to prove the proposition for I = T'!(O), as discreteness is preserved 
between commensurable groups (having finite index in their intersection). 

The image O —> Bp as in (38.2.2) is discrete by 17.7.6: we argued using coordinates 
and noted that R = Zp — Fr is discrete. Therefore the image 


O! => BL = (H~ x SL2(R)! x SL2(C)* (38.3.9) 


is discrete (by restriction). Further, since H! is compact, by Lemmas 38.3.6 and 38.3.7, 
the image of O!  SL,(R) x SL2(C)* under the projection is discrete. (Any further 
projection turns out not to be discrete; see Exercise 38.2.) o 


38.3.10. The group T! (O) is a Fuchsian group if and only if t = 1 and c = 0, i.e. F is 
totally real and B is ramified at all but one real place; T!(O) is a Kleinian group if and 
only if t = 0 and c = 1, i.e. F has exactly one complex place and B is ramified at all 
real places. 


Just as for Fuchsian and Kleinian groups, discrete groups admit several equivalent 
characterizations as follows. (For the notion of a good orbifold, see Definition 34.8.10.) 


Proposition 38.3.11. Let T < Isom*(H) be a subgroup. Then the following are equiv- 
alent: 


(i) T is discrete (with the subspace topology); 
(ii) For all z € H, we have #Stabr(z) < œ and there exists an open neigborhood 
U > z such that yU N U + O implies y € Stabr(z); 
Gii) For all compact subsets K C H, we have K N yK + 0 for only finitely many 
y €T; and 
(iv) For all z € H, the orbit Vz C H is discrete and # Stabr(z) < ©. 


Moreover, if these equivalent conditions hold, then the quotient T\H has the structure 
of a good Riemann orbifold of dimension m = 2t + 3c, and the quotient map 


nm: H —>T\H 
is a local isometry at all points z € H with Stabr(z) = {1}. 


Proof. We proved this statement in Propositions 34.7.2 and 36.4.1 when t = 0 or 
c = 0; the general case follows similarly. o 
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38.4 Compactness and finite generation 


We now consider further properties of arithmetic groups. Refreshing our notation, let 
T < Isom*(H) be an arithmetic group arising from a quaternion algebra B. 


38.4.1. The arithmetic discrete groups arising from the case B = M>(F) of the matrix 
ring are of particular interest: they include the case F = Q giving rise to the classical 
modular group I’ = PSL2(Z) studied in chapter 35 as well as the case F = Q(i) 
giving rise to the Picard group T = PSL2(Z[i]) examined in section 36.6 and more 
generally the Bianchi groups. Let B = M2(F). Then necessarily r = t (the matrix ring 
is already split!), soH = (H?) x(H?)°, and the embedding ¢ in (38.2.3) has the simple 
description 
t: Mo(F) > Mo(F yp = MaR)" x Ma(C)° 


_fa b ay b, (38.4.2) 
ole al” cy dy}), 


where we embed matrices componentwise, indexed by the archimedean places of F. 

Much is written about the compactification of Y(I), and unfortunately it would 
take us too far afield to fully treat this important topic: the groups so obtained behave 
differently in several respects than the case where B is a division algebra. 


We suppose throughout this section that B is a division algebra. 
Let X(T) := F\H be the quotient, a good Riemann orbifold. 


Main Theorem 38.4.3 (Hey). The orbifold X(T) is compact. 


Proof. We follow Zassenhaus [Zas72, §1]; see also Kleinert [K1t2000, Theorem 1.1]. 
First, the group I is commensurable with (O!) for an R-order O; by comparison 
under maps of finite index, we may suppose = T! (O). 
Second, we claim that it suffices to show that O'\BL is compact. Indeed, recall 
(38.3.9) that 
O! < B} = (Hl!) x SLR} x SLC). 


From the symmetric space models (34.6.3) and (36.3.14), we have homeomorphisms 
SL2(R)/ SO(2) > H? 
SL2(C)/ SU(2) > H; 

so the fibers of the continuous projection map 
O'’\Be > T\H =Y) 


are given by 
(H')’* x SOR) x SU(2)° 


and therefore compact, and the claim follows. 
Now from (38.2.2), we have 


O > Bp = M)(R)’ x H™™ x M,(C)°; 
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choosing an R-basis for Bg, we have Br = R^” and under the standard metric, this gives 
O = Z“ the (non-canonical) structure of a Euclidean lattice. Let X C Bp bea compact, 
convex, symmetric subset with volume vol(X) > 24" covol(O). (The precise value of 
this covolume will not figure in the argument, and anyway would depend on the choice 
of Euclidean structure; all such structures induce the same topology.) Therefore, by 
Minkowski’s convex body theorem (Theorem 17.5.5), there exists nonzero œ € ON X. 
Moreover, for all g = (gy) € Br we have 


vol(gX) = vol(X) 


since [],, det(g,) = 1, and gX is again compact, convex, and symmetric, so similarly 
there exists nonzero ag € ON gX. 

We will show that the quotient space o! \Bp is sequentially compact. To this end, 
let g„ be a sequence from B}. By the previous paragraph, there exist œ„ € O such that 
Qn = 8nXn With x, € X nonzero. Since X is compact, we can restrict to a subsequence 
such that x, — x € X converges. 

The reduced norm nrd: B — F extends by scaling to a continuous function 
nrd: Br — Fp. Since X C Bp is bounded so too is nrd(X) C Fr bounded. But 


nrd(q@,,) = nrd(g,,) nrd(x,,) = nrd(x,,) € nrd(X), 


and the values nrd(a,,) € nrd(O) € Zr lie in a discrete subset, so there are only finitely 
many possibilities for nrd(a,,). Moreover, the left ideals [,, = Oa, have 


NU) = Nmpjo(nrd(I,))” = Nmpjg(nrd(an))” (38.4.4) 


bounded, so there are only finitely many possibilities for 7, by Lemma 17.7.26. Thus, 
by the pigeonhole principle and restricting to a subsequence, we may suppose J, = 
Oa, = Oa, and nrd(a@,) = nrd(q@}) for all n. Therefore a, = y,a; with yn € O! for 
all n. 

To conclude, we note that since B is a division algebra, 


nrd(x,) = nrd(a,) = nrd(a,) + 0 


so 
x,! = x, nrd(x,) | = X;nrd(q,)! > xnrd(a,) | = x! 
converges. Therefore 


-1 -1 -1 -1 -1 
Yn &n = Yn QnXn D aX, > wx 


converges as well. Therefore the quotient o! Wr is sequentially compact, and therefore 
compact. o 


Remark 38.4.5. Main Theorem 38.4.3 was proven by Hey [Hey29, Hilfssatz 4] in her 
1929 Ph.D. thesis (see also Remark 29.10.24) in the case where B is a division algebra 
over Q. In particular, there is no need to suppose that B is central, so it contains the 
Dirichlet unit theorem as a consequence: see Exercise 38.6. 

After treating the decomposition of the adelic coset space, we will also see Hey’s 
theorem as an essentially direct consequence of Fujisaki’s lemma (Main Theorem 
27.6.14): see Theorem 38.7.21. 
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Theorem 38.4.6. The group T is finitely generated. 


Proof. Let xo € X(T) satisfy Stabr (xo) = {1}. Then by Theorem 37.4.18, the Dirichlet 
domain X = X(T; xo) is a locally finite fundamental set for I. By Main Theorem 
38.4.3, X(T) is compact, so too is x. Then since X is locally finite, we conclude that 
yH N x + 0 for only finitely many y € T. But then by Theorem 37.4.2, the group T is 
generated by such elements and T is finitely generated. m 


Remark 38.4.7. In fact, IT is finitely presented, and an argument similar to Proposition 
37.3.14 shows that orbits of nonempty subsets 


qoaNyany’ a 


with y # y’ provide a finite generating set of relations (generalizing the vertex cycle 
relations): see e.g. Raghunathan [Rag72, Theorem 6.15]. 


38.5 x Arithmetic groups, more generally 


In this section, we briefly discuss more general definitions of arithmetic group and 
show that they reduce to the “working” one given above (Definition 38.3.4). As above, 
let F be a number field. 


38.5.1. Let G < GL,,r be a linear algebraic group, a subgroup variety of GL), 

defined by polynomial equations in the entries and the inverse of the determinant, with 

coefficients in F. Equivalently, G is an affine variety over F equipped with identity, 

multiplication, and inversion morphisms giving it the structure of a group variety.) 
We say a subgroup 


I < GF») = | [a < | [ GLA) 


vloo yloo 


is arithmetic (as a subgroup of G(F,.)) if itis commensurable with G(Z p). This notion 
of arithmetic group was developed significantly by Borel [Bor62, Bor69]. 


38.5.2. Let B be a quaternion algebra over F. Then there is an embedding p: B —> 
M4(F) as in Exercise 2.11 by the regular representation. Thus B* < GL4(F) and the 
image is described by explicit polynomial equations (Exercise 38.5). Therefore there 
exists a linear algebraic group G < GL4 p such that G(F) = B*. 

Similar statements hold for B! < SL4(F). 


Lemma 38.5.3. A group T commensurable with T! (O) for a quaternion algebra B and 
order O C B is arithmetic in the sense of 38.5.1. 


Proof. Applying 38.5.2, we have p: B <> M4(F) realizing B! ~ G(F) < GL4(F) as 
a linear algebraic group by appropriate polynomial equations. Under this embedding, 
O := p(B) A M4(ZrF) is a Zr -order, and thus O! = G(ZF) and II, GA) = TI, B}, as 
required in the definition. o 
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In view of Definition 38.3.4, we consider now the converse: when does the more 
general definition give rise to discrete subgroups of two-by-two matrices? 


38.5.4. Let G be a linear algebraic group over F, and suppose that GF ~ GL, F, so 
there is a chance to obtain discrete groups of symmetries like the ones considered 
above. (For a complete treatment, we should consider SL» as well as the group PGLo, 
but the arguments are similar.) We say then that G is an F'-form of the algebraic group 
GL». 


Lemma 38.5.5. Let G be an F-form of GL. Then there exists a quaternion algebra B 
over F, unique up to F-algebra isomorphism, such that G(F) = B™. 


Proof. This is a basic result in non-abelian Galois cohomology, and it would take us 
too far afield to prove it here: see e.g. Milne [Milne2017, Theorem 20.3.5] and more 
generally Serre [Ser79, Chapter X]. o 


Lemma 38.5.5 explains that more general notions of arithmetic groups do not create 
anything new beyond our quaternionic definition. 


Remark 38.5.6. In this context, there is a criterion for compactness, generalizing Main 
Theorem 38.4.3 (conjectured by Godement): A discrete subgroup of G(R) is cocompact 
if and only if the reductive part of the connected component of G is anisotropic over F. 
If G is semisimple, then cocompactness is equivalent to asking that every element of 
G(F) is semisimple. This criterion was proven by Borel-Harish-Chandra [BHC62] and 
Mostow—Tamagawa [MT62]; Godement [God62] (with Weil) extended the method of 
Mostow—Tamagawa and simplified the proof by working directly on adele groups. See 
also Platonov—Rapinchuk [PR94, $4.5]. 


38.6 =» Modular curves, seen idelically 


We have already seen how idelic methods can be both a conceptual and a computa- 
tional simplification. The quaternion groups defined above naturally also fit into this 
perspective, and we describe this in the final two sections. As motivation, we begin in 
this section by reconsidering the classical modular curves from an idelic point of view. 


38.6.1. Recall that the adeles of Q decompose as 
Q=QxR 
into finite and infinite parts. Let B = M2(Q). Then 
B = M2(Q) = M2(Q) x M2(R) = B x Bo. 


The order O = M2(Z) is maximal in B, and we have the adelic order O= M>(Z) Cc 

M2(Q). (We have seen that B = M2(Q) < B = M2(Q) sits discretely and the quotient 

B\B is compact; like the adelic quotient F\F itself, this is not very interesting.) 
Similarly, we have 


BX = GLQ) = B x BX = GL2(Q) x GL,(R). (38.6.2) 
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It was a key consequence of strong approximation—but easy to establish in this case 
(Lemma 28.2.4)—that 


~, a ~ a 

B*\B*/O = GLo(Q)\GL2(Q)/ GL2(Z) = {1} (38.6.3) 

is trivial: every nonzero right (invertible) fractional M>(Z)-ideal is principal, or equiv- 
alently, every Z-lattice in Q? has a basis in Q?. 


As lovely as this is, this description leaves out the real place, and by putting it back 
we restore archimedean structure. 


38.6.4. The projection map 
GL2(Q) = GLa(Q) x GL2(R) > GL2(Q) 
yields a continuous projection 


GL2(Q)\GL2(Q)/ GL2(Z) > GL2(Q)\GL2(Q)/ GL2(Z) = {1} 


by (38.6.3). Therefore, every element of GL2(Q) is represented in the double coset by 
an element of the form (1, @.) with @» € GL2(R); and the element o is well-defined 
up the action of the group of pairs (y, 4) € GL2(Q) x GL2(Z) satisfying 


soy =p! € GL(Q) N GL,(Z) = GL2(Z), acting on the left. In other words, we have 


a bijection J 
GL2(Q)\GL2(Q)/ GL2(Z) e GL2(Z)\\GL2(R). (38.6.5) 


38.6.6. At this point, we are no stranger to the quotient GL2(Z)\ GL2(R)! We have stud- 
ied in detail the related quotient SL2(Z)\ SL2(R): by the symmetric space description, 
we have an isometry 

SL2(R)/ SO(2) > H? 


nies (38.6.7) 


We similarly obtain a bijection 
GL2(R)/R* SO(2) > H** = C\R (38.6.8) 


(any matrix of negative determinant interchanges the upper and lower half-planes, so 
we maintain a bijection). Then we can take the quotient on the left by GL2(Z) to get 
an identification 


Y(1) = PSL;(Z)\H? = GL2(Z)\GL2(R)/R* SO(2) (38.6.9) 


from (38.6.7), and we find again the classical modular curve we considered in section 
35.1. (In section 40.1, we will see another version of this in that the space Y(1) 
parametrizes complex lattices A C C up to scaling by C*.) 
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Putting (38.6.9) and (38.6.5) together, we have 
Y(1) > GL2(Z)\H*™* 
© GL»(Z)\GL2(R)/R* SO(2) 
© (GL2(Q)\GL2(Q)/ GL2(Z))/(R* SO(2)) (38.6.10) 
= GL2(Q)\(GL2(Q)/ GL2(Z) x GL2(R)/R* SO(2)) 
= aX 
= B*\(B*/O Xx BY /Ko) = B*\B™/K; 
in the last line, we introduce the notation Kœ := R* SO(2) and K := on x Kə to help 
in grasping this double coset. Chasing down all of the maps, the bijection is obtained 


by sending an element z € H* to the class of (1, a) € GL2(Q) x GL»(R) where z = ai. 
Note there is a nice symmetry in the expression on the right-hand side of (38.6.10). 


Remark 38.6.11. In this way, we just wrapped the classical quotient in ideles. This 
double cosetification (a beautiful monster of a word) provides a uniform way to describe 
the orbifold quotients obtained from quaternionic arithmetic groups more generally: in 
particular, class number issues are made more transparent in the language of double 
cosets. On the other hand, geometric structures are not always visible in this language, 
which is why we have treated both approaches in this text. 


38.7 » Double cosets 


In this section, we give a description of quaternionic orbifolds in terms of idelic double 
cosets. We retain the notation from 38.2.1, 38.2.6, and 38.3.1. In particular, F is a 
number field with r real places and c complex places, B is a quaternion algebra over F 
that is split at t real places with t + c > 0, we have an embedding 


i: B > Mp(R)' x Mo(C)°. 


We have defined 
H= (HZ x (H>)° 


where we may also write 
H? © SL2(R)/SO(2) and H? e SL2(C)/SU(2) 
by the symmetric space decomposition. We similarly define 
H* = (H?=FF xy 
where similarly 
H?* e GL,(R)/R* SO(2) and H? e GL,(C)/C* SU(2). (38.7.1) 


In this chapter we have been working with groups I < Isom*(H) commensurable 
with r!(0O) = Pu(O!). To work idelically, we restrict our attention to such groups that 
can be defined idelically: these are the congruence subgroups. 
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Definition 38.7.2. Let Wt be a nonzero ideal. Let 
OO := {a € O : æa € R + MO}. 
The principal congruence subgroup of level Mt C R is the group 
Toot) = THO) := P(O(M)!) < Isom*(H) 


obtained from the units of norm 1 in O(N). 
A group I commensurable with IT '(O) is congruence if it contains T(N) for 
some M. 


A congruence subgroup is defined by (finitely many) congruence conditions. Let T 
be the closure of T with respect to the topology on B* , where observe that a fundamental 
system of open neighborhoods of 1 is given by the images of the principal congruence 
subgroups. 


Lemma 38.7.3. Suppose that T is commensurable with T1 (O). Then Fn B* =T if and 
only if T is congruence. 


Proof. The group Tis a closed subgroup of B* commensurable with the compact open 
subgroup Pa) and so Tis also compact open. By definition of the topology on B*, the 
closure of I := I N B* is the smallest congruence group containing I’, and T contains 
T with finite index. So T is congruence if and only if T =T. oO 


From now on, suppose that I is a congruence subgroup. 


Remark 38.7.4. In general, we may work with the congruence closure FN BX >T of 
an arithmetic group T. 


38.7.5. In view of (38.6.10) and (38.7.1), we let Kæ := [],).. Ky where 

R*H! = H*, ifv is real and ramified in B; 

K, := 4 R* SO(2), if v is real and split in B; (38.7.6) 
C* SU(2), if v is complex. 


The groups K, are the extension of a maximal compact subgroup by the center. We 
then let K = T x Kœ and consider the double coset space 


Ysn(T) = B*\B*/K (38.7.7) 


where B* acts on B* by left multiplication (under the diagonal embedding) and K acts 
on BX = BX x BX by right multiplication. 

The expression (38.7.7) is tidy, and generalizes well, but we also want to know 
what it looks like. Plugging back in (38.7.1), we have BX /Kœ = H*, so 


Ysn(V) = BX \((BX/T) x H*) (38.7.8) 


where B* acts on H* via ı and on BX/T by left multiplication. Since there can hopefully 
no confusion about this action, removing parentheses we will write 


Ysp(V) = BX\(BX x H*)/F. 
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Definition 38.7.9. We call Ys,(I) the quaternionic Shimura orbifold of level T. 


Definition 38.7.9 explains the subscript ,,. We immediately proceed to justify the 
name orbifold; moreover, we will see that the space is possibly disconnected, and we 
write it as a union of connected components. 


38.7.10. By weak approximation, there exist elements œ € B* with nrd(q) having all 
possible real signs at the split real places of B. Therefore 
Yau) = BX \B* x H/T. (38.7.11) 
38.7.12. There is a natural (continuous) projection map 
Ysa) > BŽ \B* /T. (38.7.13) 
Recall that ż + c > 0, and B is indefinite. Therefore, strong approximation (as in 
Corollary 28.6.8) implies that there is a bijection 
nrd: BX,\BX/T > FX)\F*/ard(P) =: Clq, R. (38.7.14) 


Therefore Clon R is a (narrow) class group of F associated to the group I; as such, 
it is a finite abelian group that surjects onto the strict class group Cl* R. 
38.7.15. By 38.7.12, the space Ysp(T) is the disjoint union of finitely many connected 
components indexed by the group Clon) R. We identify these connected components 
explicitly as follows. 

Let the ideals b C R form a set of representatives for Clon) R, and let 6 = b 8z Z 
be their adelification; then for each b, there exists bER generating b, so 

DROR=b6. 

For simplicity, choose b = R and b = 1 for the representatives of the trivial class. 

By surjectivity of the map (38.7.14), for each b there exists B € B* such that 


nrd(ß) = = b. Therefore EON 
Ysn(L) = |_| Be (BT x H). (38.7.16) 
b 


For each 5, let oer 
Ty := BIB A BS, (38.7.17) 

Then we have a natural bijection 
BX, (Bl x H) © Ty\H 


GE ee (38.7.18) 


Letting 
Y (To) := To\H 


we see that each Y (Tẹ) is a connected orbifold of dimension 2t + 3c. We abbreviate 
Y(T) = Y((q)) for the trivial class. Putting these together, we have 


Ysa) = | |v) (38.7.19) 
b 


as a disjoint union of connected orbifolds. 
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Remark 38.7.20. When c = 0, i.e., F is totally real, then Ysy(I) can be canonically 
given the structure of an algebraic variety defined over a number field, by work of 
Shimura [Shi67] and Deligne [Del71]; in this case we upgrade Y sn(T`) to a quaternionic 
Shimura variety. The theory of Shimura varieties is both broad and deep—see Milne 
[Milne-SV] and the references therein. We will begin the study quaternionic Shimura 
varieties of dimension 1 in Chapter 43, touching upon the theory of canonical models 
in section 43.8. 


The above description has hopefully provided a more transparent way to understand 
arithmetic orbifolds. For example, we can prove an idelic version of Hey’s theorem 
(Main Theorem 38.4.3) as follows. 


Theorem 38.7.21. Suppose that B is a division algebra. Then Ys»(1) is compact. 


Proof. We appeal to Fujisaki’s lemma (Main Theorem 27.6.14): the quotient B*\B® 
is cocompact, under the important hypothesis that B is a division algebra. Let Kœ be 
as in 38.7.6, and consider the inclusion followed by the projection 


B*\B® — B*\B* > B*\B™/K = Ys). (38.7.22) 


These maps are continuous, and the composition is surjective as R* < K» (embedded 
diagonally) and R*B“) = B*. So the target is compact, proving the statement. oO 


Exercises 


In these exercises, we maintain the notation in this chapter: let F be a number field 
with r real places and c complex places, degree n = [F : Q], and ring of integers R, 
and let O be an R-order in a quaternion algebra B over F. 


1. Let a € R\Q. Show that Z[qa] is not discrete in R. (Taking e.g. a = Vd, this 
gives a reason to worry about discreteness of number fields when we project.) 


2. Embed O! diagonally in SL2(IR)’~? x SL2(C)°. Show that a (further) projection 
to a proper factor is not discrete. 
a,b S 1 1 
3. Let B = FP and let v bea split real place of B. Show thatO° — B, ~ SL2(R) 


if and only if F is totally real and for all nonidentity real places v’, we have 
v’(a) < 0 and v(b) < 0. 


4. In this exercise, we give a direct argument for the discreteness of an arithmetic 
Fuchsian group. Suppose F is totally real, let v be a split place of B, consider 
F — v(F) C R as a subfield of R, and suppose that B is ramified at all other 
(nonidentity) real places. Prove that O! c SL,(R) is discrete. 


5. Consider the regular representation p: B — M4(F) (Exercise 2.11). Describe 
the image explicitly in terms of polynomial equations in matrix entries. Conclude 
that B* and B! are also described by polynomial equations. 
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(a) Suppose not: then there exists a sequence @n = tn + Xni + Ynj + Znij > 1 
with tn, Xn, Yn, Zn € F with bounded denominators. Multiplying through, 
suppose that all coordinates are integral. Show for n sufficiently large that 
all of the coordinates are integral and bounded. 

(b) Show that for all nonidentity v, the coordinates of v(a,,) are also bounded 
using compactness. 

(c) Finally, prove that there are only a finite number of elements in R that are 
bounded in each coordinate (all conjugates are bounded). [Hint: look at 
the coefficients of a minimal polynomial, and derive a contradiction. ] 


6. Let F be a number field and R = Zp its ring of integers. In this exercise, we 
give a proof of Dirichlet’s unit theorem using the same method as in the proof 
of Main Theorem 38.4.3. 


(a) Show that [R* : R!] < 2. 

(b) Show (following the proof of Main Theorem 38.4.3) that R! eae is compact. 

(c) Under the usual logarithmic embedding log : F} > (IL R)?, conclude 
that log R!\ log FR is compact, and therefore log R! has rank r +c — 1 as 
an abelian group (written additively). 

(d) Conclude that R* has rank r + c — 1 as an abelian group (written multi- 
plicatively). 


This chapter is licensed under the terms of the Creative Commons Attribution- 
NonCommercial 4.0 International License (http://creativecommons.org/licenses/by- 
nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and 
reproduction in any medium or format, as long as you give appropriate credit to the 
original author(s) and the source, provide a link to the Creative Commons license and 
indicate if changes were made. 

The images or other third party material in this chapter are included in the chapter’s 
Creative Commons license, unless indicated otherwise in a credit line to the material. If 
material is not included in the chapter’s Creative Commons license and your intended 
use is not permitted by statutory regulation or exceeds the permitted use, you will need 
to obtain permission directly from the copyright holder. 


Check for 
updates 


Chapter 39 


Volume formula 


In this chapter, we exhibit a formula for the covolume of a quaternionic group, a formula 
with many applications. 


39.1 > Statement 


We saw in (35.1.5) that the hyperbolic area of the quotient SL2(Z)\ H? can be computed 
directly from the fundamental domain 


n = {z € H? : |Rez| < 1/2 and |z| > 1} 


as 


area(SL2(Z)\HÔ = area(x) = f n = a (39.1.1) 
na y 3 
In fact, given a Fuchsian or Kleinian group I, the hyperbolic area or volume of the 
quotient [\H (where H = H?, H?, respectively) can be computed without recourse to 
a fundamental domain: it is given in terms of the arithmetic invariants of the order and 
quaternion algebra that give rise to I. 
To begin, we consider the already interesting case where the quaternion algebra is 
defined over Q. 


Theorem 39.1.2. Let B be a quaternion algebra over Q of discriminant D and let 
O C B be a maximal order. Let Tj(D) < PSL»(R) be the Fuchsian group associated to 
the group PO! = O! /{+1} of units of reduced norm 1. 


Then 7 
area(Ij(D)\H*) = zD) (39.1.3) 
where i 
y(D) := [o-v=e[](! = 2) (39.1.4) 
piD pip\ P 


Recall that the discriminant D € Z is a squarefree positive integer, so the function 
y is just the Euler totient function. Theorem 39.1.2 recovers (39.1.1) with D = 1. 
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Example 39.1.5. Recall the case x} (6) from section 38.1. We confirm that the hyper- 

bolic area computed from the fundamental domain agrees with the formula in Theorem 

391.2: 5 
area(X) (6, D) = EPOD = =. 

The expression (39.1.3) is quite similar to the Eichler mass formula (Theorem 
25.3.15). Indeed, the method of proof is the same, involving the zeta function of the 
order O. For the case of a definite quaternion algebra, when the unit group was finite, 
we used Theorem 26.2.12 to relate the mass to a residue of the zeta function (see in 
particular Proposition 26.5.10); for an indefinite quaternion algebra, to carry this out in 
general would involve a multivariable integral whose evaluation is similar to the proof 
of Proposition 26.2.18 (the commutative case)—not an appealing prospect. That being 
said, this type of direct argument with the zeta function was carried out by Shimizu 
[Shz65, Appendix] over a totally real field F. 

We prefer instead to use idelic methods; we already computed the normalized 
Tamagawa measure vB! \B') = 1 (see Theorem 29.11.3), and this number has 
everything we need! It is a much simpler computation to relate this to the volume of 
the hyperbolic quotient: we carry this out in section 39.3. 


The following notation will be used throughout. 


39.1.6. Let F be number field of degree n = [F : Q] with r real places and c complex 
places, so r + 2c = n. Let B be a quaternion algebra over F of discriminant D that is 
split at t real places. Suppose that B is indefinite (i.e., t + c > 0). 

Let R = Zr be the ring of integers of F. Let O C B be an R-order of reduced 
discriminant Yt that is locally norm-maximal. Let 


H -= (H? x (H?)° 
and let 
T!(O) < PSL,(R)' x PSL2(C)° DH 


be the discrete group associated to the group PO! = O! /{+1} of units of O of reduced 
norm 1. 


For a prime p | N with Nm(p) = q, let (2) € {-1,0, 1} be the Eichler symbol 
(Definition 24.3.2), and let 
= 1+1/q, if(O| p)=1; 
_ 2 
A(O, p) := A N =4;1-1/q4, if(O|p)=-1; (39.1.7) 
Ñs (2) Nm(p)! [1-1/q2, if(Olp)=0. 


Main Theorem 39.1.8. With notation as in 39.1.6, we have 


(47)! 
on De ir Dd NMO) I] AO, p). (39.1.9) 


1 oe a A ee 
vol(T (ONH) = (422) (82 pN 
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Main Theorem 39.1.8 is proven as Main Theorem 39.3.1. Since ¢F (2) ~ 1, for 
algebras B with fixed signature r, s,t, we can roughly estimate 


vol(I!(O)) x d}? Nm(discrdO). (39.1.10) 


39.1.11. The constant factor in (39.1.9) is written to help in remembering the formula; 
multiplying out, we have 
UAn 1 
(4m2 (872€ ~ 92r+3c-2t-1 g2r+2c-t ` 
Note that the right-hand side of (39.1.9) is independent of the choice of order O in 
the genus of O, as it depends only on O. 


Remark 39.1.12. If O is not locally norm-maximal, then one corrects the formula by 
inserting factors [Ry : nrd(0%)] as in (39.2.10); since R* c O% we have Ke C nrd(O5), 
so these factors are at most 2 for each p. 

An important special case of Main Theorem 39.1.8 is the case where O is an Eichler 
order, generalizing Theorem 39. 1.2. 


Theorem 39.1.13. Suppose that O = Oo(Q) is an Eichler order of level IN and 
disc B = D, so N = DM. Write row = T! (O) and T! (1) for the group associated to 
a maximal order O(1) > Oo(QN). Then 

(4r) 3/2 


-br (2)dy PD) (WN) 


1 = 
vol T OWH) = AnD 8n 


where 


yD) = H(Zp/D)* = Nm] | (! - mas} 


m 
pear p 


We write p° || Dt to mean p° exactly divides M, i.e., ordp (D) = e, and we take 
the product over all prime power divisors of Wt in the definition of y. 
In particular, when O is a maximal order, then Mt = R and y(W) = 1. 


Remark 39.1.15. Theorem 39.1.13 is often attributed to Borel [Bor81], who derived 
it under the (highly nontrivial!) assumption that 7!(B!\B!) = 1. 


Example 39.1.16. Let T! = PSL2(Z[i]). Then 
2 2 
vol(X!) = —47/2¢,-(2) = — fp(2) = 0.3053... 
8x2 m2 


This agrees with the computation we did with a fundamental domain (see 36.6.8), 
since for y the character 36.6.14 of conductor 4, 


2 2 1 
72 SF (2) = EOL, x) = zL x). 
T H 
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39.2 Volume setup 


In this section, we setup a few calculations in preparation for the volume formula. 

Let F be number field with discriminant dp, let B be a quaternion algebra over F 
of discriminant D = disc B, and let O C B be an order. The key input to the proof is 
the following ingredient: from Theorem 29.11.3, 


7'(B'\B') = 1. (39.2.1) 


We will convert the volume (39.2.1) into the desired form by separating out the 
contribution from the finite places and the infinite (real) ramified places: what remains 
is the volume of the orbifold we seek, which we then renormalize from the adelic to 
the standard hyperbolic volume. 

We define pa 

O := 0x B% CB. 


Then 
O' = {y € O* : nrd(y) = 1} = O% N B' 
and 7 
O! = 0! x BL. 


Now we apply the important assumption: we suppose that B is indefinite. The 
hypothesis that B is indefinite is necessary even to get a nontrivial space H = (H?¥ x 
(H°)® for the group to act upon! The case where B is definite was handled in the proof 
of the Eichler mass formula (Main Theorem 25.3.19). 


39.2.2. By strong approximation (Corollary 28.5.12) we have B! = B'O!. Therefore, 
the natural inclusion 


O'"\0' > B'\B' 
is also surjective, hence an isomorphism. Thus by (39.2.1) 
t'(O'\0') = T'(B'\B') = 1. (39.2.3) 
We have an embedding O! = BL, so 
1 = T (0'\0') = 7'04 (O'\Ba). (39.2.4) 
39.2.5. If O is maximal, then from (29.7.25) we have 
TO) =| [OD =r Pr | | Wmp-ty! 


p peRam(B) 


so that ms 
TOD! = |dr P’ ér DYO). (39.2.6) 


So by (39.2.6), we conclude that 
To(O'\B}) = dp ¢r(2)9(D). (39.2.7) 
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39.2.8. In general, if O C O’ with O’ maximal, then 
HO = [0 7016 0) 


so similarly 


TA(O'BL) = dp! Lr ODO : 0. (39.2.9) 
If O is locally norm-maximal, then further 
[0 : 01] = [O* : 0% (39.2.10) 
and by Lemma 26.6.7, with N = discrd O we have 
ADIO : 0*1 = | [10 : O10, p) = N | [AO.p) 69.210 
pN plot 


(as in (26.6.8)). 


Next, we relate the measures on SL2(R) and SL2(C) to the corresponding measures 
on H? and H°. 


39.2.12. Recall the symmetric space identification 
H? > SL)(R)/ SO(2) (39.2.13) 


The hyperbolic plane H? is equipped with the hyperbolic measure u, the unique measure 
invariant under the action of PSL2(R); the group SL2(R) has the Haar measure ql, 
also invariant under the left action of SL2(R). Therefore, the identification (39.2.13) 
relates these two measures up to a constant v(SO(2)) that gives a total measure to SO(2) 


(normalizing its Haar measure). Ditto for H? and SU(2), with a constant v(SU(2)). 


Lemma 39.2.14. We have 
v(SO(2))=x and (SU) = 87’. 


Proof. One could compute the relevant constant by doing a (compatible) integral, but 
we prefer just to refer to an example where both sides are computed. We consider a 
fundamental domain for the action of SL2(Z) O SL»2(R) that is invariant under SO(2): 
for example, we can lift a fundamental domain for PSL2(Z) O H? under (39.2.13). 
The difference between SL2(Z) and PSL2(Z) = SL2(Z)/{+1} is annoyingly relevant 
here! We have 

PSL2(Z)\H* = PSL2(Z)\ SL2(R)/ SO(2); 


if we let SO(2)2 ~ SO(2)/{+1} be the rotation group acting by 20 instead of 6, then we 
can lift from PSL2(Z) to SL2(Z) and 

PSL2(Z)\ SL2(R)/ SO(2) = SL2(Z)\ SL2(R)/ SO(2)2 
which under compatible metrics gives 


T! (SL2(Z)\ SL2(R)) 


SO(2)) = 2v(SO(2)2) = 2 
ME NOES SH 


(39.2.15) 


736 CHAPTER 39. VOLUME FORMULA 


On the bottom we have p(SL2(Z)\H7) = 7/3 by (35.1.5) and on the top we have 
t!(SL2(Z)\ SL2(R)) = 6Q)= n /6 by (39.2.9). Plugging in, we compute 
n?/6 
v(SO(2)) = 2—— = 2. (39.2.16) 
1/3 
Similarly, 


v(SU(2)) = y T (SLa(Zli)\ SLo(©)) = 9 £ toD = 8r- (39.2.17) 
u(PSL2(Z{i])\H?) (2/n?)Sqa(2) = 


by Example 39.1.16 and again (39.2.9). 


39.3 Volume derivation 


We now establish the volume formula (Main Theorem 39.1.8) using the computation 
of the Tamagawa measure (Theorem 29.11.3, (39.2.1)), following Borel [Bor81, 7.3]. 

We continue with the notation from 39.1.6, so in particular n = [F : Q] and F 
has r real places, c complex places, and B is split at t real places; B is indefinite, so 
t +c > 0; and O C B is a locally norm-maximal order. We restate the formula for 
convenience. 


Main Theorem 39.3.1. We have 


_ U4rny 3/2 
vol(T!(O)\H) = Gane oF dg” NaN) [He (39.3.2) 


Proof. To summarize the previous section, we started with t!(B!\B!) and concluded 
t!(O!\O!) = 1 by strong approximation; we factored this into finite and infinite parts, 
with the finite part computed in terms of the order and the infinite part. Then 


O'\BL = on] [s ~ BE x (o'\[ |). (39.3.3) 


vloo veQ ploo 
v¢Q 
Each term contributes to the volume. For the first product, for each of the r — t places 
v € Q we have B} ~ H! and we computed in Lemma 29.5.9 that t!(H') = 47”. For 
the remaining terms, we employ the comparison formula between measures (Lemma 
39.2.14), and are plagued by the same factor 2 coming from the fact that '!(O) arises 
from PO! /{+1}. Putting these together, the decomposition (39.3.3) yields a volume 


th (O!\BL) = (4r? n (8n?) E vo 


O76. Die (39.3.4) 
z HAAL I PAITNA). 
From (39.2.9) and (39.3.4) we conclude 
voll '\H) = a HOB) 
T m 
(4)! (39.3.5) 


= Greet FOOI : 0 
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Finally, the computation (39.2.11) of the local index completes the proof. 


Remark 39.3.6. A similar proof works for the case where F is a function field or 
where T is an S-arithmetic group, but in both cases still under the hypothesis that B is 
S-indefinite for an eligible set S (playing the role of the archimedean places above). 


Example 39.3.7. Suppose F is totally real, and we take B = M>(F) and O = M2(Zp). 
Then H = (H*)” and 


vol(C'(O)\H) = EDAN g = SEO) pn, 


(4r 2)n dp 


39.4 Genus formula 


In this section, we take the volume formula (Main Theorem 39.3.1) in the special case 
of a Fuchsian group and extend it to a formula for the genus of a Shimura curve. 

We maintain our notation but now specialize to the case where c = 0 and ż¢ = 1: in 
particular, F is a totally real field, and B is indefinite. Thus Ir = '!(O) < PSL2(R) isa 
Fuchsian group. 

We suppose that O is an Eichler order of level W. 


39.4.1. Recalling 37.7, let (g; e1, . . ., ex; 6) be the signature of T. Then Y(T) has genus 
g; has k elliptic cycles of orders e1,...,ex € Zs2, corresponding to cone points on 
Y(T) with given order; and has ô parabolic cycles, corresponding to the punctures of 
Y(T). We have 6 = O unless B = M2(Q), corresponding to the case of classical modular 
curves. 

By Proposition 37.7.4, 


k 
uYT)) = 27 fos IF (! 7 >) a | l 
i=] 


l 


Rewriting this slightly, for q € Z>2, let mg be the number of elliptic cycles of order q. 


Then 
LET) -ga Sm (! E 5) +6 (39.4.2) 


2n 
q22 


and the sum is finite. 


The numbers mg are determined by embedding numbers of quadratic orders into 
the quaternion order O, as studied in chapter 30. 


39.4.3. Let q € Zs2. Suppose that mg > 0, so that O! has a maximal finite subgroup 

(y) < O! of order 2q. Then the field Kg = F(¢2_) D F is a quadratic field extension and 

K4 — B embeds, where £9, is a primitive 2gth root of unity, and we have two optimal 

embeddings S = F(y) N O = O given by y and y. Conversely, to every embedding 

$: Kq > B, we associate the order S = ¢(K,) N O and the finite subgroup S%s C O'. 
Thus there is a two-to-one map 


ors 
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{O'-conjugacy classes of optimal embeddings ¢: S — O with SX, = 2q} 
l 
{Elliptic cycles of T of order 2q}. 


In the notation of 30.3.10, we have shown that 


nes > -(S,0;0'). (39.4.4) 


Kg2S2Rlfq] 
#S% .=2q 
Our next major ingredient is the theory of selectivity, treated in chapter 31. 


39.4.5. We claim that K, does not satisfy the selectivity condition (OS), defined in 
31.1.6. If O is an Eichler order, we may appeal to Proposition (31.2.1) and condition 
(a): since F is totally real, Ką is totally complex, and B is split at a real place, condition 
(a) fails. 

Therefore by Main Theorem 31.1.7(a), Gen O is not optimally selective. By Corol- 
lary 31.1.10, for every R-order $ C Kg, we have 
h(S) 
#Clo R 
where we have substituted #Cls O = # Clo R (by Corollary 28.5.17). 


The adelic embedding numbers m(S, O; O*), a product of (finitely many) local 
embedding numbers by 30.7.1, are computed in section 30.6. 


m(S,O0;0*) = m(S,0;0*) (39.4.6) 


We will need one lemma relating units to class numbers. 


Lemma 39.4.7. We have [R3 o : R*?] = 2[Clo R : C1 R]. 


Proof. The index [RE 0 : R**], which does not depend on S, is related to class 
numbers as follows. For each real place v, define sgn,: F* — {+1} by the real sign 
sgn,,(a) = sgn(v(a)) at v. Let 
sgng: F* > {+1} 
a |> (sgn,(a))y 

collect the signs at the places v € Q. Then we have an exact sequence 

1 > {+1}°/ sgng R* > Clo R> CIR > 1 (39.4.8) 
where the map on the left is induced by mapping a tuple of signs in {+1}® to the 
principal ideal generated by any a € F* with the given signs. We have a second 


(tautological) exact sequence 

1 RX yR? > RIRO —* (41)  [41)%/ sgng RX > 1 (39.4.9) 
of elementary abelian 2-groups (or F2-vector spaces). Combining (39.4.9) with (39.4.8), 
and noting that [R* : RX] = 2” by Dirichlet’s unit theorem and #Q = r — 1 by hypoth- 
esis, we conclude that 


[RX o © RO] = 2"* [Clo RI CIR] = 2[Clo R : CIR]. 
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Definition 39.4.10. For a quadratic R-order S C K, the Hasse unit index is defined by 
Q(S) := [Nmxyr(S*) : R]. 


We have Q(S) < co because S% and R* have the same Z-rank. 


Remark 39.4.11. Hasse [Hass52, Sätze 14-29] proved numerous theorems about 
Q(Zx), including that Q(Zx) < 2: see also Washington [Was97, Theorem 4.12]. 


We are now ready to write in a simplified way the count mg of elliptic cycles. 
Proposition 39.4.12. We have 


ae AS) 0 
Mg = By" O:0) 


#S% 


tors 7 


where h(R) = #Pic R and h(S) = # Pic S. 


Proof. Beginning with (39.4.4), we have 
1 


E oO! 
Mg = 5 X, m(S,0;0'), (39.4.13) 
ScKg 
#5 hrs =24 
By Lemma 30.3.14, we have 
m(S,O; O') = m(S, O; O*)[nrd(O*) : nrd(S*)]. (39.4.14) 


Since B is indefinite, by Corollary 31.1.11 we have nrd(O*) = RZ 0° By Lemma 39.4.7, 
we have [Re o : R*?] = 2[Clo R : CLR]. Thus 


[nrd(O*) : nrd(S”)] = [R29 : R*?)[R : nrd(S*)] 
_ 2[Clo R : CIR] (39.4.15) 
— 
Substituting (39.4.6) and (39.4.15) into (39.4.14), we find 


m(S,0;0!) = m(S,O;0*)[nrd(O*) : nra(S%)] 


_ AS) ny [Re 0 : R*?] 
= 9" FoR 89.4.16) 


2 1O Ea 
= OS? ). 


Finally, plugging (39.4.16) into (39.4.13) and cancelling a factor 2 gives the result. 


Corollary 39.4.17. The signature of a Shimura curve depends only on the discriminant 
D and level M. 
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Proof. The ambiguity corresponds to a choice of Eichler order of level WM and choice 
of split real place; when F = Q, there is no ambiguity in either case. So we may suppose 
that I has no parabolic cycles. Then we simply observe that the formula (Proposition 
39.4.12) for the number of elliptic cycles depends only on Ô. 


Example 39.4.18. As a special case of Proposition 39.4.12, suppose that N = OM is 
coprime to q. Then as in Example 30.7.5, we have 


al GLEE) & as 

-—_|](1-|—# 1+{—4 uA 

ma xo] (* II AF a O(S) 
#S% s =24 


We now have the ingredients to give a formula for a Shimura curve. 


Theorem 39.4.19. Let Y'(O) = T'(O)\H?. Then Y'(O) is an orbifold with genus g 
where 


4 


55 
e (4x2)" 


1 
FORD- X mg (! 7 =) -ð 


qz2 


where mg are given in Proposition (39.4.12). 


Proof. Combine the volume formula (Main Theorem 39.3.1) with (39.4.2). 
The special case where F = Q is itself important. 


Theorem 39.4.20. Let D = disc B > 1 and let O C B be an Eichler order of level M, 
so N = DM = discrdO with D squarefree and gcd(D, M) = 1. 
Then X'(O) = T'(O)\H? is an orbifold with genus g where 


_ DWM) _ m _ 2m 


6 2 3 


where the embedding numbers were computed in Example 30.7.7: 


I es 
-| 1+(=]), aya: 
m = m(Z[i],O;O*) = pi (5 II P 


2g -2 


0, if4| M. 
-3 -3 
ee 1+(—]), worm: 
m3 = m(Z[w], 0; O*) = [if (5 Ani Fal rt 
0, if9 | M; 


Example 39.4.21. Suppose D = 6 and M = 1, so we are in the setting of Then 
m = m3 = 2 and 


(6) 4 1 
E E ALE E E er 
8 6 3 3 


so g = 0; this confirms that x!(O) has signature (0; 2, 2,3, 3) as in 37.9.10. 
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Exercises 


1. Let F be the function field of a curve X over F4 of genus g. Let B be a quaternion 
algebra over F. 


(a) Let v € PIF be place that is split in B. Let S = {v}, let R = Ris), and 
let O C B be an R-order. Let 7 be the Bruhat-Tits tree associated to 
B, ~ Mo2(F,). Via the embedding ı : B — B,, show that the group 
T1(O) = PuO')UT acts on T by left multiplication as a discrete group 
acting properly. 


(b) Continuing as in (a), compute the measure of TONT using the methods 
of section 39.3. 


2. Generalizing the previous exercise, let F be a global field, let B be a quaternion 
algebra over F, let S be an eligible set and suppose that B is S-indefinite. Let 
R = Rgs) and let O C B be an R-order. Define a symmetric space H on which PO! 
acts as a discrete group acting properly, and compute the measure of T!(O)\H. 


This chapter is licensed under the terms of the Creative Commons Attribution- 
NonCommercial 4.0 International License (http://creativecommons.org/licenses/by- 
nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and 
reproduction in any medium or format, as long as you give appropriate credit to the 
original author(s) and the source, provide a link to the Creative Commons license and 
indicate if changes were made. 
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Creative Commons license, unless indicated otherwise in a credit line to the material. If 
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Part V 
Arithmetic geometry 


Check for 
updates 


Chapter 40 


Classical modular forms 


In this chapter, we introduce modular forms on the classical modular group. This 
chapter will provide motivation as well as important examples for generalizations in 
this last part of the text. 


40.1 > Functions on lattices 


In this section, we pursue the interpretation of the quotient T \H? with I = PSL,(Z) as 
a moduli space of lattices, and we study functions on the quotient. There are a wealth of 
references for the classical modular forms, including Apostol [Apo90, Chapters 1-2], 
Diamond—Shurman [DS2005], Miyake [Miy2006, Chapter 4], Lang [Lang95, §1], and 
Serre [Ser73, Chapter VII]. For this section, see Silverman [Sil2009, Chapter VI] for 
the complex analytic theory of elliptic curves and the relationship to Eisenstein series. 

Recall from 35.3.3 that Y = ['\H? parametrizes complex lattices up to homothety, 
i.e., there is a bijection 


Y =T\W > {Ac Clattice}/ ~ 
[rh [Z+ Zr]. (40.1.1) 


In particular, the set of homothety classes has a natural structure of a Riemann surface, 
and we seek now to make this explicit. We show that there are natural, holomorphic 
functions on the set of lattices that allow us to go beyond the bijection 40.1.1 to realize 
the complex structure on Y explicitly. 

Let A c C be a lattice. To write down complex moduli, we average over A in a 
convergent way, as follows. 


Definition 40.1.2. The Eisenstein series of weight k € Z2 for A is 


If k is odd, then G;(A) = 0, so let k € 2Z55. 
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Lemma 40.1.3. The series Gg (A^) converges absolutely. 


Proof. Up to homothety (which does not affect convergence), we may suppose A = 
Z + Zr, with t € H*. Then we consider the corresponding absolute sum 


1 1 
2, = =Z 40.1.4) 
k > k ( 
Zra lAl nenez Im +n] 
4+0 (m,n)#(0,0) 


The number of pairs (m, n) with r < |mt + n| < r + 1 is the number of lattice points in 
an annulus of area a(r + 1)? — ar? = O(r), so there are O(r) such points; and thus the 
series (40.1.4) is majorized by (a constant multiple of) )\°° r!-k | which is convergent 
for k > 2. oO 


40.1.5. For z € H? and k € 2Zs», define 


1 
G = G(Z + Zz) = —. 40.1.6 
k(2) = Gk(Z + Zz) 21 Gane (40.1.6) 
(m,n)#(0,0) 


Lemma 40.1.7. Gx(z) is holomorphic for z € H?, and 
Giyz) = (cz + d)“ Gr(2) 


b 
forall y = k ) € PSL2(Z). 


Proof. This is true for z € X since then 


|m + nz} =m +2mnRez + n|z\" >m -m+n = |m + nw? 


thus |G;(z)| < |G(w)| and so by the Weierstrass M-test, Gg(z) is holomorphic for 
z € H: by Morera’s theorem, uniform convergence implies holomorphicity. But now 


for all y = É p 


) €T, we claim that 
c d 


Gr(y2) = (cz + d) GC) (40.1.8) 
(and note this does not depend on the choice of sign): indeed, 


1 cz+d 


man) (RFRA OFAR (40.1.9) 


and the map 


(n,m) |> (n,m) k ,) = (an + cm, bn + dm) 
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is a permutation of Z? — {(0, 0)}, so by absolute convergence we may rearrange the sum 
to get 


GoD = ezda > ! 


k 
mnei ((bn + dm) + (an + cm)z) 
(m,n)#(0,0) 


= (cz +d) By _ (cz + DEG). 


(m + nz)k 


(40.1.10) 


m,néeZ 
(m,n)#(0,0) 


By transport, since TH = H’, we see that G;(z) is holomorphic on all of H?. o 


40.1.11. In this (somewhat long) paragraph, we connect the theory of Eisenstein series 
above to the theory of elliptic curves. 
Let A C C be a lattice. We define the Weierstrass -function (relative to A) by 


TECG TSY (r - >) (40.1.12) 
ao eee (z- A) a 
A#0 
ae 1 lzl(2|4] + |z) 1 
Z + |Z 
- = 40.1.1 
eae a? 7 [AP QAI- Iz)? ý (a read 


so as above we see that (z) is absolutely convergent for all z € C\A and uniformly 
convergent on compact subsets, and so defines a holomorphic function on C\A. Since 


40.1.14 
(z 2 r -5 z i D5 ( ) 
by differentiating the geometric series, we find 
1 3 k_ 1 4 6 
QZ) = + 5 (k — 1)Gg(A)2" = = + 3G4(A)z" + 5G6(A)z? +.... (40.1.15) 
E E Z 
Differentiating with respect to z and squaring, we find that 
d 4  24G4(A 
(£e ) E A E (40.1.16) 
z Z 


Expanding out the first few terms, we find that 


2 
f2) = (Zo) — 49(z)* + 60G4(A)p(z) + 140G6(A) = OK?) 


is holomorphic at z = 0 and satisfies f(z +4) = f(z) forall 4 € A. By periodicity, f(z) 
takes its maximum in a fundamental parallelogram for A; then by Liouville’s theorem, 
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f is bounded on C so constant. Since f(0) = 0, we conclude that f(z) is identically 
zero. 
Following convention, write 


ga = ga(A) = 60G4(A) and ge = go(A) = 140G6(A) 
and 
dg 
x(z) = 9(z;A) and y(z)= ane A). 
Then the image of the map 
C/A > P%O) 
(40.1.17) 
Ze (x(z): yz): 1) 


is cut out by the affine equation 
2 3 
y^ =4x° — g4x — g6; 


the map (40.1.17) is an isomorphism of Riemann surfaces. Looking ahead to Definition 
42.1.1, this map exhibits C/A as an elliptic curve over C. 


To produce holomorphic functions that are well-defined on the quotient T\H?, we 
can take ratios of Eisenstein series; soon we will exhibit a map 


j:Woc (40.1.18) 


obtained in this way that defines a bijective holomorphic map T\H? Z C (Theorem 
40.3.8). 


40.1.19. Eisenstein series can also be thought of as weighted averages over the (cosets 
of the) group PSL2(Z) as follows. 

Let T% < IT = PSLo(Z) be the stabilizer of œ; then F% is the infinite cyclic group 
1 


generated by T = o 


il We consider the cosets T~ \I: for t € Z and y = k ) WE 


have T’y = j tee p r with the same bottom row. Thus the function (cz + d}? 
is well-defined on the coset l'y. Thus we can form the sum 
1 1 
Fx@= Š, (z+dy*= 5 oy = ae (40.1.20) 
Toye o\P cae (+d) 
ged(c,d)=1 


the factor 2 coming from the choice of sign in PSL2(Z). Since every nonzero (m, n) € z? 
can be written (m, n) = r(c, d) with r = gcd(m, n) > 0 and gcd(c, d) = 1, we find that 


Gk(2) = C(K)EK(Z). 
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40.2 > Eisenstein series as modular forms 


In the previous section, we saw that natural sums (Eisenstein series) defined functions 
on H? that transformed with respect to PSL2(Z) with a natural invariance. In this 
section, we pursue this more systematically. 


Definition 40.2.1. Let k € 2Z and let IT < PSL2(R) be a Fuchsian group. A map 
f: H? > CU {co} isweight k-invariant under T if 


fz = (cz +d)‘ f(z) forally = É ) er. (40.2.2) 


40.2.3. If f is weight k invariant and f’ is weight k’ invariant, then f f’ is weight 
k +k’ invariant, and if k’ = k then f + f’ is weight k invariant. Therefore, the set of 
weight k-invariant functions has the structure of a C-vector space. 


40.2.4. Weight k invariance under IT can be checked on a set of generators for T lifted 


4 € SL,(R), define j(y; z) = cz +d. Then (40.2.2) 


can be rewritten f(yz) = J(y; z)* f(z). 
For y’ € T, we compute that j satisfies the cocycle relation 


to SL2(Z), as follows. For y = (: 


JOY: 2) = VY 2907/3 2) (40.2.5) 


Fa 


because if y’ = c i then 


a’z+b’ 
c’z +d’ 


(a'c + c'd)z + b(b’c + dd’) = l | ) + a) (c’z+d’). (40.2.6) 


Therefore, if f is a map with f(yz) = J(y; 2)‘ f(z) and f(y’z) = ly’; 2* f(z), then 
FIV) = IY FD = HY DIO FFD 
= yy'i2)" f2). 


Since PSL2(Z) is generated by S, T, it follows from (40.2.7) that a map f is weight 
k invariant for PSL2(Z) if and only if both equalities 


fZ+) =f 


(40.2.7) 


FOD = 2k F(2) se 
hold for all z € H’. 
40.2.9. Since m i 
E TET (40.2.10) 
the weight k invariance (40.2.2) of a map f can be rewritten 
foD dyz. = f@az™? (40.2.11) 


so equivalently, the differential f(z) dz®*/? is (straight up) invariant under T. 
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40.2.12. Let f: H? — C be a meromorphic map that is weight k invariant under a 


f i) Then f(z + 1) = f(z). If f admits a Fourier series 


expansion in q = exp(277iz) of the form 


Fuchsian group [ > 


oo 


f= X, ang” € CCA) (40.2.13) 


n=—co 


with a, € C and a, = O for all but finitely many n < 0, then we say that f is 
meromorphic at oo; if further a, = 0 for n < 0, we say f is holomorphic at oo. 

More generally, let © < PSL2(Z) be a subgroup of finite index. For y € PSL2(Z), 
we define 


FIY) := ys 2) f2). (40.2.14) 


Then f[y]x(z) is weight k invariant under the group y~!ry. We say that f is mero- 
morphic at the cusps if for every y € PSL2(Z), the function f[y] is meromorphic 
at co, in the above sense. Since f is weight k invariant, to check if f is meromorphic 
at the cusps, it suffices to take representatives of the finite set of cosets T\ PSL2(Z). 
(The name cusp comes from the geometric description at co coming from the parabolic 
stabilizer group, recalling Definition 33.4.5.) 

Finally, we say that f is holomorphic at the cusps if f[y]x(z) is holomorphic at 
oo for all y € PSL2(Z), and vanishes at the cusps if f[y],(co) = 0 for all y. 


Definition 40.2.15. Let k € 2Z and let T < PSL2(Z) be a subgroup of finite index. A 
meromorphic modular form of weight k is a meromorphic map f: H? — C that is 
weight k invariant under I and meromorphic at the cusps. A meromorphic modular 
function is a meromorphic modular form of weight 0. 

A (holomorphic) modular form of weight k is a holomorphic map f: H? > C 
that is weight k invariant under I and holomorphic at the cusps. A cusp form of weight 
k is a holomorphic modular form of weight k that vanishes at the cusps. 


Let M;(T) be the C-vector space of modular forms of weight k for T, and let 
Sk (T) © M; (1) be the subspace of cusp forms. 


Lemma 40.2.16. The Eisenstein series Gx(z) is a holomorphic modular form of weight 
k € 2Z>2 for PSLo(Z) with Fourier expansion 


2 . k co 
Gi(2) = 2¢(k) +2 K as » oK1(n)q” (40.2.17) 
where 
21 
£(k) = 2, T 
and 
ox-1(n) = X,d 
d\n 
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Proof. We start with the formula 


co M 
1 1 
z cot(1z) = »y cam 7am, Die =a (40.2.18) 


(Exercise 40.2); with q = exp(2ziz), 


cos(7z) .gt+l., 2 
= =f 1 + 
sin(7z) q-1 


cot(7z) = 


and so we obtain the Fourier expansion 


n cot(zz) = ni — 27 >, q". (40.2.19) 
n=0 


Equating (40.2.18)-(40.2.19) and differentiating k — 1 times, we find that 


co 


1 _ 1 sk = k-1on 
boa a pie à" q (40.2.20) 


(since k is even). Thus 


1 
Gx(z) = > (m+nzk Ears ay > (m+ ak 


(m,n)#(0,0) n=] m=—oo 


so replacing n < a and then substituting z — nz in (40.2.20), summing over n we 


obtain 
Gila) = 24) +2 aay alg” 


Snia 2 rT DO i 


n=1 din 


(40.2.21) 


grouping together terms in the second step. The fact that Gg is holomorphic at oo then 
follows by definition. o 


40.2.22. We accordingly define the normalized Eisenstein series by 


Ex(Z) = TOLG 
(see also 40.1.19). We have 
2k © 
Ex(2) = 1-5 2u 0da" (40.2.23) 


where B, € Q* are the Taylor coefficients of 


1 xê 


—— =) = _l Ix oe ee 
e—-1 Le kay 2°" 62! 304! 426! 
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(Exercise 40.3): the numbers Bg € Q (with By + 0 for k € 2Z>ọ) are Bernoulli 
numbers. Expanding, we find 


E4(z) = 1 + 240q + 2160q? + 6720g? + 17520q* +... 
Eo(z) = 1 — 504q — 166324? — 1229760? — 5327284 -.... 


Remark 40.2.24. The notion of Eisenstein series extends in a natural way to the Bianchi 
groups PSL2(Zr) where F is an imaginary quadratic field: see Elstrodt-Grunewald— 
Mennicke [EGM98, Chapter 3]. 


40.3 > Classical modular forms 


In this section, we study modular forms for PSL2(Z); to ease notation, we abbreviate 
T := PSL2(Z). 


40.3.1. Let f be a meromorphic modular form of weight k for = PSL2(Z). If k = 0, 
so f is a modular function, then f descends to a meromorphic function on Y := T\H?. 
Although this is not true for (nonzero) forms of weight k # 0, the order of zero or 
pole ord,(f) is well defined on the orbit Iz by weight k invariance (40.2.2). With the 
Fourier expansion (40.2.13), we define 


ordes(f) := ordy( X ang”) = min({n : an # 0). 
The form f has only finitely many zeros or poles in Y, i.e., only finitely many 
T-orbits of zeros or poles: since f is meromorphic at oo, there exists € > 0 such that f 
has no zero or pole with 0 < |g| < €, so with 


log(1/e)_ 


Imz> M= 
: Qn 


but the part of x with Im z < M is compact, and since f is meromorphic in HÊ, it has 
only finitely many zeros or poles in this part as well. 


40.3.2. In a similar way, the order of the stabilizer e; := #Stabr(z) is well defined on 
the orbit Iz, since points in the same orbit have conjugate stabilizers. By 35.1.14, 


3, iffz=Tw; 
ez = 42, iffz=Ti; (40.3.3) 
1, otherwise. 


Proposition 40.3.4. Let f : H? — C be a meromorphic modular form of weight k for 
T = PSL2(Z), not identically zero. Then 


1 k 
ordo(f)+ J, corde) = 75 (40.3.5) 
Tzer\H? 


where e; := #Stabr(z). 
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The sum (40.3.5) has only finitely many terms, by 40.3.1, and the stabilizers are 
given in 40.3.2. 


Proof. See Serre [Ser73, §3, Theorem 3]: the proof consists of performing a contour 
1d 

integration a on the boundary of x. Alternatively, this statement can be seen 
ni 


as a manifestation of the Riemann—Roch theorem: see Diamond—Shurman [DS2005, 
§3.5]. oO 


40.3.6. We have E4(STz) = (z + 1)*Ea(z), so since (ST)(w) = w, 
Elw) = (w + 1)*E4(@) = w°Ga(w) 


so E4(w) = 0. Since E4 is holomorphic in HŽ, we have ord,(E4) € Zso for all z € H?, 
and thus by Proposition 40.3.4, we must have that £4(z) has no other zeros in x. 
Similarly, 

Eo(i) = Eo(Si) = i°Eo(i) = —Eo(i) 


so E¢(i) = 0, and F(z) has no other zeros. 
For the same reason, the function 


E4(z)> — E6(z)" 
Az HO ee = q- 249? +2529? - 147244 +... (40.3.7) 


is a modular form of weight 12 with no zeros in H? with ord,.(A) = 1. 


We give two applications of Proposition 40.3.4. First, we obtain the identification 
promised in (40.1.18). 


Theorem 40.3.8. The function 


Ex(z 1 
j(z)= a = — +744 + 196884g + 21493760q7 +... (40.3.9) 
z q 


is a meromorphic modular function for PSL2(Z), holomorphic in W?, defining a bijec- 
tion 
Y =T\H >C. 


Proof. The function j is weight 0 invariant under I as the ratio of two forms that are 
weight 12 invariant. Since E4 is holomorphic in H?, and A is holomorphic and has 
no zeros in H?, the ratio is holomorphic in H? and j(z) has a simple pole at z = oo, 
corresponding to a simple zero of A at z = œ. From 40.3.6, we have j(i) = 1728, and 
J(z) — 1728 has a double zero at z = i, and j(z) has a triple zero at z = w. 

To conclude that j is bijective, we show that j(z) — c has a unique zero [z € Y. 
If c # 0,1728, this follows immediately from Proposition 40.3.4; if c = 0, 1728, the 
results follow for the same reason from the multiplicity of the zero. o 
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Remark 40.3.10. The definition of j(z) is now standard, but involves some choices. 
In some circumstances (including the generalization to abelian surfaces, see 43.5.7), 
it is more convenient to remember the values of the Eisenstein series themselves, as 
follows. To z € H, we associate the pair (E4(z), E6(z)) € C?; if y € I and z’ = yz, then 


(Ea(z'), Eo(2’)) = (E4), 6° Eo(z)) 
where 6 = j(y; z) € C*. We therefore define the weighted projective (4, 6)-space by 
P(4,6)(C) := (C7\{(0,0)})/ ~ 


where 
(E4, E6) ~ (5* Es, 6% E6) 


for 6 € C*. We write equivalence classes (E4 : E6) € P(4,6)(C). The map 
j : P(4,6X(C) > P' (C) 
1728E; 


(E4 : E6) > j(E4 : E6) = E-E? 


is well-defined and bijective—see Silverman [Sil2009, Proposition III.1.4(b)]. 


To conclude, we give a complete description of the ring of (holomorphic) modular 
forms. By 40.2.3, the C-vector space 


MT) = MT) 
ke2Z 
under multiplication has the structure of a (graded) C-algebra; we call M(T) the ring 
of modular forms for T. 


Theorem 40.3.11. We have M(T) = C[Ea, E6], i.e., every modular form for T = 
PSL2(Z) can be written as a polynomial in E4, E6. 


In particular, M;, (1) = {0} for k < 0. 


Proof. We have M(T) 2 C[E4, E6], so we prove the reverse inclusion. We refer to 
Proposition 40.3.4, and ask for solutions a1, a2, a3 € Z>ọ to ay + a2/2 + a3/3 = k/12. 
When k < 0, there are no such solutions; when k = 0, 2,4, 6, 8, 10, there is a unique 
solution, and we find that M; (T) is spanned by 1, 0, E4, E6, E2, E4E6, respectively. 

For all even k > 4, there exist a,b € Zso such that 4a + 6b = k (if k = 10 and 
k = 2 (mod 4), then k — 6 > 4 and 4 | k), so EJ EL € M(T) and (E$ EL (00) =1 
(by 40.2.22). Let S(T) E M(T) be the subspace of forms that vanish at oo. Then 
MT) = CEZE? ® S(T) by linear algebra, and by the previous paragraph, S(T) = {0} 
for k < 10. 

We claim that multiplication by A furnishes an isomorphism M;(T) = Sk+12(T) of 
C-vector spaces for all k: division by A defines an inverse because A has a simple zero 
at co by 40.3.6 and no zeros in H°. The result now follows by induction on k >0. o 
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More generally, one can study modular forms for congruence subgroups (section 
35.4) of PSL2(Z) in an explicit way, as the following example illustrates. 


Example 40.3.12. At the end of section 35.4, we examined a fundamental domain for 
the group I (2), defined by (35.4.8). As with X(1), the homeomorphism (35.4.11) can 
be given by a holomorphic map 


A: X(2) > P(O) 


obtained from Eisenstein series for (2), analogous to j(z). The map 4 satisfies A(yz) = 
A(z) for all y € IT(2) and in particular is invariant under z + z + 2. One can compute 
its Fourier expansion in terms of g!/? = e"? as: 


A(z) = 16q! — 128q + 704g7/* — 3072q7 + 11488q°/* — 38400g? +.... (40.3.13) 


Since j(z) induces a degree 6 = [I'(2) : [(1)] map X(2) — X(1), we find the relation- 
ship 
(a2 -2a +1) 

a2(A-1)2 ` 
From (40.3.14) (and the first term), the complete series expansion (40.3.13) can be 
obtained recursively. 

As a uniformizer for a congruence subgroup of PSL2(Z), the function A(z) has 

a moduli interpretation (cf. 40.1.11): there is a family of elliptic curves over X(2) 
equipped with extra structure. Specifically, given A € P!(C) \ {0, 1, co}, the correspond- 
ing elliptic curve with extra structure is given by the Legendre curve 


Ey: y? = x(x- 1)(x — 4), 


j=256 (40.3.14) 


equipped with the isomorphism (Z/2Z)? — E[2] determined by sending the standard 
generators to the 2-torsion points (0,0) and (1,0). The map j is the map that forgets 
this additional torsion structure on a Legendre curve and remembers only isomorphism 
class. 


40.4 Theta series 


An important class of classical modular forms arise via theta series, which count 
the number of representations of an integer by a positive definite quadratic form. We 
present only a small fraction of the general theory here. 

Let Q: Z™” — Z be a positive definite integral quadratic form in m variables and 
suppose that m = 2k is a positive even integer. We define the theta series of Q by 


Oo : H > C 
@9(z) = ` ePiQ)z — Dy ro(n)q" (40.4.1) 
xeZ™ n=0 


where q = e7”"2 and 
ro(n) = #{x € Z” : Q(x) =n} < œ 


counts the number of lattice points on the sphere of radius yn. 
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Lemma 40.4.2. ©o(z) is a holomorphic function. 
Proof. Since Q is positive definite, there exists c € R>ọ such that 
Q(x) = ca? ++ x). 


Thus ro(n) = O(n‘), and the series @,(z) is majorized by (a constant multiple of) 
E2] nq", so converges to a holomorphic function. Oo 


Let [T] be the Gram matrix for the symmetric bilinear form associated to Q; 
then [T] € M,,(Z) is an integral symmetric matrix with even diagonal entries. Let 
d = detQ = det[T] € Z. Then dA“! € M,,(Z) is the adjugate matrix: it is again 
symmetric. 


Definition 40.4.3. The least positive integer N € Z>o such that NA! is integral with 
even diagonal entries is called the level of Q. 


We recall the definition of the congruence subgroups 35.4.5. 
Theorem 40.4.4. The theta series @gQ(z) is a modular form of weight k for T\(N). 


Proof. Unfortunately, in this generality the proof would take us too far afield. Fun- 
damentally, the transformation formula for ©g follows from Poisson summation and 
careful computations: see Eichler [Eic73, §1.3, Proposition 2], Miyake [Miy2006, 
Corollary 4.9.5], or Ogg [Ogg69, Chapter VI]. oO 


40.4.5. We can be a bit more specific about the transformation group for Og(z) as 


(-1)* det O 
n 


follows. To Q, we associate the character y defined by y(n) = ) Then 


Oo(yz) = x(d\(cz + d) @o(z) for ally = k i} € Io(N); 


accordingly, we say Og is a modular form of level N with character y. 


40.5 Hecke operators 


Significantly, the C-vector space Mg(To(N)) of modular forms of weight k for To(N) 
carries with it an action of commuting semisimple operators, called Hecke operators. 
These operators may be interpreted as averaging modular forms over sublattices of 
a fixed index; for efficiency, we work with a more explicit definition. For further 
reference, see e.g. Diamond—Shurman [DS2005, Chapter 5] or Miyake [Miy2006, 
§2.7, §4.5]. 

Throughout, let N € Z>1. Let 


O = O(N) := I ‘| € M(Z): N | e} C M2(Z) 


be the standard Eichler order of level N in M>(Q), so that T = To(N) = O! /{+1}. 
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Let n € Zs with gcd(n, N) = 1. We consider the set of matrices 
On = {a € O : det(a) = n}. (40.5.1) 


Visibly, there is a left (and right) action of O! on O,, by multiplication. 


Lemma 40.5.2. A system of representatives of O!\Oy is given by the set of matrices 


of the form l i} with ad =n, a >0, and0<b<d. 


Proof. The lemma follows as in Lemma 26.4.1(b) using the theory of elementary 
divisors, but applying row operations (acting on the left). o 


Example 40.5.3. When p { N is prime, the set O!\O, is represented by the p + 1 


matrices 
p oO 1 0 oe) ee 1 p-l 
0 1/’?\0 p}’\0O py’ "10 p |` 


Definition 40.5.4. For n € Zs, with gcd(n, N) = 1, we define the Hecke operator 
T(n) : MC) > M:(T) 
TMO = Y azz) flez. (40.5.5) 


O!aeO!"\0n 


By the condition of automorphy f(yz) = j(y;z)* f(z) and the cocycle relation 
(40.2.5), the Hecke operators are well-defined and preserve weight k invariance. 


40.5.6. By Lemma 40.5.2, we have more explicitly 


d-1 
1 az +b 
_ „k-1 1 

(Ta) f(z) =n! > T = s| s ) . (40.5.7) 

ad=n b=0 

a>0 
Accordingly, if f(z) = D9 anq”, then (TM) f)(z) = Dp bmg” where 

Drs anes (40.5.8) 


d|gcd(m,n) 
d>0 
so in particular for n = p prime we have 


Osis if p | m; 


40.5.9 
0, ifp {m. Á ) 


bm = apm + | 
Applying (40.5.9), we see that if f € Mg(T`) has f(co) = 0 (equivalently, aọ = 0), then 
the same is true for T(n)f. Repeating this for the functions f[y], with y € PSL2(Z) 
(as in (40.2.14)), we conclude that the operators T(n) act on the space of cusp forms 
Sk(P) C MgC). 
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Proposition 40.5.10. For m,n € Z>1, we have 


T(m)T(n) = 7, d!T(mn/a®). (40.5.11) 
d|gcd(m,n) 


In particular, if gcd(m, n) = 1 then 
T(m)T(n) = T(n)T(m) = T(mn) 
and if p is prime and r > 1 then 


Tipe \=Te™) +p" TOE" '). 


Proof. These statements follow directly from the expansion (40.5.8). m| 


Theorem 40.5.12. The Hecke operators T(n) for gcd(n, N) = 1 on Mk (To(N)) generate 
a commutative, semisimple Z-algebra. 


Proof. See e.g. Diamond—Shurman [DS2005, Theorem 5.5.4]. Briefly, we treat Eisen- 
stein series separately and work with cusp forms Sx(To(N)). To prove that the operators 
are semisimple, we would need to show that the Petersson inner product 


oe | NODT dutz) 
T\H2 


is well-defined, positive, and nondegenerate, and then verify that the operators are 
normal with respect to this inner product. oO 


By Theorem 40.5.12 and linear algebra, there exists a C-basis f;(z) of Mg (To(N)) 
consisting of simultaneous eigenfunctions for all T(n). 


Exercises 


> 1. Let f: U — C be a function that is meromorphic in an open neighborhood 
U 2 C with z € U, and let C be a contour along an arc of a circle of radius € > 0 
centered at z contained in U with total angle 0. Show that 


' df _ 
lim . a = ĝi ordz (f). 


> 2. Prove the formula 


mcot(mz) = D : 


z+m 


m=— 0 


for z € C. [Hint: the difference h(z) of the left- and right-hand sides is bounded 
away from Z, invariant under z œ> T(z) = z + 1, and in a neighborhood of 0 
is holomorphic (both sides have principal part 1/z) so bounded; thus h(z) is 
bounded in C and hence constant. ] 
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> 3. In this exercise, we give Euler’s evaluation of ¢(k) in terms of Bernoulli numbers. 
As in Exercise 36.12, define the series 


1 x? 1 xt 


X z xk x 
eal à&®*p Isi pp ~ . (40.5.1 
i 2 ig a a sae 


(a) Plug in x = 2iz into (40.5.13) to obtain 


zcotz = TS i9 : 
k=2 : 


(b) Take the logarithmic derivative of 


oo 2 
inz=z[ |f- 
sinz =z — 
n?n? 


n=1 


to show 


zcotz=1-2 y ye. 
k=2 


(c) Conclude that 


for k € 2Zs1. 


4. We defined Eisenstein series G;(z) for k > 4, and found Gz(z) € Mz (SL2(Z)) 
are modular forms of weight k for SL2(Z). The case k = 2 is also important, 
though we must be a bit more careful in its analysis. Let 


(c,d)#(0,0) 
(a) Show that G2(z) converges (conditionally) and satisfies 


Gaz) = 26(2) - 82? X o(n)q” 
n=l 
where q = e??? 


(b) Show that G2(z + 1) = G2(z) and 
-1 2ni 
G2 (=| = G2(z) - —. 
Z z 


[Hint: use a telescoping series and rearrange terms. ] 


760 CHAPTER 40. CLASSICAL MODULAR FORMS 


(c) Conclude that 


2nic a b 
G2(yz) = Go(z) - ead for all y = k s) ; 


[Hint: use the cocycle relation. ] 
(d) Define 
j m 
G5(2) = Go(z) - —. 
Imz 


Show that G3(z) is weight 2 invariant under SL2(Z). 


5. In this exercise, we give a proof using modular forms of the formula for the 
number of ways of representing an integer as the sum of four squares, due to 
Jacobi. Consider the function 


Hz) := 3 q” 


n=—oo 


where q := e?z and z € H’. [The function #(z) is a theta series for the univariate 
quadratic form x +» x?.] Let r4(n) be the number of ways of representing n > 0 
as the sum of 4 squares. 


(a) Show that 


@o(z) := Wz) = 14 X ranq" =1+8q +1240? +.... 


n=1 
(b) Show that Og(qg) € M2(Io(4)) is a modular form of weight 2 on To(4. 
(c) Show that dimc M2(T(4)) = 2 and dimc S2(T(4)) = 0. 
(d) Let 
G22(z) = Go(z) — 2G2(2z) 
G2,4(z) = Go(z) — 4G2(4z). 


Show that G22, G2,4 are a basis for M2 (To(4)). [Hint: use Exercise 40.4(c).] 
(e) Show that 


3 co 
Ex2(2) := ~GGr2(z) = 1 + 24 oq" 


n=1 


1 co 
Ex4(z) := ~GGralz) = 1 +8 X ong” 


n=1 


o™(n) = 2 d. 


mt{d|n 


where 


(f) Matching the first few coefficients, show that 
Oo0(z) = Er,4(z). 


Conclude that r4(n) = 80(n) for all n > 0. 
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Chapter 41 


Brandt matrices 


In this chapter, we revisit classes of quaternion ideals: organizing ideals of given norm 
in terms of their classes, we find modular forms. 


41.1 »> Brandt matrices, neighbors, and modular forms 


Let B be a quaternion algebra over Q. A major theme of this text has been the study of 
classes of quaternion ideals, beginning with chapter 17. When B is indefinite, we saw 
(Theorem 17.8.3, treated broadly in chapter 28) that strong approximation applies: via 
the reduced norm, very often the conclusion is that the class set is trivial. 

We are left with the case that B is definite. By the geometry of numbers (see section 
17.5) we found that the number of ideal classes of an order is finite, generated by ideals 
of small reduced norm. (Studying the zeta function we found a mass formula in chapter 
25, and then studying quadratic embeddings we found a class number formula in section 
30.8.) We now pursue this further: there is an exquisite arithmetic and combinatorial 
structure to be found by counting right ideals of given norm by their classes as follows. 
We begin in this section by an introduction and survey (working over Q). 

Let O C B be an order. Let Cls O be the right class set of O, keeping track of the 
isomorphism classes of invertible right O-ideals in B. Let h := #Cls O be the (right) 
class number of O, and let J, ..., In © B represent the distinct classes in Cls O. 


Let n € Zs 1. We define an h x h-matrix T(n) € M (Z) with nonnegative integer 
entries, called the n-Brandt matrix, by 


Tij := HJ C I; : nrd(J) = nnrd(J;) and [J] = []} 


3 (41.1.1) 
= #J CT: Uj: J] =n and [J] = [F;]}. 


(The notation T(n) deliberately overloads that of the Hecke operators defined in section 
40.5: keep reading to see why!) The Brandt matrix T(n) depends on O, but for brevity 
we do not include this in the notation. In the jth column of the Brandt matrix T(n), we 
look at the subideals of J; with index n? and count them in the ijth entry according 
to the class [J;] they belong to. If n = p is prime and p { N = disc O, then there are 
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exactly p + 1 such ideals, so the sum of the entries in every column in T(p) is equal to 
pti. 


—1,-23 


Example 41.1.2. We continue with Example 17.6.3. We have B = of 


discriminant 23 and a maximal order O with three ideal classes [/;], [/2], [3]. In 
(17.6.5), we found three ideals in J; = O: two belong to the class [72] and the third 
is principal, belonging to [7/1]. This gives the first column of the matrix as (1, 2, 0)*. 
Computing further, we find 


1 1 0 
T(2)=]2 1 3). 
0 1 0 
In a similar manner, we compute 
0 1 3 30 28 24 
T(3)=|2 3 Of, T(q01)=|56 54 60]. 
2 0 1 16 20 18 


41.1.3. There is a second and computationally more efficient way to define the Brandt 
matrix using representation numbers of quadratic forms. Let q; = nrd(J/;), let O; = 
OL), and let w; = #O¥/{+1} < co. Then 


1 
Tni = 5 Hae 17's nrd(@)qi/q; =n}: 
Wi 
indeed, œl; = J C J; with nrd(J) = nnrd(/;) if and only if œ € GI! = (I; : I) and 
nrd(@)q; = pqj, and a is well defined up to right multiplication by u € OF. Now 
Qij: II! >Z 


Qi;(a) = nrd(a) Ë (41.1.4) 
qj 


is a positive definite quadratic form, so it suffices to enumerate lattice points! 


Example 41.1.5. Returning to our example, we have 


Tenu = -Hy € O; : nrd) = nd. 
2wi 


Fori = 1, we have w; = 2 and with y = t+ xæ + yf +zaß andt, x, y,z € Z, by (17.6.6) 
nrd(y) = t? + ty +x? + xz + 6y? + 627 


so T(p)ıı counts half the number of representations of n by this positive definite 
quaternary quadratic form. 


There is a third way to understand Brandt matrices which is visual and combinato- 
rial. 
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Definition 41.1.6. Let J, J C O be invertible right O-ideals. We say J is a n-neighbor 
of Tif J C I and nnrd(/) = nrd(J). 

The n-Brandt graph is the directed graph with vertices Cls O and a directed edge 
from [/;] to [J] for each n-neighbor J € Jj. 


There is no extra content here, just a reinterpretation: the n-Brandt matrix is simply 
the adjacency matrix of the n-Brandt graph. 


Example 41.1.7. Returning a third time to our example, we have the 2-Brandt graph, 
as in Figure 41.1.8. 


Figure 41.1.8: The 2-Brandt graph for discriminant 23 


41.1.9. For n = p prime, there is another way to think of the p-Brandt graph. Consider 
the directed graph whose vertices are invertible right O-ideals whose reduced norm is 
a power of p, and draw a directed edge from / to J if J is a p-neighbor of J. If p { N, 
then this graph is a (p + 1)-regular directed tree, that is to say, from each vertex there 
are p + | directed edges. The notion of belonging to the same ideal class induces an 
equivalence relation on this graph, and the quotient is the p-Brandt graph. 


It is helpful to think of the matrices T(n) as operators on a space, so we define the 
Brandt module M(O) to be the C-vector space with basis Cls O and equipped with 
the action of Brandt matrices T(n) for n € Zo on the right. 

The Brandt matrices have two important properties. First, they commute: by a 
quaternionic version of the Sun Zi theorem (CRT), if gcd(m, n) = 1 then 


T(m)T(n) = T(n)T(n). (41.1.10) 


Informally, we might say that the process of taking m-neighbors commutes with the 
process of taking n-neighbors, when m,n are coprime. Second, they are self-adjoint 
for the pairing on M2(O) given by 


l/w;, ifi=j; 


0, else. 


(Wil, Uj) = | 


The proof of self-adjointness is contained in the equality w;T (n);; = w;T(n)ji, and this 
follows from a bijection induced by the standard involution. 

Therefore the matrices T(n) are semisimple (diagonalizable) and M2(O) has a 
simultaneous basis of eigenvectors, which we call a eigenbasis. The row e = (1, 1,..., 1) 
is always an eigenvector (by the sum of columns) with eigenvalue ap(e) = p + 1 for 


p{N. 
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Example 41.1.11. We check that T(2)T(3) = T(3)T(2) from Example 41.1.2; and 
W1, W2, €3 = 2, 1,3, so we verify that 


200 220 
0 1 OOel2. 1 3 
0 0 3 03 0 


is symmetric. The characteristic polynomial of T(2) is (x — 3)(x? + x — 1), and for T(3) 
it is (x—4)(x?—5). We find the eigenbasis e = (1, 1, 1) and f+ = (4, +V5—3, F3 V5 +3), 
and observe the required orthogonality 


(e, fa) = 544 (2V5—3) + 3V5 +3) = 0, 


By now, hopefully the reader is convinced that the Brandt matrices capture interest- 
ing arithmetic information about the order O and that they are not difficult to compute. 

Now comes the modular forms: the second way of viewing Brandt matrices shows 
that we should be thinking of a generating series for the representation numbers of the 
quadratic forms Q;; defined in (41.1.4). As in section 40.4, we define the theta series 
for the quadratic form Q;; 


= 1 
OQ = X Tyg" =z 2, 4 e ZIlall. 


n=0 f yeljI;! 


Letting q := e?""% for z € H?, by Theorem 40.4.4 (a consequence of Poisson sum- 
mation), the function ©;;(z) : H? — C is a modular form of weight 2 for an explicit 
congruence subgroup of SL2(Z): for example, if O is an Eichler order with reduced 
discriminant N, then ;;(¢) € Ma(To(N)) (trivial character). 

There is something enduringly magical about the fact that the entries of Brandt 
matrices (arithmetic) give Fourier coefficients of holomorphic modular forms (geo- 
metric, analytic). 


Example 41.1.12. Returning one last time to our example, the space M>(T(23)) of 
modular forms of weight 2 and level I9(23) has eigenbasis e23, f+, f- where 


11 ` * n * 
€93(Z) = pt > o™(n)q”, o™(n)= > d 
n=1 d\n 
ald 


and 


+V5+1 
Jz) i= q- 7 P+ V50? +... 


are cusp forms matching the eigenbasis in Example 41.1.11. 


One of the main applications of Brandt matrices is to express the trace of the Hecke 
operator in terms of arithmetic data, as follows. 
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Theorem 41.1.13. Let B be a definite quaternion algebra over Q of discriminant D 
and O © B a maximal order. For d < 0 a fundamental discriminant, define 


mo MK) 


p\|D 


where h(d) is the class number of AND, wg its number of roots of unity, and (2) is 
p 


the Kronecker symbol. For d’ < 0 a discriminant with d’ = df? and d fundamental, 
define hp(d’) = hp(d). Then the trace of the nth Brandt matrix associated to O is 


y(D)/12,  ifnis a square; 
; otherwise 


t T(n) = 3 hp(t? — 4n) + 
teZ 
t?<4n 


where ọ is the Euler totient function. 


Theorem 41.1.13 is a special case of Main Theorem 41.5.2, see Example 41.5.8. 
In a surprising way, it exhibits a relationship between traces of Hecke operators and 
(modified) class numbers of imaginary quadratic fields! 


41.2 Brandt matrices 


To begin, we define the all-important Brandt matrices in the generality considered in 
this text. 

Let R be a global ring with eligible set S C PI F, let B be an S-definite quaternion 
algebra over F of discriminant disc B = Ð and let O C B be an R-order in B with 
reduced discriminant discrd O = St. The reader will probably have in mind the case 
where F is a totally real (number) field, S the set of real (archimedean) places of F, 
and B a definite quaternion algebra over F; but the arguments hold just as well with a 
larger set S of ramified places or in the function field case. 


41.2.1. Let Cls O be the right class set of O. By Corollary 27.6.20, the class number 
h = #ClsO < œ is finite. Let Jı, ..., J, be a set of representative invertible right 
O-ideals for Cls O. For i = 1,...,h, let O; = OL(I;) be the left order of J;; then O; 
depends on the choice of J; but its isomorphism class (i.e., type) is independent of the 
choice of J;. (Due to the possible presence of two-sided ideals, there may be repetition 
of types among the orders O;.) 


41.2.2. Let n C R be a nonzero ideal. For each j, we consider the set of right invertible 
O-ideals J © J; with nrd(J) = nnrd(J;), and we count them according to their class in 
Cls O: 


T(n); := HJ C I; : nrd(J) = nnrd(/;) and [J] = [/;]} € Zso. (41.2.3) 


A containment J C J; of right O-ideals yields a compatible product J’ = JI and 
thus an invertible right O;-ideal with reduced norm nrd(J’) = nrd(J I7’) = n, and 
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conversely. So equivalently 
T(n)ij = #{J’ c Oj; : nrd(J’) = n and [J’J;] = [E]. 


(We could also rewrite [J’J;] = [/;] in terms of classes of right ideals of O;.) 


Definition 41.2.4. The n-Brandt matrix for O is the matrix T(n) € M (Z) whose 
(i, j)th entry is equal to T(n); j. 


41.2.5. To make the definition more canonical, we define 
M2(O) := Map(Cls O, Z) 


to be the set of maps from Cls O to Z (as sets). Then M2(O) has the structure of an 
abelian group under addition of maps, and it is a free Z-module on the characteristic 
functions for Cls O. The n-Hecke operator is defined to be 


T(n): M2(O) > M2(O) 
TMA (UD) = 2 fUD (41.2.6) 
nauj ehit) 


again the sum over all invertible right O-ideals J c J with condition on the reduced 
norm. Visibly, this definition does not depend on the choice of representative J in its 
right ideal class. And in the basis of characteristic functions for J;, the matrix of T(n) 
is precisely the n-Brandt matrix. 


Brandt matrices may be given in terms of elements instead of ideals. Let w; = 
OXY : R*]. By Proposition 32.3.7, since B is S-definite, the unit index w; < œ is finite. 
i y Frop 


Lemma 41.2.7. Let nj; = nnrd(/;)/nrd(J;) for i,j = 1,...,h. Then following state- 
ments hold. 


(a) We have 
T(n)ij = # {a € Ij]; : nrd(a@)R = nij} /OF 
1 
= —# {a E i : nrd(a)R = nij} /R* 
Wi 


(41.2.8) 


where we count orbits under right multiplication by O¥ and R™, respectively. 
(b) If the class of nij in Cl* R is nontrivial, then T(n); = 0. 
(c) Suppose that ni; = nijR with nij € F totally positive. Then 


1 
T(n)ij = J #{a € I7" : nrd(a) = un;;} (41.2.9) 


! uR”?ER% ,/R2 


where the sum is over a choice of representatives of totally positive units of R 
modulo squares. 
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Proof. We claim that there is a bijection 
{J CJ; : nrd(J) = nnrd(U/, and[J] = [/;]} 


41.2.10 
> fae LI! : nrd(a@) = nj} /O* ) 


with orbits under right multiplication by OF = On, Indeed, a containment 
J C I; of invertible right O-ideals with [J;] = [J] corresponds to a € B* such that 
al; = J, so in facta € (J : L) = Jt, and nrd(J) = nnrd(/;) translates into 
nrd(@) nrd(/;) = nrd(J) = nnrd(J;) or nrd(@)R = nij. Writing JI! = aO;j, we see that 
æ is unique up to multiplication on the right by O*. To finish (a), we note that the right 
action by O% is free; and (b) follows from (a), since nrd(B™) < Fry as B is S-definite. 

For (c), we just need to organize our generators; the sum in (41.2.9) is finite by 
the Dirichlet S-unit theorem. If nrd(@)R = n;; then nrd(@) = un;; for some u € Rž% 
Multiplying by an element of RX „, we may suppose that nrd(a)/n;; = u belongs in a set 
of representatives for RR and for v € R*, we have nrd(va) = v? nrd(a) = nrd(a) 
if and only if v = +1, which gives us an extra factor 2. oO 


41.2.11. The advantage of (41.2.9) is that it can be expressed simply in terms of a 
quadratic form. Suppose that F is a number field and R = Zr, when this observa- 
tion is especially clean. Since B is totally definite, as in 17.7.10, the space B <> 
B®gR = H” = R4” comes equipped with the positive definite quadratic form 
Q =Trr/gnrd: B > R, and if J is a R-lattice, then J = Z^” embeds as a Euclidean 
lattice J — R* with respect to this quadratic form. Therefore, 


{a € GI' : nrd(q) = unjj} € {a € I7" : O(a) = TrF jQ uni} 
where the latter set is finite and effectively computable. 
Finally, Brandt matrices are adjacency matrices. 


Definition 41.2.12. Let J, J C O be invertible right O-ideals. We say J is a n-neighbor 
of Tif J C I and nrd(J) = nnrd(/). 

The n-Brandt graph is the directed graph with vertices Cls O and a directed edge 
from [/;] to [J] for each n-neighbor J € I. 


By definition, the adjacency matrix of the n-Brandt graph is the n-Brant matrix 
T(n). 


41.2.13. Let p { N be prime and suppose that the class of p generates Clg(o) R. Then 
by Proposition 28.5.18, we may take the ideals J; to have reduced norm a power of 
p. Consider the directed graph whose vertices are the right O-ideals whose reduced 
norm is a power of p with directed edges for each p-neighbor relation. (This graph is 
a regular directed tree by Proposition 41.3.1 below, every vertex has out degree equal 
to Np + 1.) The equivalence relation of belonging to the same right ideal class (left 
equivalent by an element of B*) respects edges, and the quotient by this equivalence 
relation is the p-Brandt graph. 
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Remark 41.2.14. The Brandt graphs have interesting graph theoretic properties: they 
are Ramanujan graphs (also called expander graph), having high connectivity and 
are potentially useful in communication networks. In the simplest case where F = Q 
and B is the quaternion algebra of discriminant p, they were first studied by Ihara, 
then studied in specific detail by Lubotzky—Phillips—Sarnak [LPS88] and Margulis 
[Marg88]; for further reading, see the books by Lubotzky [Lub2010] and Sarnak 
[Sar90]. Over totally real fields, see work of Livné [Liv2001] as well as Charles— 
Goren—Lauter [CGL2009]. The proof that Brandt graphs are Ramanujan relies on the 
Ramanujan—Petersson conjecture, a deep statement proven by Deligne [Del74], giving 
bounds on coefficients of modular forms. 


Remark 41.2.15. The space of functions on Cls O can itself be understood as a space 
of modular forms, a special case of the theory of algebraic modular forms due to 
Gross [Gro99]. This general formulation harmonizes with the double coset description 
given in section 38.7, via the canonical bijection Cls O © B*\ BX / O*, but without the 
geometry! 


41.3 Commutativity of Brandt matrices 


In this section, we examine basic properties of Brandt matrices—including that they 
commute. 


Proposition 41.3.1. The following statements hold. 


(a) The sum of the entries >}; T(n)i; in every column of T(n) is constant; if n is 
coprime to N, then this constant is equal to Dain N), where N(a) = #(R/a) is 
the absolute norm. 

(b) If m, n are relatively prime, then 


T(mn) = T(m)T(n) = T(n)T(m). (41.3.2) 


Proof. First we prove (a). The orders O; are locally isomorphic, so by the local- 
global dictionary for lattices (Theorem 9.4.9), the number of invertible right O;-ideals 
with given reduced norm is independent of j, giving the first statement. For the 
second statement, under the hypothesis that n is coprime to N, for all p | n we 
have Op ~ M2(Rp), and we counted right ideals in our pursuit of the zeta function: 
by Proposition 26.3.9, these counts are multiplicative, and by Lemma 26.4.1(b), the 
number of reduced norm p° is 1 +q +-+- + q° where q = N(p). 

Statement (b) follows similar logic but with “unique factorization” of right ideals. 
As above, an invertible right Oj;-ideal of reduced norm mn by Lemma 26.3.6 factors 
uniquely into a compatible product of invertible lattices of reduced norm m and n: 
organizing by classes, this says precisely that 


h 
T(mn)j; = > T(m)KT (Nx; (41.3.3) 
k=l 


which gives the matrix product T(mn) = T(m)T(n). Repeating the argument inter- 
changing the roles of m and n, the result is proven. o 
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For prime powers coprime to St, we have a recursion for the p’-Brandt matrices 
that is a bit complicated: the uniqueness of factorization fails when the product is a 
two-sided ideal, so we must account for this extra term. To this end, we need to keep 
track of the effect of multiplication by right ideals of R on the class set. 


41.3.4. For an ideal a C R, let P(a) € M, (Z) be the permutation matrix given by 
I; + al;. In other words, we place a 1 in the (i, j)th entry according as [alJ;] = [i] 
(with 0 elsewhere). The matrix P(a) only depends on the class [a] € Cl R: in particular, 
if a is principal then P(a) is the identity matrix. Therefore we have a homomorphism 


P : CIR > GL; (Z) 
[a] > P(a). 


We have 
P(ab) = P(a)P(b) = P(b)P(a) 


and in particular P(a)P(a-!) = 1 and the image P(CIR) € GL,(Z) is an abelian 
subgroup; however, this map need not be injective. Moreover, for all a, n we have 


P(a)T(n) = T(n)P(a) (41.3.5) 


by commutativity of multiplication by a. 


As in 26.4.3, we say an integral right O-ideal J is primitive if we cannot write 
I =al’ with /’ integral anda C R. 


Proposition 41.3.6. Let p { N be prime. Then for r,s € Zs0, 


min(r,s) 
T(p’ )T(p*) = 3. NEIT (p +T P(p)’. (41.3.7) 
i=0 
In particular, for all r > 0, 
T(p"**) = T(P DT (p) — NTP") PQ). (41.3.8) 


Proof. When s = 0, the matrix T(1) is the identity and the result holds. We next 
consider the case s = 1, and will then proceed by induction, and consider the product 
T(p" )T(p): its ijth entry 


h 
TODT = X TOTO 


k=1 
counts the number of compatible products of right ideals J/ J’ where J? is an invertible 
O;, Ox-ideal with nrd(J;.) = p” and J’ is an invertible Ox, O;-ideal with nrd(J’) = p. 
The issue: these products may not all be distinct when they are imprimitive. If the 
product J} J’ is imprimitive, then we rewrite it as a compatible product 


KI sO Cy Tl = pTI 
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where now J’_, = p-|J/J’ has reduced norm p’~!. This procedure works in reverse as 
well. 

With apologies for the temporarily annoying notation, define Treat’) and 
Taman) to be the p’-Brandt matrix counting classes of primitive or imprimi- 
tive, accordingly. Then 


TG y= Lota) Tips (41.3.9) 
Under multiplication by p, we have 
Tm JT" PO): (41.3.10) 


Since there are N(p) + 1 right O-ideals of reduced norm p, with the previous paragraph 
we obtain 


T(p" )T(p) = Tprim(p”*') + (N(p) + 1) Timprim(p”*") 
= T(p"*!) + N(p)Timprim(p”*!) (41.3.11) 
= T(p"*!) + N(p)T(p""!)P(p). 


This proves the result for s = 1, and it gives (41.3.8) upon rearrangement and shifting 
indices. 
We now proceed by (an ugly but harmless) induction on s: 


T) (TO) + NTD) 


= X (NTP) (41.3.12) 


i=0 
F Oe O 


so 
TONTO) = X (NEYT PCY) 
i=0 
4 NCP) Pep) +! pepr =20+0)) 
= y N(p)'*! PEET (p++) (41.3.13) 
i=0 
= >) Nowy PepyT(p 1) 

i=0 

as claimed. o 


Definition 41.3.14. The Hecke algebra T(O) is the subring of M} (Z) generated by 
the matrices T(n) with n coprime to Nt. 


Corollary 41.3.15. The ring T(O) is a commutative Z-algebra. 
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Proof. By Proposition 41.3.1(b), we reduce to showing that T(p” )T(p*) = T(p*)T(p’) 
for all r,s > 0, and this holds by Proposition 41.3.6: the right-hand side of (41.3.7) is 
symmetric under interchanging r, s. o 


Example 41.3.16. Let F = Q(V10) and R = Zr = Z[V10] its ring of integers. 
Then the class group CIR = Z/2Z is nontrivial, represented by the class of the ideal 
p2 = (2, V10), and the narrow class group Cl* R = CIR is no bigger: the fundamental 
unit is 3 + V10 of norm —1. 
Let B = (-1,-1 | F). Since 2 is not split in F, the ramification set Ram B is the set 
of real places of F. A maximal order is given by 
O=Rep;'(1+) ep; '(1 +per HH t 


We find that # Cls O = 4, and 


0 0 0 1 
0 0 32 
3 100 


In this case, the matrix P(p2) is the identity matrix: for example, we have p20 = (1+i)O. 
Thus 


T(p3) = T(2R) = T(po)* - 2 = 


CONF 
whoo 
whoo 


1 
6 
0 
0 


41.4 Semisimplicity 


We now equip the space M2(O) = Map(Cls O, Z) with a natural inner product, and we 
show that the Hecke operators are normal with respect to this inner product. 


41.4.1. For [7] € Cls O, we define wiz) := [OL()* : R*]; this is well-defined, as a 
different choice of representative gives an isomorphic (conjugate) order. 
Let 1,7; be the characteristic function of [7] € Cls O; then 1,7; for [7] € Cls O form 


a basis for Mz(O). We define the bilinear form 
» »: Mo(O) x M2(O) — Z 
(€, ): M2(O) x M2(O) (41.4.2) 
(pp La) = we Outs) 


where ôr] = 1,0 according as [J] = [J] or not, and extend linearly. The matrix of 
this pairing in the basis of characteristic functions is the diagonal matrix diag(w;);, 
where w; = [OF : R*]. The pairing is symmetric and nondegenerate. 


The inner product (41.4.2) defines an adjoint T => T*. 
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Proposition 41.4.3. We have 
Py = PO) (41.4.4) 
T(n)* = PDT (n). (41.4.5) 


The Hecke operators T(n) are normal with respect to the inner product (41.4.2), and 
for n trivial in Cl* R the operators T(n) are self-adjoint. 


Proof. We may show the proposition for the Brandt matrices. Let W = diag(w;); define 
the inner product on Z” with the Brandt matrices acting on the right on row vectors. 
Then the inner product is (x, y) = xW yt and accordingly the adjoint (xT, y) = (x, T* y) 
is defined by 

T* = WTW (41.4.6) 


The transpose of a permutation matrix is its inverse and that OL (nJ;) = OL (I;), so 
that the unit groups match up, whence 


PO) = PM! = P’) (41.4.7) 
giving (41.4.4). 
For the Brandt matrices, we refer to Lemma 41.2.7(a), giving 
1 
T); = —# {a € II" : nrd(a)R = nij} /R* 
Wi 
where n;; = nnrd(J;)/ nrd(7;). Let 
O(n);; = {a & LI’ :nrd(a)R = ny} [R*. 


By (41.4.6), 
WT (n) = (O(n);j)i j =: O(n). 


We extend the definition of ©(n) to include all fractional ideals n. For each į, let 
i’ be such that [n7 !;] = [J], so that n™!H; = Bil;; the induced action is given by the 
permutation map P(n'). 


O(n);; = O(n) ji 


(41.4.8) 
a m (ab)! = By! a" 
is well-defined and bijective. 
Indeed, if œ € GI! then 
== nrd(/;) 
weit =p =11;'—2 41.4. 
«E J'i i J J nrd(J;) ( 9) 


since JT = nrd(J) for an invertible R-lattice 7, so 


a`! en hi = pily I7" 
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and therefore B'a! € TT; as claimed. And nrd(q@) = n;; implies nrd(a~!) = nny 
so nrd(6;!a7') = njiv. We can run the argument in the other direction to produce an 
inverse, and we thereby conclude the map is bijective. 

The map (41.4.8) together with the action by permutation and Wt = W yields 


WT(n) = O(n) = P(v!)O(n)t = PMTHWT (n) 


and thus T(n)* = P(n)*T(n), and substituting (41.4.7) gives (41.4.5). 

For the final statement, by (41.3.5) we have T(n) commuting with P(n), so T(n) 
commutes with T(n)*; and when n is narrowly principal, then P(n!) is the identity 
matrix so T(n)* = T(n). Oo 


By the spectral theorem in linear algebra, we have the following corollary. 


Corollary 41.4.10. T(O) is a semisimple commutative ring, and there exists a basis 
of common eigenvectors (eigenfunctions) for the Hecke operators. Each T(n) with n 
narrowly principal has real eigenvalues. 


41.5 Eichler trace formula 


In this section, we compute the trace of the Brandt matrices in terms of embedding 
numbers. We continue notation from the previous section. 

We begin by recalling the main ingredients. Let K > F be a separable quadratic field 
extension and let S C K be a quadratic R-order. Let A(S) = #Pic S. Let m(S, O, O%) 
be the number of O*-conjugacy classes of optimal embeddings S — O. Then by 
Theorem 30.4.7, 


Dd) mS, OL); O10") = h(S)m(S, 0; 0%) (41.5.1) 
[I]€Cls O 


We also recall 


h 
1 
mass(Cls O) := > — 
Wi 

i=l 


and that the Eichler mass formula (Main Theorem 26.1.5) gives an explicit formula for 
this mass in terms of the relevant arithmetic invariants. 


Main Theorem 41.5.2 (Trace formula). [fn is not narrowly principal, then tr T(n) = 0. 


If a = nR is narrowly principal with n € F% then 


eae l ifn = °R, c € F* 
70) = 5 Y Mn 6:0 « [MMH ifn= CR ce 


wats) YS , otherwise 


where ws := [S* : R*] and the sum is over finitely many triples (u, t, S) where: 
e u belongs to a set of representatives of Rž R9; 
o t € R satisfies t? — 4un € F% and 


< 


e SD R[x] /(x? — tx + un). 
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Proof. We have trT(n) = r T(n);;. By Lemma 41.2.7, since n; = n we conclude 
trT(n) = 0 if n is not narrowly principal. So suppose n = nR is narrowly principal, 
with n € FX). Then by (41.2.9) we have 


1 
wil (ii = = ` #{a € O; : nrd(a) = un}. (41.5.3) 
a uRERX , (Re 


We are free to organize by reduced trace, giving 


wT (n) = ; > b> #la € O; : trda) = t, nrd(@) = un}. (41.5.4) 


u teR 


Since B is definite, we have disc(a) = t* — 4un either zero or totally negative, so the 
inner sum is over finitely many t € R either satisfying t? = 4un or t? — 4un € Fo, 

If t? = 4un (equivalently ~ = t/2 € F), then n = nR = cR with c = +t/2; 
conversely, if n = nR = c*R for some c € F*, then there exists a unique representative 
u € RX)/ R”? such that un = c?. Consequently, exactly when n = c°R is a square of a 
principal ideal, there is a contribution of (1/2)(2) = 1 to the sum. 

For the remaining terms, we have a ¢ F and R[a] = R[x] /(x? -tx+un) is a domain. 
The embedding R[a] — O; need not be optimal, but nevertheless corresponds to the 
optimal embedding S — O; for a unique superorder S$ 2 R[a], and conversely. We 
count these up to units: the action of conjugaton by u € O% centralizes such an 
embedding if and only if u € S*, so letting ws := [S* : R*] we have 


#(a € O; : trd(a) = t, nrd(a) = un} = X m(S,O;;O%) E. 
S2R[x]/(x?-tx+un) ws 
Plugging these into (41.5.4), we obtain 
1 x 
wiT(Mi=5 >) 2, mS O75 0} ie 


u,t SDR[x]/(x2-tx+un) 


(41.5.5) 
F 1, ifn = (cR}°; 
0, otherwise. 
Dividing through by w; and summing (41.5.5), we have 
trT(n) = D D Zmes, O;; O%) + Die = (41.5.6) 


2 TA i= 


where 6 = 1, 0 according as n is a square of a principal ideal or not. Now substituting 
(41.5.1) and the definition of mass, the theorem is proven. o 


Corollary 41.5.7. We have 
h(S) 


Ws 


1 x 
#Cls O = mass(Cls O) + 5 oy m(S, 0; O*) 


(u,t,S) 


where the sum is over (u,t,S) with n = 1. 
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Proof. For n = R, we have T(1) the identity matrix, so tr T(1) = #Cls O. o 


Corollary 41.5.7 gives a different way to prove (and interpret) the Eichler class 
number formula (Main Theorem 30.8.6): for the exact comparison, see Exercise 41.2. 


Example 41.5.8. Suppose F = Q and R = Z. For d € Z a nonsquare discriminant, 
we write Sy := Z[(d + Vd)/2] for the unique quadratic ring of discriminant d, so 
Ws, = [S> : Z*] = 1 except for d = —3, —4. In the trace formula (Main Theorem 
41.5.2), the quaternion algebra that appears is definite, and so the only quadratic orders 
that embed are imaginary quadratic, with discriminant d < 0. Simplifying in this way, 
the trace formula then becomes 


TOREJ 3 H(Sa) WS, 0:0") (41.5.9) 


WwW 
teZ df?=t?—-4n<0 Sa 


for n not a square (adding a mass term for n a square). 
To notationally simplify a bit further, we define modified Hurwitz class numbers 


WS) aan 
ho(S) := LO O; O%) 
ws 
where the factor m(S, O; O*) is defined by purely local data, given in section 30.5 for 
maximal orders and section 30.6 for Eichler orders. Writing ho(d) = ho(Sq) for the 


order of discriminant d, we arrive at a pleasing formula: 
1 
rT = 5 ` y ho(d) (41.5.10) 
teZ df?=1?-4n<0 


again for n not a square. 
Taking n = 1 as in Corollary 41.5.7 (and adding back the mass term) gives 


#Cls O = mass(Cls O) + I > ho(t? — 4) 
oa (41.5.11) 
= mass(Cls O) + Tho + 2ho(-3). 


For O an Eichler order, after substitution we recover the Eichler class number formula 
(Theorem 30.1.5). 


Example 41.5.12. As an illustration of Example 41.5.8, we compute tr T(3) for the 
Hurwitz order. We compute the following table of values: 


t d |hSa) ws,  m(Sa0;0*%) | ho(d) 
0 -2 1 1 0 0 
0 -3] 1 3 2 2/3 
+1 -11| 1 1 1Cirig22 |) 2 
+2 -8| 1 1 1-(-812)=1] 1 
+43 -3| 1 3 1-(-3|2=2 | 2/3 
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Summing then gives 
trT(3) = (1/2)(0 + 2/3) + (2 + 14+ 2/3) = 4. 


Indeed, more generally since the Hurwitz order has # Cls O = 1, the matrix T(n) isa 1x 
1-matrix with T(n) = [o(n)] for n odd. This observation implies a nontrivial (and oth- 
erwise surprising) relationship between class numbers of imaginary quadratic orders! 


Remark 41.5.13. Brandt [Bra43, III] defined Brandt matrices in the same paper as 
his groupoid; he called them Hecke matrices, as he claimed to follow parallels with 
certain operators defined by Hecke. Indeed, Hecke [Hec40, §9, Satz 53] conjectured 
that the space of cusp forms of weight 2 on I'9(p) for p prime was spanned by certain 
linear combinations of theta series, and it was this observation that motivated Brandt. 
(Eichler [Eic56a, footnote 16] says that Brandt should not have named them after 
Hecke, since it was really Brandt who interpreted function-theoretic results of Hecke 
using pure arithmetic.) 

Eichler [Eic56a] proved that the ring generated by the Brandt matrices was a com- 
mutative, semisimple ring and proved the trace formula for Brandt matrices [Eic56a, 
§6]. In this early work, he already foresaw the application of Brandt matrices to other 
base fields: as an application, he used Brandt matrices to give class number relations 
between imaginary quadratic fields, and in the function field case these become rela- 
tions among divisor class groups for hyperelliptic curves. Eichler [Eic77, Chapter IT] 
presented the generalization to totally real fields, giving a treatment of Hecke operators, 
Brandt matrices, and theta series, and he proved that the Brandt matrices realize Hecke 
operators in certain spaces of Hilbert modular forms. 

Eichler then later gave a self-contained presentation [Eic73, Chapter II] of the 
theory of Brandt matrices over Q, with the intended application the solution to Hecke’s 
conjecture (suitably corrected), now known as the basis problem for To(p): to give 
bases of linearly independent forms of spaces of modular forms in terms of theta series 
of quadratic forms coming from quaternion algebras. This line of work was followed 
by generalizations by Hijikata [Hij74] and Hijikata—Saito [HS73] for general Eichler 
orders, Pizer [Piz76b, Piz76c] for residually split orders, culminating in a solution over 
the rational numbers to the basis problem by Hijikata—Pizer-Shemanske [HPS89a]. 

The method of proof for the solution to the basis problem is the use of the trace 
formula, for which a key ingredient is the theory of optimal embeddings: see Remark 
30.6.18. 

Indeed, it is much more involved analytically, but one can similarly compute the 
trace of the Hecke operator acting on classical spaces of modular forms or more 
generally spaces of Hilbert modular forms. These trace formulae are quite complicated, 
but one notices that they have a similar shape as the above trace formula; and in fact, 
under certain hypotheses and after restricting to an appropriate new subspace, the traces 
are equal. But since both rings are semisimple, this implies that the same systems of 
eigenvalues for the Hecke operators arise! Such a correspondence was first given by 
Eichler, as above; it was generalized to totally real fields by Shimizu [Shz72] using 
theta series, and the most general formulation given by Jacquet—Langlands [JL70]. This 
correspondence was conjectured to generalize to the principle of Langlands functorial 
transfer: for an introduction to this vast area, see Gelbart [Gel84]. 
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In light of the preceding epic remark, we hope we have inspired the reader to pursue 
the relationship between Brandt matrices and modular forms! Unfortunately, it would 
require another book to respectfully develop this subject. 


Remark 41.5.14. Pizer [Piz80a] was the first to give an algorithm for computing 
classical modular forms using Brandt matrices (on lo(N) for N not a perfect square); see 
also the work of Kohel [Koh2001] over Z. This algorithm was generalized to compute 
Hilbert modular forms over a totally real field of narrow class number 1 by Socrates— 
Whitehouse [SW2005], with algorithmic improvements by Dembélé [Dem2007]. The 
assumption on the class number was removed by Dembélé—Donnelly [DD2008]. A 
survey of these methods are given by Dembélé—Voight [DV2013, $4, §8]. 


Exercises 


Unless otherwise specified, in these exercises let R be a global ring with eligible set 
S C PIF, let B be an S-definite quaternion algebra over F and let O C B be an R-order 
in B. 


1. Extend the definition of the Brandt matrix to include the case n = (0) of the 
zero ideal, following (41.2.8): define T(0);; = 1/w; fori, j = 1,...,. Conclude 
tr T(0) = mass(Cls O). 

2. Show that Corollary 41.5.7 agrees with Main Theorem 30.8.6. [Hint: organize 
by q := [S* : R*], observe that in S 2 R[x]/(x? — tx + u) we have necessarily 
[SX : R*] > 2andeach such S contains q—1 orders of the form R[x] /(x?—tx+u).] 

3. Refine Lemma 41.2.7(c) in a special case as follows. Suppose CI* R is trivial. 
Show that 

T(n)ij = —#a € I;I" : nrd(a) = nij} 
where w; 1 := #O}. 

4. Suppose R = Z and F = Q, suppose disc B = p is prime and O is a maximal 
order. Show that 

1, if p = 2,3; 


el (aay, fo > 3. 


where 
h(-4p)/2, if p = 1 (mod 4); 
ho(-4p) = 4 h(-p), if p = 7 (mod 8); 
2h(-p), ifp = 3 (mod 8)and p > 3. 
What does this say about the number of maximal orders in B up to isomorphism 
such that every two-sided ideal is principal? 
5. Give another proof of Proposition 41.3.1(b) using the local-global dictionary for 
lattices. 
6. Prove (41.3.12) using induction and then expand to verify (41.3.13). 
-1,-1 
Q 


1 
7. Let B= ) with disc B = 11 and let O = ZG, AG +1)). 
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(a) Show that O is a maximal order with #O* = 4 
(b) pel that the ternary quadratic form associated to O is similar to x? — xz + 
243z? 
(c) Show that C1O = {[O], [/2]} where h = 20+ (1 +2i + j)O. [Hint: Follow 
Example 17.6.3.] Along the way, show that 


3 
rol 3) 
(d) Pause and show that O2 := O(h) has #0% = 6 and associated ternary 
quadratic form x? — xy — xz + y? + yz + 4z. 
(e) Show that M2(O) has two eigenspaces for the Hecke algebra, one spanned 
by a form e with T(p)(e) = (p + De for all p + 11, and the other spanned 


by a form f with T(2)(f) = -2f. 
(£) Verify the trace formula (41.5.9) for tr T(2) = 1 by computing class num- 
bers. 


[There is a unique normalized cusp form f € So(To(11)) of weight 2 and level 
11 with 


f@=a| [a-a?| [0-4-2-8 +- = Y ang" 
n=1 n=1 


matching f in the sense that T(n)f = anf for all 11 { n.] 


. Let O(@,5) be an Eichler order of level 5 and reduced discriminant 15 (in a 


quaternion algebra B of discriminant 5) and similarly let O(5, 3) be an Eichler 
order of level 3 and reduced discriminant 15. 


(a) Show that #Cls O(3, 5) = #Cls O(5, 3) = 2 

(b) Show that there is a unique eigenvector for the Brandt matrix 7(2) for 
both orders with eigenvalue —1. As far as you can compute, show that this 
eigenvector shares the same eigenvalues for T(n) both orders. 


. We consider an example of Brandt matrices not restricted to maximal orders. 


Let B = (= =) and let 


O = Z(2i,2j) = Z ® Z2i) ® Z(2j) © Z(4ij). 


(a) Show that O is an order with discrd O = N = 64. 
(b) Compute that #Cls O = 4. 
(c) Under the action of the Brandt matrices T(n), show that there are 3 irre- 


ducible factors of dimensions 1, 1, 2. In a basis of characteristic functions, 
identify the one-dimensional factors as: 


1 [e] 
(1,1, 1,1) Se= a 


(1-1-1, 1) © e(a) = 55 + ` o*(n)x(n)q” 
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-1 
where now o*(n) := > d and y(n) = (= 
n 
2łd\n 
[The two-dimensional space has basis (1, 0, 0, —1), (0, 1, —1,0) e fi, fo and 
ap( fı) = ap( f2) for all p, with 


fi=q +2 -3g - 692 +24" +... 


corresponding to the isogeny class of the elliptic curve E : y? = x3 + x of 
conductor 64.] 


This chapter is licensed under the terms of the Creative Commons Attribution- 
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Chapter 42 


Supersingular elliptic curves 


In the previous chapter, we showed that Brandt matrices for an order in a definite 
quaternion algebra B contain a wealth of arithmetic. In the special case where disc B = 
p is prime, there is a further beautiful connection between Brandt matrices and the 
theory of supersingular elliptic curves, arising from the following important result: 
there is an equivalence of categories between supersingular elliptic curves over F, and 
right ideals in a (fixed) maximal order O C B. We pursue this important connection in 
this chapter for the reader who has a bit more background in algebraic curves. 


42.1 Supersingular elliptic curves 


In this section, we briefly review what we will need from the theory of elliptic curves; 
see Silverman [Sil2009] for further general reference. Let F be a field with algebraic 
closure F”!, 


Definition 42.1.1. An elliptic curve is a smooth projective curve (variety of dimension 
1) of genus 1 equipped with a rational point. Every elliptic curve E is isomorphic over 
F to the projective curve associated to the affine equation 


E : y? +ajxy + ajy = xX? + ax? + a4x + a6 


with a; € F. 


Definition 42.1.2. An isogeny ¢: E — E’ is a nonconstant morphism of pointed 
curves; such a map is automatically surjective and a group homomorphism, with the 
marked point as origin. 


Let Hom(E, E’) be the collection of isogenies from E to E’ defined over F; if we 
need to allow isogenies defined over a larger field, we will similarly extend the field 
of definition of our elliptic curves. Then Hom(E, E’) is a torsion-free Z-module of 
rank at most four. Let End(Z) := Hom(E, E) be the endomorphism ring of E and let 
End(£)g := End(E) 8z Q be the endomorphism algebra. 


42.1.3. For each nonzero isogeny ¢: E — E’, there exists a dual isogeny ¢Y : E’ > E 
such that ø“ o ø and ¢ o ¢” are equal to multiplication by the degree deg ø € Zs on 
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E and E’, respectively. In particular, the dual Y is a standard involution on End(£) that 
is positive (see 8.4.1); the Q-algebra End(E)g := End(£) @z Q is therefore a division 
ring. In particular, we have the equality 

o 1 

~ deg¢ 
in End(£)g for all nonzero ø € End(E£). 


g! ov (42.1.4) 


From now on, let E be an elliptic curve over F. 


Lemma 42.1.5. The endomorphism algebra End(E)g of E is either Q, an imaginary 
quadratic field K, or a definite quaternion algebra over Q. 


Proof. We apply Theorem 3.5.1 to conclude that End(£)g is either Q, a quadratic field, 
or a division quaternion algebra. Then by Example 8.4.2, the involution is positive if 
and only if End(£)p is R, C, or H, so in the second case we must have an imaginary 
quadratic field and in the third case we must have a definite quaternion algebra. o 


42.1.6. Among the possibilities in Lemma 42.1.5, if End(E pa)Q is a quaternion alge- 
bra, then we say E is supersingular. 


See Silverman [Sil2009, §V.3] for a treatment of supersingular elliptic curves. 


Proposition 42.1.7. If E is supersingular, then char F = p > 0. Moreover, the follow- 
ing are equivalent: 


(i) E is supersingular; 
(ii) E[p(F® = {0}; and 
(iii) the map |p] : E > E is purely inseparable and j(E) € Faz 


If F is a finite field, then these are further equivalent to 
(iv) trd(¢) = ¢+¢Y = 0 (mod p), where ġ: E —> E is the Frobenius endomorphism. 
Proof. See Silverman [Sil2009, V.3.1]. o 


42.1.8. One can often reduce questions about supersingular elliptic curves to ones 
where the base field F is F „2 as follows: by Proposition 42.1.7Gii), if E is supersingular 
then E is isomorphic over F” to a curve E defined over F p 


The following fundamental result is due to Deuring [Deu41]; we give a proof due 
to Lenstra [Len96, §3]. 


Theorem 42.1.9. Let E be an elliptic curve over F and suppose that rkz End(E) = 4. 
Then B = End(E)g is a quaternion algebra over Q ramified at p = char F and œ, and 
End(E) is a maximal order in B. 


In particular, if over F we have dim End(£) = 4, then automatically E has all of its 
endomorphisms defined over F. 
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Proof. Let O = EndE C B = End(E)g. Let n > 0 be prime to p. Then there is an 
isomorphism [Sil2009, Corollary III.6.4(b)] 


E[n] = E[n (F® = Z/nZ © Z/nZ 


as abelian groups, and the endomorphism ring of this abelian group is End E[n] = 
Mo2(Z/nZ). 

We claim that the structure map O/nO — End E[n] is injective, which is to say, 
E[n] is a faithful module over O/nO. Indeed, suppose ¢ € O annihilates E[n]; then 
since multiplication by n is separable, by the homomorphism theorem for elliptic curves 
[Sil2009, Corollary III.4.11] there exists y € O such that ¢ = nw, so ¢ = 0 € O/nO, 
proving injectivity. But further, since #O/nO = #End E[n] = nf, the structure map is 
an isomorphism. 

Since O is a free Z-module, we have 


Oe := O 8z Ze = O 8z lim Z/E"Z ~ lim O/C" O. 
— — 
The structure isomorphisms in the previous paragraph are compatible with respect to 
powers of £, so with the previous line they provide an isomorphism 


Or > lim End E[é"] = Endz, E[€™] ~ M2(Ze) 
of Ze-algebras, and in particular O¢ is maximal and Be ~ M2(Qœe) so B is split at £. 

Since B is definite, it follows from the classification theorem (Main Theorem 
14.1.3, equivalent to quadratic reciprocity) that Ram(B) = {p, œ}, so Bp is a division 
algebra over Qp. 

To conclude, we show that Op is maximal. For ¢ € O an isogeny, let deg; ¢ be 
the inseparable degree of ¢, which is a power of p. We put deg; 0 = oo. Then deg; ¢ 
is divisible by q = p” if and only if ġ factors via the qth power Frobenius morphism 
E > E”, The map 


v : End(E)g > QU {œ} 
1 (42.1.10) 
v(aġ) = ordp(a) + 5 ord, (deg; ¢) 


for a € Q and ¢ € End(£) is well-defined (since deg,[p] = deg[p] = p°). Factoring an 
isogeny into its separable and inseparable parts shows that 


ord, (deg; 6) = ordp(deg ¢) = ordp(nrd ¢) 


so (42.1.10) is precisely the valuation (13.3.1) on B = End(E)ọ extending the p-adic 
valuation on Q. (See also Exercise 42.2.) 

To conclude, we show that O,,) is the valuation ring (13.3.3) of B and is therefore 
maximal (Proposition 13.3.4). If a € Op) = O 8z Zip) then dega € Zip) so a is 
in the valuation ring. Conversely, let a € B be a rational isogeny with v(@) > 0, 
and write a = ad where ¢ is an (actual) isogeny not divisible by any integer. Then 
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v(@) = ord, (a) + v(d) = 0 and 0 < v(¢) < 1/2, since the multiplication by p is purely 
inseparable; so ord,(a) > —1/2 and therefore a € Zip), and hence œ € Op). 

Finally, since an order is maximal if and only if it is locally maximal, O itself is a 
maximal order in the quaternion algebra B. o 


In light of 42.1.8, we now let F = Fal be an algebraic closure of F,. Let E, E’ be 
elliptic curves over F. If E is isogenous to E’, then E is supersingular if and only if £’ 
is supersingular (see Exercise 42.1). The converse is also true, as follows. 


Lemma 42.1.11. Let E, E’ be supersingular elliptic curves over F. Then Hom(E, E’) is 
a Z-module of rank 4 that is invertible as a right End(E)-module under precomposition 
and a left End(E’)-module under postcomposition. 


In particular, if E, E’ are supersingular elliptic curves over FY, then there exists a 
separable isogeny E > F’. 


Proof. We may suppose E is defined over a finite field F} such that E has all of its 
endomorphisms defined over F,. Let r € O = End(E) be the g-power Frobenius 
endomorphism. Then B = O 8z Q is a quaternion algebra over Q. Since End(£) is 
defined over F4, the endomorphism z commutes with every isogeny a € O, and so 
m lies in the center of O; since Z(B) = Q, we have m € Z = Z(O). But degr = 
AK = n? = qson = + /q € Z. Therefore #E(F,) = q + 1 + 2/q. Therefore 
#E(F a2) = g +1-2q=(q-1). 

Continuing to enlarge F4, we may repeat the above argument with E’ to conclude 
that #E(F,) = #E£’(F,). It then follows that E, E’ are isogenous over F4 [Sil2009, 
Exercise HI.5.4(b)], but we will show this and more. Let £ + p be prime and let 
T(E) = lim E[é"] = Zi is the f-adic Tate module of E. By the Isogeny Theorem 
[Sil2009, Theorem HI.7.7(a)], for every prime £ + p, the natural map 


Homp, (E, E’) ® Ze > Home, (T(E), Te(E’)) 


is an isomorphism, where Hom, (E, E’) denotes the group of isogenies E > E’ 
defined over Fq and Homp, (T¢(E), Te(E’)) denotes the group of Ze-linear maps that 
commute with the action of the g-power Frobenius Galois automorphism. In the first 
paragraph, we showed that this Frobenius action is scalar so commuting is automatic, 
and 

Home, (Te(E), Te(E’)) = Hom(Z;, Z) ~ M2(Ze); 


and Homr, (E, E’) = Hom(E, E’) has rkz Hom(E, E’) = 4. 

Finally, we can precompose by endomorphisms of O so Hom(E£, E’) is a torsion- 
free Z-module with a right action by O. Let y € Hom(E£, E’) be nonzero and let 
WY : E’ — E be the dual isogeny. Then J := wY Hom(E, E’) © O is an integral 
right O-ideal; since O is a maximal order by Theorem 42.1.9, the right O-ideal 7 
is necessarily invertible (see 23.1.1), and the same then holds for Hom(E, E’) as a 
right O-module. The same is true as a left End(E£’)-module, and these two actions 
commute. o 
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42.2 Supersingular isogenies 


We now investigate the quaternionic endomorphism rings of supersingular elliptic 
curves in more detail; we use Waterhouse [Wate69, §3] as our main reference. Let E 
be a supersingular elliptic curve over F := FY, let O := End(£), and let B := O@Q. 
By Theorem 42.1.9, we have Ram(B) = {p, co}, and O C B is a maximal order. Thus 
disc B = p = discrd O (recalling section 15.1). 

We temporarily and briefly need the language of group schemes, at the level of 
Waterhouse [Wate69]. The reader who is unfamiliar with this language is advised to 
skip to 42.2.4 and restrict consideration to left ideals Z with nrd(/) coprime to p (or 
equivalently, separable isogenies). 


42.2.1. Let Z C O be a nonzero integral left O-ideal. Since O is maximal, necessarily 
I is locally principal (in particular, invertible) by Proposition 16.1.2. 
We define E[/] C E to be the scheme-theoretic intersection 
El] := () Ea] (42.2.2) 
ael 
where E[a] = ker «œ as a group scheme over F. 
Accordingly, there exists an isogeny ¢; : E — Er where Er = E/E[IJ]. 


42.2.3. We will not need much about the theory of group schemes except that we 
can measure the degree of an isogeny via the rank of its kernel. Let H < E(F) bea 
finite F-subgroup scheme, for example, H := ker ¢ for ø: E — E’ an isogeny. Then 
H = Spec Ay is affine and Ay is a finite F-algebra; we define the rank of H by 
tk H := dimp Aq. In all cases, we have rk ker ø = deg ¢, even when ¢ is inseparable. 


This general, scheme-theoretic construction can usually be given plainly, as follows. 


42.2.4. If there is a nonzero a € I giving a separable isogeny a: E —> E, then 
E[I](F) = {P € E(F): a(P) = 0 for all a € I}. (42.2.5) 


We then have the more familiar separable isogeny ¢; : E —> E/E[I] with ker(¢;) = 
E[Z] [Sil2009, Proposition III.4.12], and rk ker 6; = #ker ¢;(F) = deg @y. 

What remains are inseparable isogenies. Since Ram B = {p, co} and O is maximal, 
by Theorem 18.1.3 (more generally, see 23.3.19), there is a unique two-sided O-ideal 
P c O of reduced norm p. Then the map E > Ep = E) is the p-Frobenius map, 
and rkker@p = p = deg@¢ even though (ker #)(F) = {0}. The equality P? = pO 
corresponds to the fact that E [P?] = E [p], and this lies behind the fact that j (E) € F 2 
as in Proposition 42.1.7. 

Accordingly, a left O-ideal Z can be written uniquely as J = PI with nrd(/’) 
coprime to p, and this corresponds to a factorization 


ġr: E > Ep > Er (42.2.6) 


with Ez ~ Epr/Epr[I’], the isogeny E — Epr purely inseparable and the isogeny 
Epr — Ey separable. (This corresponds to the factorization of the extension of function 
fields into first a separable extension, then a purely inseparable extension [Sil2009, 
Corollary I.2.12].) 
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Lemma 42.2.7. The pullback map 
*: Hom(E;, E) > I 
$r (En B) (42.2.8) 
pr wor 


is an isomorphism of left O-modules. 


Proof. The image of Hom(£7, E) under precomposition by ¢; lands in End(£) = O and 
factors through ¢; so lands in J by definition. The map ¢7 is an injective homomorphism 
of abelian groups. It compatible with the left O-action, given by postcomposition on 
Hom(E£;, E) and left multiplication on Z. 

To conclude, we show it is also surjective. Let a € J; then a(E[/]) = {0} by 
construction. If œ is separable, then «œ factors through ¢; : E — Ey [Sil2009, Corollary 
IlI.4.11]. In general, we factor øz as in (42.2.6), and then combine the separable case 
in the previous sentence with the p’-Frobenius map. o 


Finally, we can identify the right module structure as follows. 


Lemma 42.2.9. The ring homomorphism 


1: End(Er) => B 


P (42.2.10) 
UB) = ġr Bo, = 


1 


V 
EEF ($r btr) 


is injective and (End(E7)) = Or(J). 
Proof. The equality in (42.2.10) is justified in (42.1.4). The content of the lemma 
follows from the identification in the previous Lemma 42.2.7, by transporting structure: 
for 6 € End(£7) acting by precomposition, we fill in the diagram 
$r 
Hom(E;, E) ——> I 
s| je (42.2.11) 
$ 
Hom(Ey, E) —> 


to find that 
B (Wor) = WBbr = Woro BOD (42.2.12) 


and so z defines the induced action on J by right multiplication, giving an inclusion 
uEnd(E7)) € Or(/). But End(£7) is a maximal order and Or(J) is an order, so equality 
holds. o 


Next, we show that the isomorphism class of E7 depends only on the left ideal class 
of I. 


Lemma 42.2.13. If J = IB C O with B € B*, then Er = Ej. 


Proof. First, suppose 6 € O. Then 
E[IB] = {P € E(F): aB(P) = 0 for all œ € J}. 
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We claim that GE[/B] = E[/]. The containment (C) is immediate. For the containment 
(2), let Q € E[/]. Since £ is surjective (it is nonconstant), there exists P € E(F) such 
that B(P) = Q. Thus for all a € J, we have (@8)(P) = a(Q) = 0 so P € E[I£]. By the 
claim, we conclude that rg = ¢,f and Erg = Er. 

In general, there exists nonzero m € Z such that mf € O. By the previous paragraph, 
we have isomorphisms Ez = Ermp) = Eugym ~ Erg. oO 


So far, we have shown how to pass from classes of left O-ideals to (isogenous) 
supersingular elliptic curves via kernels. We can also go in the other direction. 


42.2.14. Given a finite subgroup scheme H < E(F), we define 
I(H) := {a € O : a(P) = 0 for all P € H} CO; 


then Z(H) is a left O-ideal, nonzero because #H € I(H). 
If Hı < H < E(F) are two such subgroups, then (H1) 2 [(A2). 


Lemma 42.2.15. If Hı © H and I(H,) = I(M2), then H, = Hp. 


Proof. Let ¢,: E — E/H. Factoring, without loss of generality we may assume that 
¢ is either separable or purely inseparable. Suppose first that ¢; is separable, and let 
n = #H)(F). By the proof of Theorem 42.1.9, the structure map O/nO — End E[n] is 
faithful. So if H? > H4, then there exists œ € O such that @(H;) = {0} but a(H2) + {0}, 
so [(H2) + I(H1). Second, suppose that ¢; is purely inseparable: then Hı = ker ø; is 
the kernel of the rı-power Frobenius for some rı > 0, and Z(H) = P™ as in 42.2.4. 
Then p" € I(H,) = I(H2), so E — E/H; is also purely inseparable, and H3 = ker $7 
and J(H2) = P’. We conclude rı = r2, and then Hı = Ad. oO 


Proposition 42.2.16. The following statements hold. 


(a) deg @; = nrd(J). 
(b) (ELI) =I. 


Proposition 42.2.16 justifies the use of overloaded notation. Our proof follows 
Waterhouse [Wate69, Theorem 3.15]. 


Proof. We begin with (a). We first prove it in an illustrative special case. Suppose 
I = OB is a principal left O-ideal. Then E[/] = E[£] where ¢; = 8: E > E, and 


deg 8 = BB = nrd(p) = nrd(J) 


is the constant term of the (reduced) characteristic polynomial of £. 


We now return to the general case. We first show that deg(7) | deg ¢;. Let O’ := 
OrU) = O_.U7!). By Exercise 17.5, there exists œ € B* such that J’ = I~!a c O' 
is in the same right O’-ideal class as 77! and with nrd(/’) coprime to deg øz. Thus 
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II’ = Oa C O is a compatible product and œ € O. By the previous paragraph, we have 
deg @; = nrd(/7’) = nrd(J) nrd(/’). 


The map ¢7,; factors through ¢7, so deg ¢; | deg rr. Since nrd(/’) is coprime to 
deg r we have deg ¢; | nrd(/). 

Repeating this argument, we have deg ¢; | nrd(J’) as well. We combine these to 
conclude (a). We have 


(deg ġr)(deg ør) | nrd(7) nrd(’) = deg $1. (42.2.17) 
But 
deg ġrr = rk E[I/’] | (rk E[/])(k E[/’]) = (deg $7)(deg 7’). (42.2.18) 


Putting together (42.2.17)-(42.2.18), we see that equality holds, so deg øz = nrd(J/). 
Now we prove (b). Let J = I(E[/]). Then J € J since JE[J] = {0}; thus E[/] 2 
E[J]. At the same time, we have 


EV] = () Ele] = () Ela] 2 ( \ Ela] = E[7] (42.2.19) 
aes acO ael 
a@E[I]={0} 

so equality holds and E[/] = E[J]. Thus deg ¢; = deg ¢,;. By (a), we have 
nrd(/) = deg ¢; = deg g; = nrd(J). (42.2.20) 


Since 7 € J, we have O c JI~!. From Proposition 16.4.3 and (42.2.20), we have 
nrd(JI~!) = [O : JI“!] = 1 so O = JI~! and therefore I = J. o 


Corollary 42.2.21. For every isogeny ¢: E — E’, there exists a left O-ideal I and 
an isomorphism p: Er > E’ such that ¢ = pr. Moreover, for every maximal order 
O’ C B, there exists E’ such that O’ = End(E’). 


Proof. Let H be the scheme-theoretic kernel of ¢. Then H C E[I(A)], so ġr(m) factors 
through ¢ with ¢7;H) = pọ for some isogeny p: Erm) > E’. But I(H) = I(E[I(A))) 
by Proposition 42.2.16, so H = E[I(H)] by Lemma 42.2.15. Thus deg ġr(m) = deg ¢, 
and so deg p = 1 and p is an isomorphism, with ¢ = p~! rm). The second statement 
follows similarly using a connecting ideal between orders (see section 17.4). o 


We may now compare endomorphisms analogously to Lemma 42.2.7. 


Lemma 42.2.22. Let I, I’ C O be nonzero integral left O-ideals. Then the natural map 
Hom(£;, E) Hom(Er, Er) > Hom(Er, E) 
is bijective, giving a further bijection 
Hom(Ep, Er) > (I : Dre = DI 


=} (42.2.23) 
pr ġr Yor. 


42.3. EQUIVALENCE OF CATEGORIES 791 


Proof. By Lemma 42.2.7, we have Hom(£7, E)¢; = I. By Proposition 42.2.16, we 
have m = deg ¢; = nrd(J). The left ideal J € O is invertible thus m = nrd(J) € IJ, 
hence there exist finitely many a;, 8; € Hom(£7, E) such that 


im) = Dard Bid’ = > (iby BY) = >| elm|By (42.2.24) 
therefore 
[1] = ` aiBY € End(E). (42.2.25) 


For y € Hom(Er, E), postcomposing with (42.2.25) gives 
y= > ai(ßY y) € Hom(E7, E) Hom(Ep, Er) (42.2.26) 


l 


so the natural injective map is bijective. This gives 
Iġī! Hom(Er Engr = I’ (42.2.27) 


and thereby the bijective map (42.2.23), using Exercise 16.11 for the relationship to 
the colon ideal.. o 


42.3 Equivalence of categories 


We now show that the association from supersingular elliptic curves to right ideals 
is an equivalence of categories. We recall that F is an algebraically closed field with 
p = char F prime. 


42.3.1. Let Ep be a supersingular elliptic curve over F := Fa; it will serve the role as 
a base object. Let Oo := End(Eo) and Bo := Oo 8 Q. 
Theorem 42.3.2. The association E =œ Hom(E, Eo) is functorial and defines and 
equivalence between the category of 

supersingular elliptic curves over F, under isogenies 
and 


invertible left Oọ-modules, under nonzero left Oo-module homomorphisms. 


Remark 42.3.3. Written this way, the functor Hom(-, Eo) is contravariant. One can 
equally well take Hom(£o, —) to get a covariant functor with right Oo-modules; using 
the standard involution, these are seen to contain the same content. 


Proof. To begin, we need to show Hom(-, Eo) is a functor. The association E t> 
Hom(E, Eo) makes sense on objects by Lemma 42.1.1 1. On morphisms, to an isogeny 
go: E > E’ we associate 


¢*: Hom(E’, Eo) > Hom(E£, Eo) 


42.3.4 
wr po. 
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The map ¢* is a homomorphism of left Oo-modules, since it is compatible with 
postcomposition with Op = End(£), so Hom(-, Eo) is functorial. 

Next, we claim that Hom(-, Eo) is essentially surjective. Let J be an invertible 
left Oo-module. Tensoring with Q we get an injection Ip) — Jp ® Q = Bo, so up to 
isomorphism of left Og-modules, we may suppose 7 € Bo. Scaling by an integer, we 
may suppose I C Op is a left Og-ideal. Let Ez = E/E[I]. By Lemma 42.2.7, we have 
Hom(E;, Eo) = I as left Oo-modules, as desired. 

Finally, we show that Hom(-, Eo) is fully faithful, i.e., the map 


Hom(E, E’) — Hom(Hom(E£’, Eo), Hom(£, Eo)) 
ong 


(the former as isogenies, the latter as homomorphisms of left Og-modules) is bijective. 
This bijectivity is made plain by an application of Corollary 42.2.21: there is a left Oo- 
ideal J such that E = Eo,7; applying this isomorphism, we may suppose without loss 
of generality that E = Eo. Then by Lemma 42.2.7, we have J = Hom(Eo,;, Eo) ¢0,1. 
Repeat with E£’ and 7’. Then after these identifications, we are reduced to the setting 
of Lemma 42.2.22 (with the location of the prime swapped): the map 


Hom(E ,7, Eor) > O : Dr = I7 
Y > by oor- 


is indeed bijective. o 


(42.3.5) 


Remark 42.3.6. See also Kohel [Koh96, Theorem 45], where the categories are 
enriched with a Frobenius morphism. 


Corollary 42.3.7. (Deuring correspondence). There is a bijection between isomor- 
phism classes of supersingular elliptic curves over F and the left class set Cls Oo. 
Under this bijection, if E [I], then End(E) ~ Op(J) and Aut(E) ~ OR(D*. 


Proof. Take isomorphism classes on both sides of the equivalence in Theorem 42.3.2, 
and compare endomorphism groups and automorphism groups. (We had to work with 
left Og-modules in the equivalence of categories, but each isomorphism class of objects 
is represented by a left Oo-ideal J € B.) Oo 


42.3.8. From the Eichler mass formula and Corollary 42.3.7 (swapping left for right, 
as in Remark 42.3.3), we conclude that 
1 p-1 


= ame (42.3.9) 


1 
2 # Aut(E) > #O.* 24 


[T]eCls R 
where the sum on the left is over isomorphism classes of supersingular elliptic curves 
over F = Fä, 

Similarly, from the Eichler class number formula (Theorem 30.1.5), the number of 
isomorphism classes of supersingular elliptic curves over F is equal to 


ae aaa 
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Remark 42.3.10. We can generalize this setup slightly as follows. Let M € Z>o be 
coprime to p, and let Co < Eo(F) be acyclic subgroup of order M. Then End(Eo, Co) = 
Oo(M) is an Eichler order of level M and reduced discriminant pM in Bo. Ina similar 
way as above, one can show that Hom(-, (Eo, Co)) defines an equivalence of categories 
between the category of supersingular elliptic curves equipped with a cyclic M-isogeny 
(under isogenies identifying the cyclic subgroups), to the category of left invertible 
Oo(M)-modules (under homomorphisms). The mass formula now reads 


1 1 p-1 


#AutE, C) FO 04 YM: 


[(E,C)] [I]€Cls Oo(M) 


One can also consider instead the category of cyclic M-isogenies ¢: E > E”. 


Example 42.3.11. Consider p = 11. The algebra B = ZA 
and the maximal order O = Z<i, (1 + j)/2). We have #Cls O = 2, with the nontrivial 
class represented by the ideal J generated by 2 and 1 + i(1 + j)/2. 

We have O* = (i) of order 4 and O, (I) = (1/2 —i(1 + j)/4) of order 6, and indeed 
1/4+ 1/6 = 10/24 = 5/12. The two supersingular curves modulo 11 are the ones with 
j-invariants 0 and 1728 = 1 (mod 11), and End(E) = O if j(E) = 1728 whereas for 
End(E’) = O’ we have Hom(E, E’) = I, in other words, E’ = E /E[I]. 


has discriminant 11 


Example 42.3.12. We return to Example 41.1.2. The order O = O; is the endomor- 
phism ring of the elliptic curve E; : y? = x3 — x with J(E\) = 1728 = 3 (mod 23), 
and similarly E>: y? = x(x — 1)(x + 2) with j(E2) = 19 and E3: y? = x? +1 with 
j(E3) = 0. We have 2w; = # Aut(£;) is the order of the automorphism group of E;. And 
the p-Brandt graph is the graph of p-isogenies among the three supersingular elliptic 
curves over F 3. 


42.3.13. Finally, and most importantly, in the above correspondence the entries of 
the Brandt matrix T(n) have meaning as counting isogenies. For n coprime to p, the 
entry T(n);; is equal to the number of subgroups H < E;(F) such that E;/H = Ej. 
This statement is just a translation of Lemma 42.2.22. Gross [Gro87] gives a beautiful 
and essentially self-contained presentation of the results of the previous chapter in the 
special case that disc B = p. 


Remark 42.3.14. The approach via supersingular elliptic curves connects back in 
another way: Serre [Ser96] gives an alternative approach to modular forms modulo 
p in a letter to Tate: one evaluates classical modular forms at supersingular elliptic 
curves and then relates these to quaternionic modular forms modulo p. 


42.4 Supersingular endomorphism rings 


In this section, we give a second categorical perspective, giving a base-object free 
refinement of Corollary 42.3.7 following Ribet [Rib1989, p. 360-361] (who credits 
Mestre—Oesterlé). To get there, we need to deal with a small subtlety involving the 
field of definition (fixed by keeping track of extra data). Recall that F = Fa. 
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Lemma 42.4.1. Let O be a maximal order. Then there exist one or two supersingular 
elliptic curves E up to isomorphism over F such that End(E) = O. There exist two 
such elliptic curves if and only if j(E) € F,2\Fp if and only if the unique two-sided 
ideal of O of reduced norm p is not principal. 

Proof. In Corollary 42.2.21, we proved that there is always at least one supersingular 
elliptic curve E with End(£) ~ O using a connecting ideal. We now elaborate on this 
point, refining our count. 

By Corollary 42.3.7, the isomorphism classes of supersingular elliptic curves are 
in bijection with the left class set Cls, Oo; their endomorphism rings are then given by 
End(E) =~ Or(/) for [I] € Cls, Oo. By Lemma 17.4.13 (interchanging left for right), 
the map 

Cls Oo — Typ Oo 


(42.4.2) 
[1], > class of On(/) 


is a surjective map of sets. The connecting ideals are precisely the fibers of this map, and 
by the bijection of Corollary 42.3.7, there is a bijection between the set of supersingular 
elliptic curves E with End(E) =~ O and the fiber of this map over the isomorphism 
class of O. 

We now count these fibers. We recall Theorem 18.1.3 with D = p and the text 
that follows (interchanging left for right): the fibers are given by the quotient group 
Pld! O\Idl O of the two-sided invertible fractional two-sided O-ideals by the subgroup 
of principal such ideals. There is a surjection Pic(O) — PIdl O\Idl O and Pic(O) = 
Z,/2Z is generated by the unique maximal two-sided ideal P of reduced norm P. The 
class of P in the quotient is trivial if and only if P = Or is principal. 

To conclude we recall 42.2.4: the Frobenius map is the map E > Ep = EP). So 
P is principal if and only if EP? = E if and only if j(E) = j(E™) = j(E)? if and only 
if j(E) € Fp. o 


We dig into this issue a bit further. 


42.4.3. Let E be a supersingular elliptic curve over Fa, Let w be a nonzero invariant 
differential on E. Then there is a ring homomorphism 
a: End(E) > FY 
a (42.4.4) 
$= agp, where ¢*w = agw 


(see Silverman [Sil2009, Corollary III.5.6]) independent of the choice of w. 


In light of 42.4.3, we make the following definitions. Let B be a quaternion algebra 
over Q of discriminant disc B = p; such an algebra B is unique up to isomorphism. Let 
O C B be a maximal order in B; then discrd O = p. 


Definition 42.4.5. An orientation of O is a ring homomorphism O > Fa, 


42.4.6. We claim that there are two possible orientations of O. In fact, P = [O, O] 
is the commutator ideal (cf. Exercise 13.7) and an orientation factors through the 
commutator. Since nrd(P) = p, localizing we have O/P ~ Op/Pp = F „2 by Theorem 
13.3.11(c). The claim follows as there are two possible inclusions F > Fa. 
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The notion of isomorphism of oriented maximal orders is evident. 


Definition 42.4.7. An isomorphism of oriented maximal orders from (O, Z) to (O’, 2’) 
is an isomorphism of orders ¢ : O —> O’ such that 2’¢ = <. 


We define the set of reduced isomorphisms to be Isom(£, E’)/{+1}. 


Proposition 42.4.8. The association E œ (End(E) © End(E)g, a) is functorial and 
induces an equivalence from the category of 


supersingular elliptic curves over Fä, under reduced isomorphisms 
to the category of 


oriented maximal orders (O C B, Z) 
in a quaternion algebra B of discriminant p, 
under isomorphisms. 


In the latter category, we do not choose the representative of the isomorphism 
class of quaternion algebra of discriminant p; it is tagging along only to provide a 
quaternionic wrapper for the order. 


Proof. The association has the right target by Theorem 42.1.9 for the order and 42.4.3 
for the orientation. This association is (covariantly) functorial with respect to isomor- 
phisms. Indeed, if y : E — E’ is an isomorphism of elliptic curves, then we have an 
induced isomorphism 

End(E) — End(E’) 


or wo. 
that is compatible with composition. The isomorphism (42.4.9) is also compatible with 


orientations, as follows. Let w’ be a nonzero invariant differential on E’; then W*w’ is 
so on E. Thus for all ¢ € End(£), we have 


agp w =P Vo ES E O = Wwe yo! 
= WG gy) = Vy gy o 


(42.4.9) 


(42.4.10) 


s = ag, which is the desired compatibility. 


0 Fy gy! 

The functor is essentially surjective, which is to say that every oriented maximal 
order arises up to isomorphism: that every maximal order arises is a consequence of 
Corollary 42.2.21, and that the orientation may be so chosen corresponds to applying 
the Frobenius morphism, by 42.2.4 and 42.4.6. 


Finally, we show the map is fully faithful, which is to say the map of finite sets 
Isom(E, E’)/{+1} > Isom((End E, £), (End E’, £’)) 


from (42.4.9) is bijective. Any two reduced isomorphisms on the left differ by an 
automorphism of E, and the same on the right, so it suffies to show that the map 


Aut(E)/{+1} > Aut((End(£), 2)) 


42.4.11 
ve (dR vv!) C ) 
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is bijective. Let O = End(£), so Aut(E) = O*. Then Aut(O) = Ngx(O)/Q”*, and 
1 > O* /{+1} > Aut(O) => AL(O) > 1 


where the Atkin—Lehner group AL(O) is nontrivial (and isomorphic to Z/2Z) if and 
only if P = zO is principal; but conjugation by z acts nontrivially on O/P and fails to 
commute with ¢ so does not act by an automorphism of (End(£), 2); thus 


Aut(E)/{+1} = O*/{+1} = Aut(O, 2). 
(We did not need to choose a base object in order to define this equivalence!) o 


Corollary 42.4.12. There is a bijection between isomorphism classes of supersingular 
elliptic curves over F and oriented maximal orders in a quaternion algebra B of 
discriminant p. 


Proof. Again we take isomorphism classes of objects in Proposition 42.4.8. o 


Exercises 


1. Let E, E’ be elliptic curves over F and suppose E, E’ are isogenous (necessar- 
ily by a nonzero isogeny). Show that E is supersingular if and only if E’ is 
supersingular. (Hint: show dimo End(E fa)g = dimo End(E z )Q; or show that 
deg;([p]) = deg,([p]’) where [p], [p] are multiplication by p on E, E’.] 


2. Let E be an elliptic curve over F with char F = p. Show that for all ¢, Y € End(£), 


we have 
deg;(gy) = deg; (¢) deg; (y) 
deg, (¢ + Y) = min{deg, ¢, deg, Y}. 
Conclude that |¢| = 1/deg,;(¢@) defines a nonarchimedean absolute value on 
End(E)p). 


3. In this exercise, we give an alternate “hands on” proof of Lemma 42.2.9. 


Let E be a supersingular elliptic curve over F = FS, let O = End(£) and let 
B = O 8Q. Let J C O be a nonzero integral left O-ideal, and let ġz: E > Er 
where Er = E/EĻ[I]. Consider the pullback isomorphism ¢,: Hom(E;, E) > I 
by y => wéy in (42.2.8). 


(a) Show that ¢* induces an isomorphism of O-module endomorphism rings 
p: End(Hom(E;, E)) => End(/) 
ap gag! . 


(b) Show that End) = Or(Z)°?. 

(c) Show that End(Hom(£;, E)) = End(£7)°?. [Hint: End(E7) is a maximal 
order, so the natural inclusion is an equality. ] 

(d) Conclude that p° induces an isomorphism End(£7) = Og(J). 
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(e) Show that p° is the map z in (42.2.10). 


4. Let B be a quaternion algebra over Q with disc B = p prime. Recall the definition 
of the Brandt matrix (e.g. (41.1.1), and more generally 41.2.2). 


Let J; fori = 1,...,/ be the representatives of the left class set Cls, O, with 
h = #Cls, O. Let E; be the supersingular elliptic curves over Fa corresponding 
to J; in Corollary 42.3.7. 


(a) For every m > 1, show that T(m);j is equal to the number of subgroup 
schemes C of order m in E; such that E;/C = E;. 

(b) Show that T(p) is a permutation matrix of order dividing 2 and that T(p”) = 
T(p)’ for allr > 1. 

(c) Show that E; is conjugate by an element of Aut(F3) to E; if and only if 
i = j or T(p);; = 1. Conclude that the number of elliptic curves £; defined 
over F, is equal to tr T(p). 
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Chapter 43 


QM abelian surfaces 


In this final chapter, we consider quotients of the upper half-plane by quaternionic 
unit groups as generalizations of such quotients from the matrix group (Chapter 40), 
realizing them as moduli spaces for abelian surfaces with quaternionic multiplication. 
This chapter can be seen as a culminating application of all of the parts of this book, 
and for that reason, is necessarily more advanced. Concepts are reviewed in the attempt 
to be self-contained, but additional background in algebraic and arithmetic geometry 
is suggested. 


43.1 > QM abelian surfaces 


Recall (40.1.1) that the curve SL2(Z)\H? parametrizes complex elliptic curves up to 
isomorphism: to T € H?, we associate the lattice A; := Z + Zr and the elliptic curve 
E, := C/A, and the association 


SL2(Z)\H? © {Complex elliptic curves up to isomorphism} 


(43.1.1) 
SL2(Z)t > [E7] 


is bijective. Moreover, we have a biholomorphic map j : SL2(Z)\H? — C, which is 
to say, two complex elliptic curves are isomorphic if and only if they have the same 
j-invariant. We compactify to X := SL2(Z)\H”* by including the cusp at ov. 


As in section 38.1, we are led to seek a generalization of (43.1.1), replacing 
B = M.(Q) with a quaternion algebra. To this end, let B be an indefinite quaternion 
algebra over Q of discriminant D, let O C B be a maximal order, and let 


lœ : B > B 8&9 R ~ M2(R) 


be an embedding (explicitly, we may take (38.1.1)). The order O is unique up to 
conjugation in B (by strong approximation) and similarly the embedding tœ is unique 
up to conjugation in M2(R), so these choices are harmless. Let 


T! (O) := t(01)/{+1} < PSL: (R). 
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The quotient I 1(O)\H? is compact when B + M>(Q); for uniformity, we define 
Xt := T'O)\H™, 


where H?® = H**,H? according as D = 1 or D > 1. Then X! is a good (compact) 
complex 1|-orbifold. 

We may then ask: what does X! parametrize? The answer is, roughly: X! para- 
metrizes complex abelian surfaces with endomorphisms by O. The correspondence 
itself is as pleasingly simple as for elliptic curves (43.1.1). To a point T € H?, we 
associate 


Az := teo(O) H Eg 
A, = C2/A (43.1.2) 
lr := lo: O => End(A;) 


Then A; is a complex torus of dimension 2 and «+ is an injective ring homomorphism, 
realizing endomorphisms of A; by O. 

However, there are a number of technical points required to make this completely 
precise. We quickly survey the theory of complex abelian varieties in section 43.4. One 
basic fact of life is that not every complex torus has enough meromorphic functions to 
give it the structure of a complex abelian variety embedded in projective space. One 
needs a polarization given by a Riemann form, and the simplest polarizations are the 
principal polarizations. (One can think of this rigidification as the difference between a 
genus 1 curve and an elliptic curve, where the genus 1 curve is equipped with a point.) 
A principal polarization defines positive involution on the endomorphism ring, called 
the Rosati involution. 

This rigidification is matched on the quaternion order: a principal polarization 
on O is an element u € O such that u? + D = 0. Every (maximal) order has a principal 
polarization, and the involution a + a* = p!@yp is a positive involution on O. A 
quaternionic multiplication (QM) structure by (O, u) on a principally polarized 
complex abelian surface A is an injective ring homomorphism O —> End(A) that 
respects the positive involutions on B and End(A)qg. 

The happy fact is that A, as defined in (43.1.2) has via u a principal polarization 
and thereby QM by (O, u). In other words, the choice of the QM structure determines a 
canonical principal polarization: but it gives a finite amount of additional information, 
as there will in general be more than one QM structure on a principally polarized 
abelian surface. In many cases, these structures can be understood in terms of the 
Atkin—Lehner group 


AL(O) = Ngx(O)/Q*O* = I] Z/2Z (43.1.3) 
p|D 
acting by automorphisms of X!. 


In any event, the main result of this chapter (Main Theorem 43.6.14) is that this 
association is bijective. 
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Main Theorem 43.1.4. The map 


(A, t) principally polarized 
1 2 complex abelian surfaces 
TF (O)\Hf © with QM by (O, 4) (43.1.5) 
up to isomorphism 


T'(O)r & [(Az, tI 


is a bijection. 


This main theorem generalizes (43.1.1): indeed, we may take B = M2(Q) > O = 


M2(Z) and u = i i , and we find that A; ~ E? as principally polarized abelian 
surfaces. 

One feature that makes this theory even more appealing is that abelian surfaces 
arise naturally as Jacobians of genus 2 curves via the Abel—Jacobi map: this motivates 
much of the theory, so we begin with it in section 43.3. In particular, there are functions 
called Igusa invariants analogous to the elliptic j-function that record the isomorphism 


class of a principally polarized abelian surface. 


43.1.6. We then define modular forms as for the classical modular group. Let k € 2Zso. 
A map f: H? = C is weight k-invariant under T = T! (O) if 


fz = (cz +d)‘ f(z) forall y = k ) er. (43.1.7) 


A modular form for T of weight k is a holomorphic function f: H? — C that is 
weight k invariant and is holomorphic at œo, if T = PSL2(Z). Let Mx (T) be the C-vector 
space of modular forms for I. Then Mg(T) is a finite-dimensional C-vector space, and 
by a similar contour integration as in the proof of Proposition 40.3.4, dimc M;(I) can 
be expressed in terms of k and the signature of I. And 


MT) := B MAT) (43.1.8) 


kE2Z>0 


has the structure of a graded C-algebra under multiplication. (When D > 1, there are 
no cusps, so vacuously all modular forms are cusp forms.) 


It would not be unreasonable for us to have started the book here, with this topic 
at front and center. In this chapter, we will do our best to treat the complex analytic 
theory in as complete and self-contained a manner as possible, but this is really just the 
beginning of the subject, one that is rich, deep, and complicated—worthy of a book all 
to itself. For example, the following result is fundamental. 


Theorem 43.1.9 (Shimura [Shi67, p. 58]). There exists a projective nonsingular curve 
X! defined over Q and a biholomorphic map 


g: T'(O)\H? > xX! (©). 
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The curve X! over Q coarsely represents the functor from schemes over Q to sets 
whose values are isomorphism classes of QM abelian schemes, suitably defined. More- 
over, the map ¢ respects the field of definition and Galois action on certain special points 
called CM points on T! (O)\H? obtained as fixed points of elements v € B* with Q(v) 
an imaginary quadratic field. As a result, the curve X! is canonical, uniquely character- 
ized up to isomorphism, and is so called the canonical model. We give some indications 
of this result by example in the next section and more generally in section 43.8. 


43.2 >QM by discriminant 6 


For concreteness, before embarking on our general treatment, we consider in this 
section an illustrative example and one of special interest; it is well-studied and beloved 
by quaternionic practitioners, see Remark 43.2.21 for further reference. 


1,3 
Let B = ) be the quaternion algebra of discriminant 6 studied in sections 
37.8—37.9. As in 37.8.12, we have a maximal order 
Ltitj+i 
O=Z@ZieZjeZk, k= aie 


with k? — k — 1 = 0, and an embedding 
loo: B — M(R) 

E 

, 1 O}’\o -v3 
Let T! = 1.,(O!)/{+1} < PSL2(R) and X! = T!\H?. We computed a compact Dirichlet 
fundamental domain x for T! in 37.9.4, with u(x) = 27/3. Further, we saw explicitly 
in 37.9.10 (and again by formula in Example 39.4.21) that T! has signature (0; 2, 2, 3, 3); 
that is, X! has topological genus g = 0 and there are 4 cone points, two points z2, a © 
with stabilizer of order 2 and two z3, z3 € X with order 3 stabilizer. 

As in Chapter 40, to exhibit a model for X ! we seek modular forms, indeed, we 


now describe the full graded ring of (even weight) modular forms (43.1.8). We will 
use the following essential proposition. 


Proposition 43.2.1. The following statements hold. 


(a) Let f: H? => C be a nonzero meromorphic modular form of weight k for T!, 
not identically zero. Then 


1 k 
pe TA rh) = z. 
ae # Stabyi (z) 6 
(b) We have 
1, if k = 0; 
dimc M4(T!) = 4 0, if k = 2; (43.2.2) 


1— k +2lk/4] +2Lk/3]), ifk>4. 


43.2. > QM BY DISCRIMINANT 6 803 


Proof. See Theorem 43.9.4; for the purposes of this introduction, we provide a sketch to 
tide the reader over. For (a), we argue just as in Proposition 40.3.4: we integrate dlog f = 
df/f over the boundary of the fundamental domain x and use the identification of sides 
provided by rotation at their fixed points (elliptic vertices), reversing the direction of 
the path so the contributions cancel, and we are left again to sum angles. The details are 
requested in Exercise 43.7. For (b), we can get upper bounds on the dimension using (a), 
but to provide lower bounds we need to exhibit modular forms, and these are provided 
by the Riemann-Roch theorem. For example, for k = 2, we have dimc M2(T'!) = g = 0 
by (40.2.11). o 


43.2.3. We are now in a position to prove an analogous statement to Theorem 40.3.11. 
Referring to Proposition (43.2.1), by part (a) we seek a1, a2, a⁄, a3, a’, € Zso with 
a2 + a, a3+a, k 
+ + =F. 43.2.4 

“MR 3 6 han 
By part (b), we have dimç M,(T!) = 0 for k < 0, and indeed, there are no solutions. 
For k = 0, there is a unique solution corresponding to the constant functions. For 
k = 2, there are no solutions, as follows. Let f(z) € Ma(T 1), Let y3 be a generator for 
the stabilizer at z3. Then 


F3) = f(y3z3) = J03; 23)" f3); 


by the cocycle relation, we have 1 = JOR; z3) = J(y3;z3} a nontrivial cube root of 
unity, so f(z3) = 0 and a3 > 0. Similarly a’, > 0, and this contradicts (43.2.4). 

Arguing in the same way, we find that the unique solution for k = 4 is a = a, = 0 
and a3 = a’, = 1; thus Ma(T!) = Cf, and f4 necessarily vanishes at z2, Za: Similarly, 
for k = 6 we have only a3 = a’, = 0 and az = a, = 1, with M6(T') = Cge. 


Continuing as in 43.2.3, we collect dimensions and spanning functions as in Table 
A325. 


k | dimc MT’) Spanning functions 

0 1 1 

2 0 P 

4 1 fa 

6 1 &6 

8 1 i 

10 1 S486 

12 3 fp. 8 hi2 

24 5 dd ee ds 84, Bei 


Table 43.2.5: Generators for M(T) 


43.2.6. In weights 8, 10, we have products of forms seen previously. In weight k = 12, 
we find a third function hy. € Mj2(T!) spanning together with Ji: gg. Continuing in 
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this way, finally in weight k = 24, we find 6 functions in a 5 dimensional space, and 
so they must satisfy an equation r(fa, g6, h12) € CL fa, g6, 412], homogeneous of degree 
24 if we give f4, g6, h12 the weights 4, 6, 12. 


Proposition 43.2.7. We have 


CL fa, 86, h12] 


MTH ~ ae i, 
© (r(fa, 86, h12)) 


Proof. The bound on the degrees of generators and relations in Theorem 43.9.6 makes 
this proposition immediate. It is also possible to give a proof with bare hands: see 
Exercise 43.9. o 


43.2.8. We do not have Eisenstein series available in this setting, but the notion of 
taking averages 40.1.19 is still quite sensible: we find what are known as Poincaré 


series. Recall j(y;z) = cz + d for y = k ‘| € SL,(R). The square j(y; z)? is 


well-defined on y € PSL2(R). 
For k € 2Z>2, we define the Poincaré series 


Pr(z) = >) se). 


yer! 


Then P;(z) is nonzero, absolutely convergent on H?, uniformly on compact subsets, 
and satisfies 


Pryz) = Jy; z)* Pez). (43.2.9) 


The convergence statement is fairly tame, because the fundamental domain X is com- 
pact: it implies that the total integral 


(Im z)* 
x OS Zl 


is finite, and the Poincaré series converges (absolutely) by comparison [Kat85, §1, 
Proposition 1]. The equality (43.2.9) follows from the cocycle relation (40.2.5). There- 
fore Px(z) € S(T), and in particular we may take f4 = P4 and g6 = P6; with a bit more 
computation, one can also show that P, Pe P\2 are linearly independent, so that we 
may take hi2 = P12 as well. 


du(z) < oo 


43.2.10. A convenient and meaningful normalization of the functions above is given 
by Baba—Granath [BG2008, §3.1]. 

First, there are exactly two (necessarily optimal) embeddings $ = Z[V—6] = O 
by Example 30.7.5: we have #Cls O = 1 and K(V—6) is ramified at p = 2,3 | D = 6, so 
m(S, O; O”) = h(Z[V—6]) = 2. The fixed points of these two embeddings are distinct 
points z6, z € H. Explicitly, we note that 


u=3i+ij=-1+2i-j+2k (43.2.11) 


has u? + 6 = 0, and choose its fixed point as ze. 
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We rescale ge so that Fp (26) /82(z6) = V-3, and we choose h12 such that h12(z6) = 
hy2(z6) = 0, and rescale so that 


r(fa, 86 h12) = hi + 384 + fÉ = 0. (43.2.12) 
Corollary 43.2.13. The holomorphic map 
(0) > P? 
z = (FG (2) : 86) : h22) 


has image the conic X! : x? + 3y? + z? = 0, defining the canonical model over Q. 


(43.2.14) 


Proof. This result is attributed to Ihara by Kurihara [Kur79, Theorem 1-1(1)]; it is 
proven by Baba—Granath [BG2008, Theorem 3.10] along the lines above. o 


We note that X!(R) = 0; this is a general feature, see Proposition 43.7.2. 


43.2.15. The Atkin—Lehner group 
AL(O) = Ngx(O)/Q*O* = (Z/2Z) 


has three nontrivial involutions w2, w3, we with representatives having positive reduced 
norm. Explicitly, we have we = u by (43.2.11) and w2 = 1 +i and w3 = we/w2 = 
1+i—j +k. These involutions act on the space of modular forms as follows [BG2008, 
§3.1]:So for example f4(w2z) = —fa(z). 


| w2 W3 W6 


= = 
86 oe we Se 
hi2 == + = 


These involutions act on the canonical model X! by w(x: y : z) = (x: —y : z), 
w3(—x : y : z), and we(x : y : z) = (x : y : =z). 


We choose a principal polarization (see Definition 43.6.4) on O by y in (43.2.11). 
In this way, Main Theorem 43.1.4 provides that the curve X! parametrizes abelian 
surfaces with QM by (O, w). 


43.2.16. The forgetful map [(A;, tr)] # [A7] which forgets the QM structure is the 
map [BG2008, Proposition 3.9] 


j: X! = P! 
6y2 (43.2.17) 
(x:y:z)= 
9x2 


generically 4-to-1. The map j can fruitfully be thought of as an analogue of the classical 
elliptic j-invariant, mindful of the above technicalities: it parametrizes principally 
polarized complex abelian surfaces that can be equipped with a QM structure. 
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The Igusa invariants 43.3.5 of A; where j = j(7) are given by [BG2008, Proposition 
3.6] 
(h : I4 : I6: No) 
= (12(j +1): 60? +7 +1) : 4G? — 27? +1): 7?) (43.2.18) 
€ P(2, 4, 6, 10). 


There exists a genus 2 curve with these Igusa invariants if and only if j = 0, —16/27 or 
the Hilbert symbol 
(-6j, -2(27/ + 16))g = 1 


is trivial. 


Example 43.2.19. The two points with j = 0,00 are exactly those points which are 
not Jacobians of genus 2 curves: these correspond to points with CM by Z[V—1] and 
Z{w], and these abelian surfaces are the squares of the corresponding CM elliptic 
curves (with the product polarization). Elkies [Elk98, §3] computes equations and 
further CM points for discriminant 6. 


Example 43.2.20. The case j = —16/27 corresponds to a CM point with discriminant 
D = —24 [BG2008, §3.3]: it is the Jacobian of the curve 


y? = (1 + V2)x° — 3(7 — 3V2)x4 — 3(7 + 3-V2)x? + (1 - V2) 


isomorphic to the product of the two elliptic curves with CM by Z[V—6] (but not with 
the product polarization). 


Remark 43.2.21. For further reading to connect some of the dots above, see the article 
by Baba-—Granath [BG2008], refining the work by Hashimoto—Murabayashi [HM95, 
Theorem 1.3] who give an explicit family of genus 2 curves whose Jacobians have QM 
by O. 


43.3 Genus 2 curves 


We begin in the concrete setting of genus 2 curves. Let F be a perfect field with 
char F + 2 and let F” be an algebraic closure of F. Let X be a smooth projective curve 
of genus 2 over F. 


43.3.1. Using Riemann—Roch in a manner analogous to the proof for elliptic curves 
(see e.g. Silverman [Sil2009, Proposition II.3.1(a)]), X is given by a Weierstrass 
equation of the form 

y? = f(x) (43.3.2) 


where f(x) € F[x] is squarefree of degree 5 or 6. It follows that X is hyperelliptic 
over F, with map x : X — P! of degree 2. 

If (y’)* = f'(x’) is another Weierstrass equation for X, then it is related by a change 
of variables of the form 


ax+b Pa ey 
cx+ d 4 ~ (cx + dp 


, 


(43.3.3) 
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with ad — bc, e € F*. After such a change of variable, we may suppose without loss of 
generality that deg f = 6. 


Example 43.3.4. Let X”! be the base change of X to F*!. The automorphism group 
Aut(X”) is a finite group containing the hyperelliptic involution (x, y) > (x,—y). 
The possibilities for this group were classified by Bolza [Bol1887, p. 70]: when 
char F # 2,3,5, the group Aut(X*!) is isomorphic to one of the groups 


Co, Va, Dg, C10, D12, 2D 12, Sa 
of orders 2, 4, 8, 10, 12, 24,48. A generic genus 2 curve over F al has Aut(X2!) = C2. 


43.3.5. We now seek invariants of the curve defined in terms of a model to classify 
isomorphism classes. We factor 


mapa 
i=1 
with a; € F*! the roots of f. We abbreviate a; — a; by (i j), and we define 
h := cy 12° 45 6)", 
Is := (4c)* X0 2236 D456 6°64)" 
Is := 40$ X0 D236 1°14 5)"5 66 4° 4)"2 5°36)”. 
ho = 49" | Ja 2, 


where each sum and product runs over the distinct expressions obtained by permuting 
the index set {1,...,6}; by Galois theory, we have J, I4, I6, [io € F. In particular, we 
have 


(43.3.6) 


ho =(40)® | | (i-a? = disc f) #0 


1<i<j<6 


is the discriminant of the polynomial 4f. The invariants J, I4, I6, 10, defined by Igusa 
[Igu60, p. 620] by modifying a set of invariants due to Clebsch, are known as the 
Igusa—Clebsch invariants. 

Under a change of variable of the form (43.3.3), we have 


(E, Iy K, Ko) = (6h, ôl, 8 le 60 ho) 


where ô = e? /(ad — bc)’. Accordingly, we define the weighted (2, 4, 6, 10)-projective 
space 
P(2, 4, 6, 10)(F") := ((F")*\{(, 0, 0, 0)})/~ 


under the equivalence relation 
(b, L, Is, Lio) ~ (6° hy, 5* Ia, 6° Io, 8'° ho) 
for all 5 € (F#!)*; we write equivalence classes (Jz : I4 : I6 : T10) € P(2, 4, 6, 10)(F*). 
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Proposition 43.3.7. The genus 2 curves X and X’ over F are isomorphic over F” if 
and only if 


(h : I4 : I6 : ho) = (5:14:16: Bo) € P(2, 4, 6, 10)(F*'). 
Proof. See Igusa [Igu60, Corollary, p. 632]. Oo 


43.3.8. For arithmetic reasons (in particular to deal with problems in characteristic 2), 
Igusa [Igu60, pp. 617ff] defined the invariants [[gu60, pp. 621—622] 


Jn := 1h/8, 
J4 := (45 — 14)/96, 
Je := (8J3 — 160J2J4 — 16)/576, (3a) 


Jg := (nde — J4)/4, 
Jio := 19/4096, 


now called the Igusa invariants, with (J2 : Ja : Je : Jg : Jio) € P(2,4, 6, 8, 10)(F aly 
Visibly, the Igusa—Clebsch invariants determine the Igusa invariants and vice versa. 


Remark 43.3.10. One can also take ratios of these invariants with the same weight and 
define (three) absolute invariants analogous to the classical j-invariant of an elliptic 
curve, following Cardona—Nart—Pujolas [CNP2005] and Cardona—Quer [CQ2005]. 


Example 43.3.11. The locus of genus 2 curves with given automorphism group (cf. 
Example 43.3.4) can be described explicitly by the vanishing of polynomials in the 
Igusa(—Clebsch) invariants. For example, the unique genus 2 curve up to isomorphism 
over F*! with automorphism group Cio (when char F + 5) is the curve defined by the 
equation y? = x(x° — 1) with (h : I4 : Ie : Jo) = (0: 0:0: 1), with automorphism 
group generated by (x, y) > (g5x, —Z. > y), where és is a primitive fifth root of unity. 


43.3.12. The group Autr (F®™) acts on P(2, 4, 6, 10)(F#) in each coordinate: 
olh : I4 : I6: ho) = (72) : ol) : o(s) : 7U0)) 


for 7 € Autp(F®). Given a point P € P(2, 4, 6, 10)(F*?), we define its field of moduli 
M(P) to be the fixed field of FP under the stabilizer of P under this action. Just as in 
the case of ordinary projective space, the field M (P) is the minimal field over which P 
is defined. 


43.3.13. In this way, given a genus 2 curve, we have associated invariants of the curve 
that determine it up to isomorphism over F*!. We may also ask the inverse problem: 
given Igusa invariants (Jķ)ą with Jio + O, find a genus 2 curve with the desired 
invariants. This problem has been solved explicitly by work of Mestre [Mes91] and 
Cardona—Quer [CQ2005]. 

We give a sketch of the generic case of curves whose only automorphism over F® 
is the hyperelliptic involution, due to Mestre [Mes9 1]: in brief, the field of moduli may 
not be a field of definition for the desired genus 2 curve, but a quadratic extension 
will always suffice. Abbreviate Q[J] = Q[J2, Ja, Je, Jg, J10]. First, Mestre constructs an 
explicit ternary quadratic form L(J) and ternary cubic form M(J) defined over Q[J]. 
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Under substitution of generic invariants, the quadratic form L(J) defines a quaternion 
algebra B(J) over the field of moduli F of the point, and Mestre proves that there exists 
a curve X over a field K 2 F with the desired Igusa invariants if and only if K is a 
splitting field for B(J). The quaternion algebra B(J) is accordingly called the Mestre 
obstruction. Over a field K where B(J) splits, equivalently over a field K where the 
conic defined by L(J) = 0 has a K-rational point, we can parametrize L(J) and by 
substituting into M(J) we obtain a binary sextic form f(x, z) with the property that 
y? = f(x, 1) has the desired invariants. 


43.4 Complex abelian varieties 


Shifting gears, we pause to briefly recall some basic properties of complex abelian 
varieties, needed for our discussion of abelian surfaces. For further reference, see 
Birkenhake—Lange [BL2004], Mumford [Mum70], or Swinnerton—Dyer [Swi74]. 


Definition 43.4.1. A complex torus of dimension g € Zs, is a complex manifold of 
the form A = V/A where g = dimc V and A C V is a lattice of rank 2g. A morphism 
of complex tori V/A — V’/A’ is a C-linear map ¢: V > V’ such that d(A) € A’. 


Let A = V/A be a complex torus of dimension g. Then V ~ C8 and A = Z8 so 
V/A = (R/Z)’8 as smooth real manifolds. 


43.4.2. Suppose for concreteness (choosing a basis) that V = C£, working with column 
vectors. Choose a basis {A;}j=1,..,2¢ for A with A; = (Ay e C£. The matrix II = 
(Aij)i,g € Matgx22(C) is called the big period matrix of the lattice A (with respect to 
the basis {A;};). 

A change of basis of C! corresponds to left multiplication by an element of GLg(C) 
on IT and induces an isomorphism of complex tori. Writing 


=(P: P), with Pj, Py € GL,(C) 


we have P3 IJI = (Q 1), andQ = Py Pie GL,(C). Therefore every complex torus 
is isomorphic to a torus of the form C£ /(QZ& + Z£) for some Q € GLg(C), called the 
small period matrix. 


Definition 43.4.3. A complex torus A is a complex abelian variety if there exists a 
holomorphic embedding A — P”(C) for some n > 1. 


Remark 43.4.4. Every complex torus of dimension g = 1 is an abelian variety, indeed 
an elliptic curve, by the theory of classical Eisenstein series (see 40.1.11). But the case 
g = l is quite special! For a general lattice A C CE with g > 2, there will probably be 
no meromorphic functions on C%/A and in particular there will be no way to realize 
the torus as a projective algebraic variety. 


The conditions under which a complex torus is a complex abelian variety are given 
by the following conditions, due to Riemann. 


Definition 43.4.5. A matrix II € Matgy29(C) is a Riemann matrix if there is a 
alternating matrix E € M24(Z)at with det E # 0 such that: 
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(i) TE7!T* = 0, and 
(ii) V-1I1E~'T* is a positive definite Hermitian matrix, where * = — denotes 
conjugate transpose. 


Conditions (i) and (ii) are called the Riemann relations. 


Theorem 43.4.6. Let A = C8 /(TIZ?8) be a complex torus with TI € Matgy29(C). Then 
A is a complex abelian variety if and only if TI is a Riemann matrix. 


Example 43.4.7. Let f(x) € C[x] be a squarefree polynomial of degree 2g + 1 or 
2g + 2 with g > 1. The equation y? = f(x) defines an algebraic curve with projective 
closure X a nonsingular curve over C. A basis of the holomorphic differential 1-forms 
on X is w; = x !dx/y fori =1,...,¢. 

The set of points X(C) has naturally the structure of a compact (connected) Riemann 
surface of genus g > 1. Let a1, £1, . . -, Œg, Bg be a basis of the homology H(X, Z) and 
suppose that this basis is symplectic: each closed loop a; intersects 8; with (oriented) 
intersection number | and all other intersection numbers are 0, as in the following 
standard picture in Figure 43.4.8. 


Figure 43.4.8: A standard surface with a symplectic homology basis 
The integration pairing 
Q! x H(X, Z) = C 


wu fw 


is nondegenerate, giving a map Hı (X, Z) => Homc(Q!,C). Let 


t 
A= (+ vemaD] CGS, 


A Z-basis of A is given by the integrals with v = a;, Bi fori = 1,..., g. Let 
Jac X := Homç(Q!, C)/Ay(X, Z) = CE /A 


be the Jacobian of X. Then Jac X is a complex torus of dimension g. It has big period 
matrix II = (P; P2), where 


ri=(f os) , P= f o) . 
Qi ij Pi ij 


By cutting open the Riemann surface along the given paths and applying Green’s 
theorem, we verify that the big period matrix I is indeed a Riemann matrix. Therefore 
the Jacobian Jac(X) is an abelian variety of genus g. 


We now upgrade the above to a basis-free formulation. 
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Definition 43.4.9. Let 
E:AxA>Z 
be an alternating Z-bilinear map. Let Ep: V x V —> R is the scalar extension of E over 


R obtained from RA = V. 
We say E is a Riemann form for (V, A) if the following conditions hold: 


(i) Eg(ix, iy) = Eg(x, y) for all x, y € V; and 
(ii) The map 
VxV >R 
(x, y) > Er(ix, y) 


defines a symmetric, positive definite R-bilinear form on V. 


43.4.10. Let E be a Riemann form for (V, A). If we choose a Z-basis for A, we get a 
period matrix II and a matrix for E which is a Riemann matrix (satisfying conditions 
(i)-Gi) of Definition 43.4.5), and conversely. 


Proposition 43.4.11. If E is a Riemann form for (V, A), then the map 
H:VxV >C 
H(x, y) = Ep(ix, y) + iER(, y) 
is a positive definite Hermitian form on V with Im H = Ep. 


Conversely, if H is a positive definite Hermitian form on V such that Im H(A) € Z, 
then Im H|; is a Riemann form for (V, A). 


(43.4.12) 


Proof. This proposition can be checked directly, a bit of linear algebra fun: see Exercise 
43.6. o 


Example 43.4.13. For the torus C/(Zi + Z), the forms 
E(xı + ix, y1 +iy2) = X2y1 — x1y2 
Egr(i(xı + ix2), yı + iy2) = x1y1 — (—x2y2) = X1 y1 + x2y2 
H(x, y) = xy. 


define a Riemann form E, its associated (symmetric, positive definite) real part, and 
its associated (positive definite) Hermitian form H. 


Definition 43.4.14. A complex torus A = V/A equipped with a Riemann form is said 
to be polarized. 

A homomorphism of polarized complex tori is a homomorphism ¢: A > A’ of 
complex tori that respects the polarizations in the sense that the diagram 


AxA Z 
onl 
E’ 
Nxt’ 


commutes. 
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By Theorem 43.4.6 and Proposition 43.4. 11, a polarized complex torus is an abelian 
variety, and accordingly we call it a polarized abelian variety. 
We now seek to classify the possibilities for Riemann forms. 


43.4.15. There is a normal form for alternating matrices, analogous to the Smith 
normal form of an integer matrix, called the Frobenius normal form. Let M be a free 
Z-module of rank 2g equipped with an alternating form E : M x M —> Z. Then there 
exists a basis of M such that the matrix of E in this basis is 


0 D 
where D = diag(d},...,dg) is a diagonal matrix with diagonal entries d; € Zs and 
dı | dz |---| dg. The integers dı, . . ., dọ are uniquely determined by E, and are called 


the elementary divisors of E when all d; > 0 (equivalently, Æ is nondegenerate). 


Definition 43.4.16. A Riemann form E with elementary divisors 1,..., 1 in its Frobe- 
nius normal is called a principal Riemann form. 


Lemma 43.4.17. Let A = V/A be a polarized abelian variety, and suppose the Rie- 
mann form E has elementary divisors d\,...,dg. Then there is a basis for V and a 
basis for A such that the big period matrix of A is (QD) where D = diag(d\,...,d¢), 
and Q is symmetric and ImQ is positive definite. 


In particular, if A is principally polarized, then the conclusion of Lemma 43.4.17 
holds for Q, the small period matrix. 


Proof. Compute the period matrix with respect to a basis in which the Riemann form 
is in Frobenius normal form. o 


Example 43.4.18. Let A; = V/A; and Az = V2/A2 be two polarized abelian varieties, 
with Riemann forms Ej, Ey. Let A = A; X A2 = V/A, where V = Vi @ V and 
A =A, @A2 C Vi V = V. Then A can be equipped with the product polarization 
E = E; E Ed, defined by 


E(x, + x2, y1 + y2) = Ey (1, y1) + E2(x2, y2). 


If E}, E> are principal, then the product E is also principal. 


43.4.19. Polarizations can be understood in terms of duality, as follows. 

Let A = V/A be a complex torus. A C-antilinear functional on V is a function 
f:V—- C such that f(x + x’) = f(x) + f(x’) for all x, x’ € V and f(ax) = af(x) 
for all a € C and x € V. Let V* = Hom@(V, C) be the C-vector space of C-antilinear 
functionals on V. Then V* is a C-vector space with dimc V = dimc V* and the under- 
lying R-vector space of V* is canonically isomorphic to Homg(V, R). The canonical 
R-bilinear form 


vV*xV >R 
(f. x) + Im f(x) 
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is nondegenerate, so 
A* := {f €V* :Im f(A) c Z} 


is a lattice in V*, called the dual lattice of A, and the quotient AY := V*/A* is a 
complex torus. Double antiduality and nondegeneracy gives a canonical identification 
(V*)* = V, giving a canonical identification (AY) = A. 

Now suppose A is polarized with E a Riemann form for (V, A), and let H be the 
associated Hermitian form (43.4.12). Then double duality induces a Riemann form E* 
on (V*, A*), so AY is a polarized abelian variety. We have a C-linear map 


A: V => V* 


(43.4.20) 
xh A(x, -) 


with the property that A(A) C A*. Since the form is nondegenerate, the induced 
homomorphism 2: A — AY is an isogeny of polarized abelian varieties. The degree 
of the isogeny A is equal to the product dı --- dg of the elementary divisors of E, so in 
particular if E is principal then 4 is an isomorphism of principally polarized abelian 
varieties. 


43.4.21. Let A = V/A bea principally polarized complex abelian variety with Riemann 

form E. Let ø: A = AY be the isomorphism of principally polarized abelian varieties 

induced by (43.4.20). Then we define the Rosati involution associated to E (or ¢) by 
t: End(A) > End(A) 

ara’ = plao 


(43.4.22) 


where œ“: AY — AY is the isogeny induced by pullback. The Rosati involution is 
uniquely defined by the condition 


E(x, vy) = E(a‘x, y) (43.4.23) 


for all x,y € A. 


Proposition 43.4.24. The Rosati involution } is a positive involution on the Q-algebra 
End(A) 8 Q. 


Proof. Let B := End(A) ® Q, and let a € B with œ + 0. Let 8 := ata. Since pÝ = B, 
and ‘ is the adjoint involution with respect to the positive definite Hermitian form H, 
the action of £ on V is diagonalizable with positive real eigenvalues: indeed, if x € V 
is an eigenvector with eigenvalue A, then 


AH (x, x) = H(Bx, x) = H(ax, ax) € Rso 


and H(x, x) € Rx, so A € Ryo. The eigenvalues of 6 on End(A) by left multiplication 
are its eigenvalues with some multiplicity, and accordingly the trace tr(B) = tr(a‘ a) is 
a nonempty sum of these eigenvalues, hence is positive. o 
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43.4.25. Following Lemma 43.4.17, we define the Siegel upper-half space 
He ={t€M,(C): tt = t and Imr is positive definite}. 


To T € Hg, we associate the lattice A, = TZE @ ZE c C£ and the abelian variety 
Ar = C8/A, with principal polarization 


E,(tx1 + x2, Ty1 + y2) := Xp ye = xy = (Xf, x) o) 


where J = a al By Lemma 43.4.17, every principally polarized complex abelian 


variety arises in this way. 

Two elements t,t’ € Sy give rise to isomorphic abelian varieties if and only if 
they arise from a symplectic change of basis of A if and only if they are in the same 
orbit under the group 


Spog(Z) = {Y € M2g(Z) : y'Jy = J} 
where Sp7.(Z)ONg acts by 


tHe (at+b\(ct+d)', for É € SP2 (2). 


These maps give a bijection between the set of principally polarized complex 
abelian varieties of dimension g and the quotient 


Ag (C) := Sprg(Z)\Hg- 


By the theory of theta functions, Ag (C) is the set of complex points of a quasi-projective 
variety defined over Q of dimension (g? + g)/2. 


43.5 Complex abelian surfaces 


We now specialize to the case g = 2 of principally polarized abelian surfaces; in this 
section, we describe their moduli and the relationship with genus 2 curves, in analogy 
with elliptic curves (g = 1). 

We recall Example 43.4.7, where abelian varieties were obtained from Jacobians 
of curves—we now specialize this to the case g = 2. The following theorem links 
complex genus 2 curves, via their Jacobians, to complex abelian surfaces. 


Theorem 43.5.1. Let A be a principally polarized abelian surface over C. Then exactly 
one of the two holds: 


(i) A ~ JacX is isomorphic as a principally polarized abelian surface to the 
Jacobian ofa genus 2 curve X equipped with its natural polarization; or 

(ii) A ~ E x E is isomorphic as a principally polarized abelian surface to the 
product of two elliptic curves equipped with the product polarization. 
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43.5.2. In case (i) of Theorem 43.5.1, we say that A is indecomposable (as a principally 
polarized abelian surface, up to isomorphism). It is possible for A to be indecomposable 
as a principally polarized abelian surface and yet A is not simple, so A is isogenous 
to the product of elliptic curves. In case (ii), we say A is decomposable, and this case 
arises if and only if there is a basis e4, e2 for C? such that 


A= Ay e Aveo 
where Aj, A2 C C. 
We now pursue an explicit version of Theorem 43.5.1, linking the algebraic descrip- 


tion (section 43.3) to the analytic description (section 43.4), in a manner analogous to 
the construction of Eisenstein series for elliptic curves (g = 1) in 40.1.11 and 40.1.19. 


43.5.3. For brevity, let IT := Sp4(Z), let 
JT) =ctT+dEM(C), fory = f a cT andr € Sp, 


where a, b,c, d € M2 (Z), and let 


cell Jere 


We define for k € 2Z>2 the Eisenstein series 
Yk: 2 >C 
War) = $, deyo. 


Toy ET \T 


As for classical Eisenstein series, Y(T) is absolutely convergent on compact domains. 
By design, the function yw, has a natural invariance under T: 


Weyt) = (det (y; T) We (7) (43.5.4) 


for ally e I and T E€ po. 
We define two further functions: 


43867 


X10 += -312355271531 446 — V10) 
— 131 - 593 2m2 ,3 le3,2 
X12 := 5133753723371 9 Thy +2 5 We — 69112) 


(The constants are taken so that the Fourier expansion is appropriately normalized; 
their precise nature can be safely ignored on a first reading.) 


The function 710 is somewhat analogous to the classical function A, in the following 
sense. 


Lemma 43.5.5. Let t € 92. Then y10(t) = 0 if and only if A, is decomposable (as a 
principally polarized abelian variety). 
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In other words, the vanishing locus of 19 is precisely where case (ii) of Theorem 
43.5.1 holds and the abelian surface is not isomorphic to the Jacobian of a genus 2 
curve (as a principally polarized abelian surface). 


Remark 43.5.6. More generally, a (classical) Siegel modular form of weight k € 2Z 
for the group I = Sp,(Z) is a holomorphic function f: §2 — C such that 


FOT) = (det (y; 7) f(T) 


for all y € Sp,(Z) and T € $2. (By the Koecher principle, such a function is automati- 
cally holomorphic at infinity in a suitable sense, and so the conditions at cusps 40.2.12 
for classical modular forms do not arise here.) 

Let M;(T) be the C-vector space of Siegel modular forms of weight k; then Mg (I) 
is finite-dimensional, M,(I) = {0} for k < 0, and Mọ(T) = C consists of constant 
functions. Let M(T) = B ke2Zng M,(C); then M(I) has the structure of a graded 
C-algebra under pointwise multiplication of functions. Igusa proved that 


MT) = Cl, Y6, x10, X12] 
in analogy with Theorem 40.3.11. Extending the analogy, Igusa also proved that 
Wa, Wo, —4 X10, 12x12 have integer Fourier coefficients with content 1. 


43.5.7. The Igusa—Clebsch invariants 43.3.5 can be expressed in terms of the functions 
above. The precise relationship was worked out by Igusa [Igu60, p. 620]: we have 


hapy 
X10 
l = 27 
2 Y4xı2 
pos AA 
3 á X10 
No = -2 y10 


The functions 14, I6, X10, X12 are holomorphic, but I2 is meromorphic. In other words, if 
X is a complex genus 2 curve with Jac X = A; for T € %2, then the algebraic invariants 
of X can be computed in terms of the values of these functions evaluated at t. This 
description is again analogous to the case of elliptic curves (cf. Remark 40.3.10). 


Proposition 43.5.8. Two indecomposable principally polarized abelian surfaces Az, 
A, are isomorphic (as principally polarized abelian surfaces) if and only if 


(h : In: I6 : Tot) = (h : I4 : I6 : ho)(r’) € P(2, 4, 6, 10)(C). 


In other words, the Igusa(—Clebsch) invariants are naturally defined coordinates on 
the moduli space Az(C) of abelian surfaces, a complex threefold by 43.4.25. 


Proof. Combine 43.5.7 and Proposition 43.3.7. oO 
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43.5.9. Let A be a principally polarized complex abelian surface. Let End(A) be the 
ring of endomorphisms of A, and let B = End(A) @z Q. By the classification theorem 
of Albert (Theorem 8.5.4), the Q-algebra B is exactly one of the following: 


(i) B = Q and we say A is typical; 
(ii) B = F areal quadratic field, and we say A has real multiplication (RM) by F; 
(iii) B is an indefinite quaternion algebra over Q, and we say A has quaternionic 
multiplication (QM) by B; 
(iv) B = K is aquartic CM field K, and we say A has complex multiplication (CM) 
by K; or 
(v) B = Mo(K) where K is an imaginary quadratic field. 


Each one of these 5 cases is interesting in its own right—but given the subject 
of this book, we concern ourselves essentially with only case (iii), where quaternion 
algebras play a defining role. 


43.6 Abelian surfaces with QM 


In this section, we consider moduli spaces of abelian surfaces with quaternionic mul- 
tiplication. For further reference, see Lang [Lang82, §1X.4—5]. Throughout, let A be 
a principally polarized complex abelian surface with Riemann form E. Let B be an 
indefinite quaternion algebra over Q with disc B = D, let 


loo: B > Br =~ M>(R) (43.6.1) 


be a splitting over R. 


43.6.2. The Rosati involution Ý (defined in 43.4.2 1) is a positive involution on End(A)g. 
We classified positive involutions in section 8.4: specifically, when End(A)p ~ M2(R), 
by Example 8.4.15 there exists u € End(A); with pw? € Repo such that 


a =p lau (43.6.3) 


for all a € End(A). The map t defines a Q-antiautomorphism of End(A), so by the 
Skolem—Noether theorem, we must have u € End(A)*. 


From now on, let O be a maximal order in B. (One can relax this hypothesis with 
some additional technical complications, but there is enough to wrangle with here 
already!) In light of 43.6.2 we make the following definition (cf. Rotger [Rot2004, 
§3]). 


Definition 43.6.4. A polarization on O is an element u € O such that p? € Zeo; a 
polarization is principal if 4? + D = 0. 

An isomorphism of polarized orders (O, u) = (O’, w’) is an isomorphism of orders 
ġ: O — O such that 6(p) = p. 


43.6.5. By Lemma 17.4.2 (employing the Skolem—Noether theorem), an isomorphism 
¢@: O = O' of rings is induced by conjugation by an element of B*. In particular, the 
polarized orders (O, u) and (O, yz’) are isomorphic as polarized orders if and only if 
there exists v € Ngx(O) such that w’ = v7! uv. 
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43.6.6. Every O has a principal polarization. Indeed, by the local-global principle 
for splitting/embeddings (Proposition 14.6.7), the field K = Q(v-D) embeds in B 
because Kp = Q»(V-D) is a field for all p | D. Therefore there exists w’ € B with 
(w)? +D = 0, and p’ € O’ for some maximal order O’. But by a consequence of strong 
approximation (Theorem 28.2.11), O is conjugate to O’, so there exists a conjugate 
u € O with still 2 + D = 0. (By the theory of optimal embeddings, this is also 
immediately implied by Example 30.7.5, which counts the number of O*-equivalence 
classes of optimal embeddings Zg = O.) 


Lemma 43.6.7. Let u be a principal polarization on O. Then u € DO}, i.e., trd(uO) € 
DZ. 


Proof. We may check the desired equality locally. If p { D, then u € O% and 
trd(uO,) = trd(Op) = Zp. Otherwise, if p | D, then u generates the normalizer 


group Ng% (Op)/(Q5O5) by Exercise 23.4, and trd(uO,) E pZp as desired. o 
For a principally polarized order (O, u), we define the positive involution 
*: B>B 
. = (43.6.8) 
a=" ap 


From now on, let (O, u) be a principally polarized order. 


Definition 43.6.9. A quaternionic multiplication (QM) structure by (O, 4) on A is 
an injective ring homomorphism 1: O — End(A) such that the induced homomor- 
phism 7: B — End(A)g respects involutions, i.e., the diagram 


B —— End(A)o 


| | (43.6.10) 
B —> End(A)g 


commutes, where * is the Rosati involution defined in (43.4.22). 

We say A has quaternionic multiplication (QM) by (O, u) if A can be equipped 
with a QM structure by (O, u), and we say that A is a QM abelian surface if it has a 
QM structure for some (O, jz). 


Writing out (43.6.10), for a QM structure ų: we require that ua)! = u(a*) for all 
æ EB. 


Definition 43.6.11. A homomorphism (A, 1) — (A’,v’) of principally polarized com- 
plex abelian surfaces with QM by (O, u) isa homomorphism ¢: A —> A’ of polarized 
abelian surfaces (respecting the polarization) that also respects 1,0’ in the sense that 
the diagram 

B — > End(A)g 


r 
End(A)g 
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commutes; an isogeny is a surjective homomorphism with finite kernel. 
QM abelian surfaces can be constructed as follows. 


43.6.12. Extend t% to a map tœ : B — Be = M2(C). Let T € H?. Let 
Ag := too(O) (i) ce. 


Then A, is a lattice in C?, since rkz O = 4; let A; := C/A, be the associated complex 
torus. The map tæ induces a natural injective ring homomorphism 4 : O —> End(A;) 
since to(O)A; € A; as O itself is closed under multiplication. Define the form 


E,:A;xA,; ~Z 
43.6.13 
EHEH) =  trd(o( x9) 


with x, y € too(O). The form E, takes values in Z by Lemma 43.6.7. 
The main result of this section is then the following theorem. 


Main Theorem 43.6.14. Let T = 1.(O')/{+1} € PSL2(R). Then the map 


(A, ¿) principally polarized 

complex abelian surfaces 
with QM by (O, u) (43.6.15) 
up to isomorphism 

Prt > [(Az, )] 


T\H? © 


is a bijection. 


The proof of this theorem occupies the rest of this section; it amounts to checking 
that various conditions and compatibilities are satisfied. The reader who is willing to 
take these as verified can profitably move along to the next section. 

We begin by verifying the Riemann relations. 


Lemma 43.6.16. The form E = E, defined in (43.6.13) or its negative E = —E; is a 
Riemann form. 


This sign ambiguity was already present above, as the involution (43.6.8) is the 
same replacing u by —u; working backwards with Lemma 43.6.16 in hand, we may fix 
the sign by replacing u by —u once and for all so that E, itself is a Riemann form for 
all T. 


Proof. E is alternating since 
trd(too(4)xx) = nrd(x) trd(u) = 0 
for all x € 4..(O). 


820 CHAPTER 43. ABELIAN SURFACES WITH QM 


Next, let 
_ 1 [Rer -ir 
e= I Rer) MeN 
Then 5 2 
—  —(Ret)* + |r| 
detlo) = 09 = ———— = 1 


(Im r)* 


so o € SL2(R), and moreover 
tT) 1 tRet-—|r/? _ ({iRet—-Imrt)\ _ fr 
oli) = maz nese lal i ali) (43.6.17) 
Therefore, for all x, y € to(O), 


r fiho) -#(e(i}()) -# ee) oe) 


= trd(to(4)(x0)YO) = ttd(oo(H)x(@@) Y) (43.6.18) 


= trd(lo(wxy) = E b (7) H) ; 


We now show that (x, y) => Eg(ix, y) is a symmetric, positive definite R-bilinear 
form on V. It is enough to verify this for x,y € ¢.(O). In this calculation, to avoid 
clutter we write u for too(4). Following as above, first we show symmetry: 


E (is pi H) = trd(u(xe)y) = trd( yo x F) 


= trd(u(y@)X) = E ff) 


using that u = —y and p = —p since they have trace zero. For positivity, we replace o 
with an expression in u in order to simplify, and then apply positivity. Since 4? = —D, 
if we let wy = u/i ND with VD € Ryo, then My = —1. Since also rong = —1, there exists 
5 € GL2(R) such that 57! 1416 = o. From the calculation that 


(43.6.19) 


m = ey! = VD", 


we obtain 


5 D- 
ND 5-15 (43.6.20) 


2 = 50! = ÖVDy BET ~ nrd(6) 


and hence 
E GH! CH) = trd(u(xo)F¥) = — trd( ux F) 
(43.6.21) 


vD z 
= —~——_ trd( x6 6x). 
nrd(6) ha 


This may look worse, but now we use positivity of * applied to xô: 


trd ladaa") = trd(x5u l Szu) = trd(uxõu l 6T) > 0. 
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It follows that E [> (7) GH) is always either positive definite or negative definite 


(depending on the sign of nrd(6)). oO 
Lemma 43.6.22. The polarization induced by E is principal. 


Proof. Let 6: A — AY be the isogeny induced by E. Then the degree of ¢ is 


deg ¢ = det (e (x i} FH) = det(trd((too(W)/D) xix ))ij 


ij 


where x; are a Z-basis for t.o(A). Thus 


N 
det(trd(Ces (H/D) T) = A dettrda TD) 
2 
2 me Y Gised Oy = aD? =1. 


o 


Lemma 43.6.23. The homomorphism u: O —> End(A-)ọQ satisfies the compatibility 
(43.6.10), and E, is the unique compatible principal polarization on A, compatible 
with r. 


Proof. Abbreviate ı = t,. Let a € O. Then the Rosati involution is uniquely defined 


by the condition 
E b (7) «o(;)] =E Lorki) JH) (43.6.24) 


for all x, y € too(O), i.e., 
trd(tao(wxua)y) = trd(tco()u(a)' xy). (43.6.25) 
Let z = (œ) and x, y € to(O). We verify that (43.6.25) holds by 


trd(to(H)Z XV) = trd(to( H)(too (U) Zto(H))xF) 
= trd(loo(f)xY Z) = trd(to(u)xZy) 


(43.6.26) 


as desired. 

To conclude, any other polarization corresponds to another positive involution of 
the forma + v~!@yvas in 43.6.2; scaling, we may take v € O such that trd(vO) c DOË. 
In the proof of Lemma 43.6.22, we see that the degree of the polarization is equal to 
nrd(v)?/D7, so it is principal if and only if nrd(v) = D, i.e., v? + D = 0. But then by 
compatibility trd(t..(4)xy) = trd(too(v)xy) for all x, y € O, which implies u = v. o 


Remark 43.6.27. Lemma 43.6.23 shows that one could be more relaxed in the definition 
of QM abelian surface in the following sense. Let A be a (not yet polarized) complex 
abelian surface, and let ı: O <> End(A) be a ring homomorphism. Then there is a 
unique principal polarization on A such that the induced Rosati involution is compatible 
with u in the sense of (43.6.10). 
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Proposition 43.6.28. Every principally polarized complex abelian surface with QM 
by (O, u) is isomorphic as such to one of the form (Az, tz) for some T € H?. 


Proof. Let (A, +) be a principally polarized complex abelian surface with QM by (O, u), 
and let A = C?/A. Then 


End(A) = {æ € Mo(C) : aA C A}. 


Therefore 1: O — End(A) extends to an inclusion B ~ M)(C). By the Skolem-— 
Noether theorem, every two inclusions are conjugate by an element of GL2(C), acting 
by an isomorphism of A; so without loss of generality, we may suppose that : extends 
tO loo. 

We claim that A = 1..(O)x for some x € C?. Indeed, A @ Q has the structure of a 
left B-module with the same dimension as B as a Q-vector space; by Exercise 7.6, it 
follows that A@Q = Bx is free as a left B-module with x € A C C2; thus A = Ix where 
I C Bisa left O-ideal. Since O is maximal and therefore hereditary, J is invertible 
as a left O-ideal, and in particular 7 is sated. Now comes strong approximation: by 
Corollary 28.5.17, since B is indefinite and Cl* Z is trivial, we conclude that J is 
principal, and therefore we can rewrite A = t.(O)x with x € C?. By Lemma 43.4.17, 
we may suppose that x = (t 1) with Im? > 0, so T € H°. 

Finally, the polarization agrees by the uniqueness in Lemma 43.6.23. oO 


We are now ready to finish the proof of Main Theorem 43.6. 14. 


Proof of Main Theorem 43.6.14. By Lemmas 43.6.16, 43.6.22, and 43.6.23, the asso- 
ciation T +> (Az, tr) yields a principally polarized complex abelian surface with QM 
by (O, u). By Proposition 43.6.28, the map is surjective. 


Next, we show the map is well-defined up to the action of I. Let y = i ] eT 
and T’ = yt. Then 


Aer) a = tn(O\er + dy! ic e? ) 


cT+d 
(43.6.29) 


=(ct + d) | to(O)y (i) =(cT+ dy Ae 


Therefore scalar multiplication by (ct + d)~! induces an isomorphism A; > Ayr of 
abelian surfaces; this map preserves the polarization by writing 


(ct + d)x (7) = xy # 


for x € t..(O), and then verifying that 


trd(uxy) = trd(u(xy)(yy)) = trd(uxyy y) = trd(uxy). 


The induced map End(A,;) — End(A;,) obtained from conjugation by a scalar matrix 
is the identity, and the compatibility for the QM is then verified by the fact that for 
a EB, 

U (a) = to(@) = (a). 
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To conclude, suppose that (Az, tr) = (Ar, lr’) with T, T’ € H+: then there exists 
@ € GL2(C) such that dA; = A; and ¢ commutes with t..(@). Since to(a) ®C = 
M2(C), we conclude that ¢ is central in M2(C) and hence scalar. From 


T T 
efi)=>:) 
we conclude y € T and then that é = cT’ + d so T = y7. o 


Remark 43.6.30. Analogous to the case SL2(Z), one may similarly define congruence 
subgroups of T!; the objects then parametrized are QM abelian surfaces equipped with 
a subgroup of torsion points. 


Remark 43.6.31. The “forgetful” map which forgets the QM structure ı gives a map of 
moduli from T\H? to A2(C), but this map is not injective: it factors via the quotient by 
the larger group I (u). In other words, the bijection of Main Theorem 43.6.14 induces a 
bijection between I ( L)\H? and the set of isomorphism classes of principally polarized 
abelian surfaces A such that A has QM by (O, u): accordingly, generically there will 
be two choices of QM structure on a surface A that can be given QM by (O, y). 


Remark 43.6.32. The results above for F = Q extend, but not in a straightforward way, 
to totally real fields F of larger degree n = [F : Q]. If A is an abelian variety with 
dim(A) = g such that A has QM by B over F, then 4n | 2g, so we must consider abelian 
varieties of dimension at least 2n. If equality g = 2n holds, then A is simple, and by 
Albert’s classification of the possible endomorphism algebras of an abelian variety, B 
is either totally definite or totally indefinite. So if t = 1, then F = Q and B is totally 
indefinite. 

Consequently, we must consider abelian varieties of larger dimension. The basic 
construction works as follows. First, one chooses an element u € O such that p? € Zr 
is totally negative. (If Zr has strict class number 1, then one has disc B = D = DZF 
with D totally positive and one can choose u satisfying u? = —D.) Let K = F(V—D); 
note that since K — B we have Bx = B r K = M2(K). Then the complex space 
XB (De parametrizes complex abelian 4n-folds with endomorphisms (QM) by Bx and 
equipped with a particular action on F on the complex differentials of B. Amazingly, 
this moduli interpretation does not depend on the choice of K; but because of this 
choice, it is not canonical as for the case F = Q. 


43.7 Real points, CM points 


Let X! = T\H?. Then X! has the structure of a complex 1-orbifold. We conclude this 
chapter with some discussion about real structures. 


43.7.1. By Example 28.6.5, there exists € € O* such that nrd(e) = —1. Then e? € O! 
and e normalizes O!, so the action of e (as in (33.3.12)) defines an anti-holomophic 
involution on X(T) that is independent of the choice of e: this gives the natural action 
of complex conjugation on X(T). 
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With respect to this real structure, and in view of Main Theorem 43.6.14, we may 
ask if there are any principally polarized abelian surfaces with QM by (O, u) with 
both the surface and the QM defined over R. When B =~ M>(Q), then the element 
e= r a acts by complex conjugation, and the real points are those points on 
the imaginary axis (the points with real j-invariant). More generally, the answer is 
provided by the following special case of a theorem of Shimura [Shi75, Theorem 0]. 


Proposition 43.7.2 (Shimura). If B is a division algebra, then X'(R) = 0. 


Proof. We follow Ogg [Ogg83, §3]. Let 1.(€) = f a If X! (R) + 0, then by 43.7.1 


there exists z € H? such that 


EE eee az +b 

PTE cz+d 
Then az + b = c|z|? + dz; taking imaginary parts we find —a Im z = d Imz so a + d = 
trde = 0 and € — 1 = O. Since B is a division algebra, we conclude € = +l, a 
contradiction. o 


It may nevertheless happen that certain quotients of X! by Atkin—Lehner involutions 
may have real points. 


Remark 43.7.3. A similar issue arises for CM elliptic curves: such a curve, together 
with all its endomorphisms, cannot be defined over R. 


Just as Y(1) = SL2(Z)\H? parametrizes isomorphism classes of elliptic curves, 
among them are countably many elliptic curves whose endomorphism algebra is larger 
than Z: these are the elliptic curves with complex multiplication, and they correspond 
to points in H? that are quadratic irrationalities, so Q(t) = K is an imaginary quadratic 
field and S = Z[t] € K is an imaginary quadratic order. By the theory of complex 
multiplication, the corresponding j-invariants are defined over the ring class field 
H 2 K with Gal(H | K) ~ Pic(S), and there is an explicit action of Gal(H | K) on this 
set. 

In a similar way, on X! we have CM points, given by complex abelian surfaces 
with extra endomorphisms, defined more precisely as follows. 


43.7.4. Let K 2 Q be an imaginary quadratic field and suppose that ix: K — B 
embeds. Let $ = K N O, so that S —> O is optimally embedded. Suppose the image 
of this embedding is given by S = Z[v] where v € O. Let T = 1, be the unique fixed 
point of tœ(v) in H?; a point of H? that arises this way is called a CM point. 

The corresponding abelian surface A; visibly has M2(S) <> End(A;), and in 
particular End(A;)g ~ M2(K) as this is as large as possible. 


43.8 » Canonical models 


In this section, we sketch the theory of canonical models for the curves X!. 
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Theorem 43.8.1 (Shimura [Shi67, Main Theorem I (3.2)]). There exists a projective, 
nonsingular curve Xo defined over Q and a holomorphic map 


p: H? > X(C) 
that induces an analytic isomorphism 


g: T'\H? > XAO). 


43.8.2. The curve X} is made canonical (unique up to isomorphism) according to the 
field of definition of CM points (see 43.7.4). 


Let z € H? be a CM point with S of discriminant D. Let H 2 K be the ring class 
extension H 2 K with Gal(H |K) = Pic(S). Then (z) € XH). Moreover, there is 
an explicit law, known as the Shimura reciprocity law, which describes the action of 
Gal(H |K) on them: to a class [c] € Pic S, we have 


tx(c)O = EO (43.8.3) 
for some é € O, and if o = Frob, € Gal(H | K) under the Artin isomorphism, then 


o((z)) = 6(€7'2). (43.8.4) 


For more detail, see Shimura [Shi67, p. 59]; and for an explicit, algorithmic version, 
see Voight [Voi2006]. 


43.8.5. Before continuing, we link back to the idelic, double-coset point of view, 
motivated in section 38.6 and given in general in section 38.7. 

The difference between a lattice in R? and a lattice in C is an identification of R? 
with C, i.e., an injective R-algebra homomorphism y: C > Endg(R*); we call y a 
complex structure. 

There is a bijection between the set of complex structures and the set CNR = H7+ 
as follows. A complex structure y: C — Endg(R?), by R-linearity is equivalently 
given by the matrix y(i) € GL2(R) satisfying wi? =-1. By rational canonical form, 


every such matrix is similar to & al i.e., there exists 8 € GL2(R) such that 


. 71/9 1 
woe | L 


and $ is well-defined up to the centralizer of a al this matrix acts by fixing 


i € H’, and it follows that this centralizer is precisely R* SO(2). Finally, we have 
GL2(R)/R* SO(2) © H?* under B & fi. 
In this way, 
X! =T'\H? o O!\H** 
© O'\(GL2(R)/R* SO(2)) (43.8.6) 
© B*\(B* /O* x GL2(R)/R* SO(2)). 
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(The final line is just an expression of the fact that B*\B* / O% is a set with one element, 
by strong approximation; it is placed there for comparison with other settings, where 
class numbers may add spice.) So the bijection (43.8.6) says that X! parametrizes 
O-lattices in B with a complex structure up to homothety. In the previous few sections, 
we showed how such lattices equipped with complex structure can be interpreted as a 
moduli space for abelian surfaces with quaternion multiplication. 


We conclude this section with some more parting comments on representability. 


Remark 43.8.7. Let Schg denote the category of schemes over Q under morphisms 
of schemes, and let Set denote the category of sets under all maps of sets. Let 
F : Scho — Set be a contravariant functor. Then X € Schg is a coarse moduli 
space for F (or X coarsely represents F) if there exists a natural transformation 
P: F(—) — Hom(-, X) which satisfies: 


(i) ©: F(k) > Hom(k, X) is bijective if k is algebraically closed (and char k = 0); 
and 

(ii) ® is universal: if (Z, ¥) is another such coarse moduli space, then there is a 
unique commutative diagram 


F (—) ——> Hom(-, Z) 


| 
13! 
y 


Hom(-, X) 


By Yoneda’s lemma, condition (ii) is equivalent to a unique (commuting) morphism 
X>Z. 

We then define a functor Fo : Scho — Set which associates to S the set of 
isomorphism classes of abelian schemes A over S—which can be thought of families 
of abelian surfaces parametrized by S—together with a map 1: O => Ends(A). 

Deligne [Del71] has shown that the functor Fo is coarsely representable by a curve 
Xo over Q. By uniqueness and the solution to the moduli problem over C, there is a 


map T R (1)\H? > XO which is in fact an analytic isomorphism. Together with the 
field of definition of CM points, we recover the canonical model (Theorem 43.8.1). 


43.9 x Modular forms 


In this final section, we sketch aspects of the theory of modular forms for arithmetic 
Fuchsian groups. 

We restore a bit more generality: let F be a totally real field of degree n = [F : Q], 
let B be a quaternion algebra over F that split at exactly one real place corresponding 
to an embedding tœ : B — Mo(R), let O C B be an order, and let T < PSL2(R) be a 
group commensurable with T!(O) = 1..(O!)/{+1}. 

Let Y = T\H?, let X = [\H*™ be its completion, and call the set X\Y the set of 
cusps. We recall the notion of orbifold from section 34.8. Then X has the structure of 
a good complex 1-orbifold with signature (g; e1,...,e,-;6): X has genus g, there are 
exactly r elliptic points P; of orders e;, and 6 cusps Q1, . . ., Qs. The hyperbolic area of 


43.9. x MODULAR FORMS 827 


X is written u(X), and can be computed as the area of a suitable fundamental domain. 
As in the introduction 43.1.6, we make the following definition. 


Definition 43.9.1. Let k € 2Zs9. A map f: H? > C is weight k-invariant under I if 


fz = (cz +d) f) forall y = (: er. (43.9.2) 


A modular form for T of weight k is a holomorphic function f: H? — C that is 
weight k invariant and is holomorphic at the cusps. 


Let M (T) be the C-vector space of modular forms for I and let 
MT) := G MT); (43.9.3) 
kE2Z>0 


then M(T) is a graded C-algebra under multiplication. 
We can understand the degree of divisors of M;(I) as follows. 


Theorem 43.9.4. For f € M;(T), we have 


1 k 
— ord = — X x 
pp #StabrG) P= a 


Proof. See Shimura [Shi71, Proposition 2.16, Theorem 2.20]. (m 
By an application of the theorem of Riemann—Roch and the description of modular 


forms behind Theorem 43.9.4, we find that dimc M;(I) can be expressed in terms of 
k and the signature of I as follows. 


Theorem 43.9.5. We have 


l, if k = 0; 
dime MT) = g + max(0, 6 — ty ifk =2; 
&—ne-D+55+> [5 (1-2) if k > 2. 

Proof. See Shimura [Shi71, Theorem 2.23]. o 


The above formulas can be proven in a different and straightforward way in the 
language of stacky curves. This description gives the following further information on 
M(T). 


Theorem 43.9.6 (Voight-Zureick-Brown [VZB2015]). Let e = max(l,e1,..., €r). 
Then the ring M(T) is generated as a C-algebra by elements of weight at most 6e with 
relations in weight at most 12e. 


(The case of signature (0; 2, 2,3,3; 0) from section 43.2 is described [VZB2015, 
Table (IVa-3)] as a weighted plane curve of degree 12 in P(6, 3, 2).) 


Remark 43.9.7. An appealing mechanism for working explicitly with modular forms 
in the absence of cusps is provided by power series expansions: for an introduction 
with computational aspects, see Voight—Willis [VW2014] and the references therein. 
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Exercises 


1. Let g = 1 and consider a period matrix IT = (w1 w2) with w1, w2 € C. Let 
E = 2 a . Show that in Definition 43.4.5 for E that the condition (i) is 
automatic and condition (ii) is equivalent to Im(w2/w1) > 0. 

2. Let II € Matgy29(C). Show that II is a period matrix for a complex torus if and 


II 
only if (a) the matrix obtained by vertically stacking II on top of its complex 


conjugate TI, is nonsingular. 

3. Let E: Ax A > Z be a Z-bilinear form that satisfies conditions (i) and (ii) 
of Definition 43.4.9 (so a Riemann form but without the condition that E is 
alternating). Show that Eg (and E) are alternating. 

4. Show that the symmetry (43.6.19) implies the equality 


e((i}o(i)) -# (GC) 


from (43.6.18) directly using Exercise 43.3 (without o). 


>5. Let A = V/A be a complex abelian variety. 


(a) Let 6: A > A’ be an isogeny with n = #ker(¢). Show that there exists a 
unique isogeny yw: A’ — A such that y o ¢ = na is multiplication by n on 
A and similarly 6 o y = ny on A’. [Hint: ker ¢ C A[n].] 

(b) Suppose that A is (not necessarily principally) polarized and let A: A —> AY 
be the map induced by (43.4.20). Using (a), show that (43.4.22) gives a 
well-defined involution on End(A) ® Q, still called the Rosati involution 
(attached to the polarized abelian variety A). 


>6. Let V be a finite-dimensional vector space over C. 


(a) Let H: VXV — Cbeanondegenerate Hermitian form on V. Let $ := Re H 
and E := Im H, so that H = S + iE for R-bilinear forms S,E: Vx V — R. 
Show that S is symmetric (i.e., S(y, x) = S(x, y)), E is alternating (i.e., 
E(y, x) = —E(x, y)), and S(x, y) = E(ix, y) for all x, y € V, and moreover 
that E (and S) are nondegenerate. 

(b) Let E be a nondegenerate alternating form on V (as an R-vector space). 
Show there exists a unique nondegenerate Hermitian form H on V with 
Im A = E if and only if E(ix,iy) = E(x, y) for all x, y € V. 

(c) Show that a Hermitian form on V is positive definite if and only if the 
corresponding symmetric form S := Re H is positive definite. 


>7. Prove Proposition 43.2.1. 

>8. In the following exercise, we do a few manipulations with generating functions, 
applied to understand the presentation of the ring of modular forms in the next 
exercise. 
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(a) Prove that 


Z = aLa 2 
kW 4 (dd -t*)-d +f) 


5 |5|“= t+ + 16 
T 4 42 _ 46 4 78 
kD» 3 1-?t*-f°+t 


[Hint: break up the sum by congruence class according to the floor and 


then use geometric series. ] 
(b) Let m2, m3 € Zso. For k € 2Zs0, define 


l, ifk =0; 
Ck = 48, if k =2; 
(k — I(g —1) + mol k/4] + mLk/3], ifk > 4. 


Show that 


y r 1 + gt? +.a4t* + agt? + agt® + gt! + 2? 
ckt” = ——)- 


— g4 — 76 
E (1-7) - rt) 


where 
a4 = 3g + m + m — 4 
aş = 4g + m + 2m3 — 6. 


>9, Prove Proposition 43.2.7 as follows. 


(a) Show that the functions f4, fe are algebraically independent. /Hint: reduce 
to the case where the relation is weighted homogeneous, and plug in z2 to 
show the relation reduces to one of smaller degree. ] 

(b) Show that any relation between f4, g6, h12 is a multiple of r. [Hint: Without 
loss of generality, we may suppose that r is of the form he, € CLfa, golaa. 
Therefore, another relation expresses hın as a rational function in fa, 86. 
Use (a) and unique factorization to show that this purported relation is in 
fact polynomial, and obtain a contradiction from the linear independence 
of fps Se hi2] 

(c) Show that the subring of M(I'') generated by f4, g6, h12 is isomorphic to 


M’ = CL fa, 86, h121/ (fa 86» h12))- 
(d) Show that 


1+2! 


; lyk — 
2, Gime MTD = aa 


kE2Z>0 


and dimç Mx = dime M; for all k. [Hint: use Exercise 43.8.] 
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(f) Conclude that the subring of M(I') generated by f4, g6, hi2 is equal to 
M(1!). [Hint: Suppose that equality does not hold, and consider the min- 
imal degree with a new generator; by dimensions, there must be a new 
relation, but argue that this relation must be among fa, g6, h\2, contradict- 


ing (b).] 


This chapter is licensed under the terms of the Creative Commons Attribution- 
NonCommercial 4.0 International License (http://creativecommons.org/licenses/by- 
nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and 
reproduction in any medium or format, as long as you give appropriate credit to the 
original author(s) and the source, provide a link to the Creative Commons license and 
indicate if changes were made. 

The images or other third party material in this chapter are included in the chapter’s 
Creative Commons license, unless indicated otherwise in a credit line to the material. If 
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use is not permitted by statutory regulation or exceeds the permitted use, you will need 
to obtain permission directly from the copyright holder. 
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„b 
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involution, 36 
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H fundamental set, 631 
j conjugate transpose (of a complex matrix), 27 
(ai, ..., an) diagonal quadratic form on 41, ..., an, 51 
() additive homomorphism of a local field, 515 
,b 
E | [a,b | F) quaternion algebra defined by a, b € F*, char F = 2, 86 
| | absolute value, 185 
te p-adic absolute value, 181 
2Dn binary dihedral (dicyclic) group of order 2n, 176 
2I binary icosahedral group, 175 
20, S4 binary octahedral group, 174 
2T, A4 binary tetrahedral group, 168 
[M : N]R index of R-lattices M,N C V, 146 
æ, elements (of an algebra), 23 
A F-algebra, 39 
a,b fractional ideals (of a noetherian domain), 140 
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ann V annihilator of a (left) module V, 104 
Autr B automorphism group of an F-algebra B, 22 
B adele ring of the algebra B over a global field, 462 
B algebra, 21 
B? reduced trace zero elements of B, 38 
B! reduced norm one elements of B, 38 
B* idele group of an algebra B over a global field, 462 
bd boundary, 610 
B finite adele ring of the algebra B over a global field, 462 
BX finite idele group of an algebra B over a global field, 462 
Bo opposite algebra of the algebra B, 36 
orthogonal direct sum, 51 
BrF Brauer group of a field F, 127 
BrtCl O Brandt class groupoid of the order O, 313 
C field of complex numbers, 21 
Cp(A) centralizer of a subalgebra A C B, 110 
CF idele class group of the global field F, 464 
char F characteristic of a field F, 22 
cl closure, 631 
cif? Q even Clifford algebra of a quadratic form Q, 72 
cif! Q odd Clifford bimodule of a quadratic form Q, 72 
Clif Q Clifford algebra of a quadratic form Q, 69 
Clenco) R class group of R attached to the normalizer of O, 485 
Clea) R class group of R attached to the order O, 473 
ClQ class set of a quadratic module Q, 149 
Clsr O, CIsO (right) class set of an R-order O, 281 
Ch cyclic group of order n, 176 
covol covolume, 284 
CSA(F) set of isomorphism classes of central simple F-algebras, 126 
D division ring, 22 
D? hyperbolic unit disc, 618 
D? hyperbolic unit ball, 669 
d(&@1,...,@n) discriminant of elements in a semisimple algebra B, 242 
D(B) absolute discriminant of the algebra B, 517 
dimp V dimension of an F-vector space V, 21 
disc B discriminant of a quaternion algebra B, 218 
disc(/) discriminant of an R-lattice 7, 242 
disc(Q) discriminant of a quadratic form Q, 52 
discr B R-discriminant of a quaternion algebra B, 230 
Dn dihedral group of (even) order n, 176 
e quadratic nonresidue, 190 
E=Eso elements conjugating a fixed embedding into optimal embeddings 


S — O, 546 


ToN), Pi) 
T(N) 

Tr(s), Fe(s) 
Gen O 

Gen Q 
GN(O) 
G(O) 
GO(Q)(F) 
Gor(O) 

G 
GSO(Q)(F) 


h, I4, I6, Lio 
Iœ) 
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E= Es 6 elements conjugating a fixed embedding into adelic optimal embed- 
dings S — O, 548 

Ex, Gk Eisenstein series, 745 
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the quadratic order S into the quaternion order O, 548 

Embr(S,O) set of optimal embeddings S <+ O of the quadratic order S into the 
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E” Euclidean n-space, 606 

Emb(S,O;T) set of T-conjugacy classes of optimal embeddings S <> O, 546 

F field, 21 

FO product norm | ideles, 464 

F” algebraic closure of the field F, 21 

fY Fourier transform of Schwartz function f, 504 

f(O) conductor of the quaternion R-order O, 397 

So elements of F positive at the real places v € Q, 233 
FSP separable closure of F, 93 
Gal(K | F) Galois group of the Galois extension of fields K 2 F, 24 


standard congruence subgroups of level N, 658 

full congruence subgroup of level N, 658 

standard archimedean T -factors, 448 

genus of an R-order O, 282 

genus of a quadratic module Q, 149 

idelic reduced norm group attached to the normalizer of O, 485 
idelic reduced norm group attached to O, 473 

general orthogonal group of Q, 49 

Gorenstein saturation of the quaternion R-order O, 397 
linear algebraic group, 723 

general special orthogonal group of a quadratic form Q, 55 
(real) Hamiltonians, 23 

pure Hamiltonians, 28 

unit Hamiltonians, 27 

upper half-plane, 608 

union of upper and lower half-planes, 609 

completed upper half-plane, 610 

upper half-space, 663 

product of hyperbolic planes and spaces, 718 
hyperbolic n-space, 670 

hyperbolic plane, 73 

class number of S, 548 

lattice (over a domain, in an algebra), 156 

Igusa invariants, 816 

axis of a € H!, 29 


834 


Id(O) 
J~! 
i, j 


M,N 

m(a, @, @3) 
mass(Cls O) 
M, (A) 


SZ Sle els 
w Vx x 
x 


5 
a 


O 


0X 
OLCI), Or) 
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group of invertible two-sided fractional O-ideals, 298 
quasi-inverse of a lattice 7, 265 

standard generators for a quaternion algebra, 23 

interior, 631 

dual of the R-lattice 7 with respect to the reduced trace, 250 
automorphy factor, 749 

K D F quadratic algebra, 39 

residue field, 186 

projective R-module of rank 1, 149 

Lorentz hyperboloid, 620 

Lorentz model of hyperbolic 3-space, 670 

lattice, 657 

left regular representation, 24 

mass correction term at a prime p for the quaternion order O, 423 
dual of the the invertible R-module L, 245 

Lobachevsky function, 672 

length of the path upsilon, 606 

maximal ideal (of a domain), 143 

lattices (over a domain, inside a vector space), 141 

box product of elements in a quaternion algebra, 247 

mass of the class set of a quaternion order O, 432 

ring of n x n-matrices over a ring A, 21 

localization of an R-lattice M at p, 143 

completion of an R-lattice M at p, 146 

number of T-conjugacy classes of optimal adelic embeddings S= 0, 
548 

number of I’-conjugacy classes of optimal embeddings $ <> O, 547 
adelic (product) measure, 521 

polarization (on a quaternion order), 817 

idelic (product) measure on Q*, 506 

standard Haar measure on Qp, 505 

normalized Haar measure on Q% , 506 

absolute norm, 264 

normalizer in B*, 177 

algebra norm, 39 

reduced norm, 38 

order (in an algebra), 155 

profinite completion of the order O over a global ring, 462 
finite idele group of the order O over a global ring, 462 
left, right order of a lattice 7 (in an algebra), 155 

set of real, ramified places of a quaternion algebra B, 233 
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O(n) 

o! 
O(Q)(F) 
p 

P 

P(A) 
PG 


Pic 

Pic(O, O’) 
Picr(O) 
PIdl(O) 
wT 

PIF 


p 


Oo eeless 


XQ 


~ 
v 


tS a a lO OOO Ono 


= s 
o 2 
aa 
BS > 


Ram B 


orthogonal group, 29 

radical idealizer of the local R-order O, 403 
orthogonal group of Q, 49 

prime ideal (of a domain), 143 

maximal (two-sided) ideal of valuation ring, 201 
projective line over a commutative ring A, 439 
the group G modulo scalars, 594 

Euler totient function, 415 

big period matrix, 809 

uniformizer, 186 

Picard group, 298 

set of homothety classes of lattices with left, right order O, O’, 313 
Picard group of the R-order O, 303 

group of principal two-sided fractional O-ideals, 298 
nontrivial normalizer of an Eichler order, 376 
set of places of a global field F, 227 

maximal ideal (of a valuation ring), 186 
Artin—Schreier group, 191 

standard function on the algebra B, the characteristic function of a 
maximal order, 518 

Dedekind w-function, 423 

standard unitary character, 512 

rational idele group, 460 

field of rational numbers, 21 

product of completions of Q at S, 459 

quadratic form, 49 

quaternion group of order 8, 166 

profinite completion of Q, 460 

projection of rational adele ring away from S, 459 
field of p-adic numbers, 181 

rational adele ring, 458 

field of real numbers, 21 

(commutative) noetherian domain, 140 
valuation ring, 186 

radical of a quadratic form, 53 

Jacobson radical of a ring A, 326 

Jacobson radical of an algebra B, 103 
ramification set of a quaternion algebra B, 218 
reversal map (on a Clifford algebra), 71 

distance (in a metric space), 606 

localization of a domain R at p, 143 

completion of a domain R at p, 146 

ring of S-integers of a global field, 229 


sgndisc(Q) 
SO(n) 
SO(Q)(F) 
Stabg(x) 
StCl O 
SU(n) 


aah NAA 


Ten? V 

Ten V 

© 

O(4, v), O, £) 
T(n) 

Tr 

trd 

TypO 


v 
Us 

V 
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set of places of a global field, 229 

quadratic R-algebra, 206 

signed discriminant of a quadratic form Q, 52 
symmetric group on n letters, 166 

special orthogonal group, 29 

special orthogonal group of the quadratic form Q, 55 
stabilizer of x € X under the action by the group G, 630 
stable (or locally free) class group, 337 

special unitary group, 27 

Bruhat-—Tits tree, 388 

associated (symmetric) bilinear form, 49 

Tamagawa measure, 522 

self-dual measure, 511 

reflection in x, 56 

even tensor algebra of a vector space V, 71 

tensor algebra of a vector space V, 69 

Jacobi theta function, 504 

interior angle of X at the vertex v or edge £, 700 
n-Brandt matrix, 763 

algebra trace, 39 

reduced trace, 38 

type set of an R-order O, 282 

transpose (of a matrix), 27 

path, 606 

open subset of the adeles, 458 

finite-dimensional vector space, 141 

valuation, 186 

Dirichlet domain for I centered at zg, 685 

unit group modular scalars of the left order of the R-lattice J, 415 
Hurwitz element, 165 

Weierstrass g-function, 747 

metric space, 606 

completed zeta function of the quaternion algebra B, 449 
ring of integers, 21 

center of an algebra B, 21 

local zeta function of the algebra B with respect to the function ®, 
218 

orientation of a quadratic form, 78 

zeta function of the quaternion algebra B, 421 
partial zeta function of the number field K, 417 
Dedekind zeta function of the number field K, 417 
zeta function of the quaternion order O, 420 
Riemann zeta function, 416 
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ZF ring of integers of a number field F, 229 

Z(i, j) Lipschitz order, 165 

Zp ring of p-adic integers, 181 

¢*(a) leading coefficient in Laurent series expansion of ¢ at s = a, 434 
Z profinite completion of Z, 457 
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